Publication Date
In 2025 | 67 |
Since 2024 | 901 |
Since 2021 (last 5 years) | 3415 |
Since 2016 (last 10 years) | 7595 |
Since 2006 (last 20 years) | 14761 |
Descriptor
Test Reliability | 14547 |
Test Validity | 9865 |
Reliability | 9544 |
Foreign Countries | 6751 |
Test Construction | 4608 |
Validity | 4120 |
Measures (Individuals) | 3750 |
Factor Analysis | 3720 |
Psychometrics | 3393 |
Interrater Reliability | 3054 |
Correlation | 3009 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 703 |
Practitioners | 447 |
Teachers | 204 |
Administrators | 121 |
Policymakers | 62 |
Counselors | 42 |
Students | 37 |
Parents | 11 |
Community | 7 |
Media Staff | 5 |
Support Staff | 5 |
More ▼ |
Location
Turkey | 1246 |
Australia | 428 |
Canada | 371 |
China | 329 |
United States | 264 |
United Kingdom | 246 |
Taiwan | 221 |
Netherlands | 217 |
Indonesia | 214 |
California | 208 |
Spain | 201 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 8 |
Meets WWC Standards with or without Reservations | 9 |
Does not meet standards | 6 |
WEITZ, HENRY – 1967
COUNSELORS OFTEN ADMINISTER TESTS OF QUESTIONABLE VALIDITY. IN RELIABILITY STUDIES, EVERY PRECAUTION IS TAKEN TO STABILIZE THE STIMULUS SITUATION. IN ASSESSING VALIDITY, CONCERN CENTERS ON BEHAVIOR UNDER DIFFERENT STIMULUS CONDITIONS. CRONBACH'S THEORETICAL LIMIT FOR A VALIDITY COEFFICIENT OF A TEST IS THE SQUARE ROOT OF THE RELIABILITY…
Descriptors: Aptitude Tests, Career Counseling, Counseling, Counseling Objectives
KYME, GEORGE – 1967
DEFINING MUSICALITY AS THE ABILITY TO GRASP A MUSICAL IDEA IN ITS TOTALITY, THIS RESEARCH INVESTIGATED THE RELATIVE EFFECTIVENESS OF MUSICAL PERFORMANCE (BOTH ORCHESTRAL AND CHORAL), GUIDED LISTENING, MUSIC READING, AND MUSICAL COMPOSITION AS MEANS OF DEVELOPING SUCH MUSICALITY. THE INSTRUMENT OF EVALUATION WAS A TEST OF AESTHETIC JUDGMENTS IN…
Descriptors: Bibliographies, Creative Activities, Curriculum Development, Evaluation
Anderson, Beverly L.; And Others – 1980
Before selecting an achievement test, it is essential to determine its purpose. There are eight purposes, relating to three educational decision making contexts: (1) instructional managerial (diagnosis, course placement, career guidance); (2) screening (selection and certification) and (3) programmatic (survey assessment, formative evaluation,…
Descriptors: Achievement Tests, Admission Criteria, Basic Skills, Daily Living Skills
Ross, G. Robert – 1977
A set of eight widely used inductive reasoning tests were investigated to determine whether or not they have different factorial structures. The eight inductive tests and three deductive tests, taken from the French Kit of Reference Tests for Cognitive Factors and the Watson-Glaser Critical Thinking Appraisal, were administered to 157 high school…
Descriptors: Abstract Reasoning, Cognitive Tests, Comparative Testing, Deduction
Crocker, Linda; And Others – 1979
The relationship between children's performance on a standardized achievement battery and compositional writing performance was examined. One hundred thirty-eight writing samples were collected from fourth-grade students. Compositions were scored for both mechanistic and holistic qualities. Four subscores on the Metropolitan Achievement Test were…
Descriptors: Achievement Tests, Creative Writing, Essay Tests, Expository Writing
Goodman, Marvin; Mina, Elias – 1977
Variability in diagnostic procedures and a lack of valid and reliable measures led to the development of a comprehensive battery, which incorporated an operational definition of learning disabilities. The battery consisted of forms for observing these functions: intelligence, academic achievement, gross and fine motor control, visual perception,…
Descriptors: Cognitive Processes, Diagnostic Tests, Educational Diagnosis, Educational Testing
Miles, David T. – 1968
The purpose of this first phase of a continuing research program was the development of a test of creative problem solving in general design. A design class of 186 members was divided into an experimental and control group; a non-design control group (an educational psychology class) of 45 was also tested. Multivariate interpretation of creative…
Descriptors: Cognitive Processes, Cognitive Tests, Creative Thinking, Creativity

Littlefield, John H.; And Others – 1977
Generalizability theory extends previous methods of estimating the reliability of rating instruments such that one can estimate the precision of a measurement system for differentiating among students, scales, or any other important dimension. In this study, generalizability theory is applied to faculty ratings of junior and senior dental students…
Descriptors: Analysis of Variance, Clinical Experience, College Faculty, Data Collection
Halasa, Ofelia – 1977
A bilingual rating scale was constructed to determine teachers' ratings of attitude and proficiency among Anglo and Spanish children in Title VII classes. This instrument was designed to ascertain how teachers perceive the pupils in their classroom and how two teachers representing different backgrounds perceive children of similar and different…
Descriptors: Bilingual Education, Bilingual Students, Bilingual Teachers, Bilingualism
Shuford, Emir H., Jr.; Brown, Thomas A. – 1974
A student's choice of an answer to a test question is a coarse measure of his knowledge about the subject matter of the question. Much finer measurement might be achieved if the student were asked to estimate, for each possible answer, the probability that it is the correct one. Such a procedure could yield two classes of benefits: (a) students…
Descriptors: Bias, Computer Programs, Confidence Testing, Decision Making
Moore, Shirley G. – 1973
A picture-board sociometric interview for preschool children is described and information on reliability and validity is presented. While the reliability of this measure of peer popularity is low to moderate using test-retest or split-half measures, the interview data does seem to predict other relevant measures of the social behavior of young…
Descriptors: Group Status, Guides, Interaction, Interviews
Morrow, William R. – 1974
The primary aim of this research project was to test the hypothesis that successful teacher- and parent-mediated direct modification, by operant techniques, of youngsters' deviant behavior would tend to be followed by significant positive changes in the youngsters' self-concepts. Two studies were done. In the first, focusing on teacher-mediated…
Descriptors: Attitude Change, Behavior Change, Behavior Problems, Change Agents
Schaefer, Earl S. – 1975
This paper reports the development of a Classroom Behavior Inventory and a series of studies which have developed and refined methods for collecting teacher ratings of children's social, emotional and task-oriented behavior from preschool through high school. Findings suggest that the Classroom Behavior Inventory is a relatively economical,…
Descriptors: Academic Achievement, Behavior Rating Scales, Classroom Observation Techniques, Early Childhood Education
Guilliams, Clark I. – 1975
Chicano and Amerindian vocabulary scale responses from the Stanford-Binet (LM) and Wechsler Intelligence Scale for Children were item-analyzed for 1,009 subjects. The response patterns differed both by ethnic group and test, as well as by age. The most common, and recurring, pattern found was "level-of-difficulty" gradient…
Descriptors: American Indians, Correlation, Disadvantaged, Elementary Education
Fleming, Dan B. – 1974
In this paper, the author reviews a number of criticisms that have been made of the use of standardized tests in the social studies and reviews 11 general, skills, and discipline-oriented standardized tests for social studies. Standardized tests evaluating the results of the new social studies courses are open to criticism on the basis of…
Descriptors: Evaluation Methods, Measurement Techniques, National Competency Tests, Predictive Measurement