Publication Date
In 2025 | 67 |
Since 2024 | 824 |
Since 2021 (last 5 years) | 3138 |
Since 2016 (last 10 years) | 6928 |
Since 2006 (last 20 years) | 13210 |
Descriptor
Test Reliability | 9315 |
Reliability | 7369 |
Test Validity | 6537 |
Foreign Countries | 6311 |
Measures (Individuals) | 3419 |
Validity | 3285 |
Factor Analysis | 3208 |
Test Construction | 2970 |
Psychometrics | 2955 |
Interrater Reliability | 2502 |
Correlation | 2428 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 395 |
Practitioners | 224 |
Teachers | 104 |
Administrators | 58 |
Counselors | 22 |
Policymakers | 13 |
Media Staff | 5 |
Students | 5 |
Parents | 2 |
Location
Turkey | 1236 |
Australia | 368 |
China | 314 |
Canada | 298 |
United Kingdom | 222 |
Indonesia | 212 |
Taiwan | 210 |
United States | 204 |
Netherlands | 201 |
Spain | 192 |
Germany | 178 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 3 |
Meets WWC Standards with or without Reservations | 4 |
Does not meet standards | 5 |

McLeod, P. J. – Evaluation and the Health Professions, 1991
Faculty opinions of an evaluation program for medical school clinical tutors were obtained through a survey of 24 undergraduate clinical tutors. Although students had been using the evaluation instrument to rate teachers for five years, faculty expressed many reservations about its reliability and validity. (SLD)
Descriptors: Clinical Teaching (Health Professions), Evaluation Methods, Higher Education, Medical Education

Bontempo, Robert – Journal of Cross-Cultural Psychology, 1993
Describes a method for assessing the quality of translations based on item response theory (IRT). Results from the IRT technique with French and Chinese versions of a scale measuring individualism-collectivism for samples of 250 U.S., 357 French, and 290 Chinese undergraduates show how several biased items are detected. (SLD)
Descriptors: Chinese, Comparative Testing, Cross Cultural Studies, Foreign Countries

Buchmann, Margret; Floden, Robert E. – Educational Researcher, 1992
Among concepts that seem to be the guardian angels of school reform, coherence is a rebel angel, advancing human learning, but escaping control. Coherence must not be confused with consistency. It allows for change and imagination but remains true to concepts and experiences that construct coherence without fabricating consistency. (SLD)
Descriptors: Coherence, Comprehension, Educational Change, Educational Planning

Bers, Trudy H.; Smith, Kerry E. – Community College Review, 1990
Describes a study of the validity and reliability of a writing skills assessment test taken by 4,284 2-year college students in 1986-87. Assesses interrater reliability, influences of nonperformance factors (e.g., gender, native language, and form of test), predictive validity of test for future performance, and implications of findings. (DMM)
Descriptors: Basic Writing, Community Colleges, High Risk Students, Predictive Validity

Moss, Pamela A. – Educational Researcher, 1994
The assumption that reliability is a necessary but insufficient condition for validity in assessment is challenged by exploring a dialectic between psychometric and hermeneutic approaches to drawing and warranting interpretations of human products of performance. Hermeneutic alternatives for epistemological and ethical purposes expand the range of…
Descriptors: Educational Assessment, Educational Research, Epistemology, Ethics

Stokes, Julie E.; And Others – Journal of Black Psychology, 1994
This paper investigates the psychometric properties of the African Self-Consciousness (ASC) Scale in a noncollege heterogeneous population of 147 African Americans to determine the reliability and validity of the ASC Scale. Based on analysis of the scale's reliability, factor structure, and construct validity, the study shows the ASC Scale to be a…
Descriptors: Behavior Rating Scales, Behavioral Science Research, Blacks, Construct Validity

Merrell, Kenneth W. – School Psychology Review, 1993
Constructed School Social Behavior Scales (SSBS) to include teacher-related and peer-related forms of social competence and antisocial behavior. Standardized SSBS using teacher ratings on 1,858 kindergarten through grade 12 students across United States Evidence presented from several related studies in present investigation indicated that SSBS…
Descriptors: Antisocial Behavior, Behavior Rating Scales, Elementary School Students, Elementary Secondary Education

Ghuman, Jaswinder Kaur; Peebles, Claire D.; Ghuman, Harinder Singh – Infants and Young Children, 1998
A review of 36 social interaction measures found that there are no measures available to evaluate infants and preschool children's basic capacity for social interaction. The available measures are described and grouped into parent-child interaction, social skills, social competence, play, adaptive behavior, communication, general development, and…
Descriptors: Adaptive Behavior (of Disabled), Behavior Problems, Emotional Disturbances, Evaluation Methods

O'Neil, Harold F.; Abedi, Jamal – Journal of Educational Research, 1996
Describes research on the development of a measure of student metacognition. The brief, domain-independent measure serves as a collateral measure in construct validation, supporting exploration of the self-regulatory demands of performance assessment. Results show that metacognition can be directly and explicitly measured in the context of…
Descriptors: Alternative Assessment, Cognitive Ability, College Students, Elementary Secondary Education

Cillessen, Antonius H. N.; Bellmore, Amy D. – Merrill-Palmer Quarterly, 1999
Examined the role of fourth graders' social self-perceptions in their social development. Found significant relationships among self-perception measures, and they were moderately stable over the school year. Found significant sociometric status and gender effects for generalized and dyadic perceptions. Inaccurate social self-perceptions predicted…
Descriptors: Childhood Attitudes, Children, Comparative Analysis, Intermediate Grades
Paschall, Mallie J.; Fishbein, Diana H.; Hubal, Robert C.; Eldreth, Diana – Health Education Research, 2005
This study examined the psychometric properties of performance measures for three novel, interactive virtual reality vignette exercises developed to assess social competency skills of at-risk adolescents. Performance data were collected from 117 African-American male 15-17 year olds. Data for 18 performance measures were obtained, based on…
Descriptors: Interpersonal Communication, Computer Simulation, Drug Use, Validity
Kane, Thomas J.; Staiger, Douglas O. – Brookings Papers on Education Policy, 2002
By the spring of 2000, forty states had begun using student test scores to rate school performance. Twenty states have gone a step further and are attaching explicit monetary rewards or sanctions to a school's test performance. In this paper, the authors focus on accountability programs in which states measure the effectiveness of individual…
Descriptors: Elementary Schools, Accountability, Scores, Risk
Gorsuch, Greta – CALICO Journal, 2004
In this study, retrospective interviews were used to investigate reliability (and thus validity) threats to a computerized ESL listening comprehension test administered at a university in the US. The participants in the investigation, six international graduate students, were asked to respond to semi- and open-ended questions during individual…
Descriptors: Graduate Students, Listening Comprehension, Investigations, Listening Comprehension Tests
Pike, Gary R. – Assessment Update, 2004
Recently the Educational Testing Service (ETS) has modified its Student Instructional Report II (SIR II) for use in online distance education courses. The SIR II is a second-generation survey based on more than thirty years of experience with student evaluations (Centra, 1998; Centra and Gaubatz, n.d.). The e-SIR II is based on the highly…
Descriptors: Student Evaluation, Distance Education, Educational Testing, Prior Learning
VanDerHeyden, Amanda M.; Witt, Joseph C.; Naquin, Gale – School Psychology Review, 2003
This article describes efforts to examine the validity of a screening process that provides objective data for multidisciplinary team meetings where consideration is being given to teacher referral of a student for assessment and possible placement in special education. In this study, the accuracy with which this process, called Problem Validation…
Descriptors: Learning Disabilities, Achievement Tests, Grade 2, Special Education