Publication Date
In 2025 | 67 |
Since 2024 | 901 |
Since 2021 (last 5 years) | 3415 |
Since 2016 (last 10 years) | 7595 |
Since 2006 (last 20 years) | 14761 |
Descriptor
Test Reliability | 14547 |
Test Validity | 9865 |
Reliability | 9544 |
Foreign Countries | 6751 |
Test Construction | 4608 |
Validity | 4120 |
Measures (Individuals) | 3750 |
Factor Analysis | 3720 |
Psychometrics | 3393 |
Interrater Reliability | 3054 |
Correlation | 3009 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 703 |
Practitioners | 447 |
Teachers | 204 |
Administrators | 121 |
Policymakers | 62 |
Counselors | 42 |
Students | 37 |
Parents | 11 |
Community | 7 |
Media Staff | 5 |
Support Staff | 5 |
More ▼ |
Location
Turkey | 1246 |
Australia | 428 |
Canada | 371 |
China | 329 |
United States | 264 |
United Kingdom | 246 |
Taiwan | 221 |
Netherlands | 217 |
Indonesia | 214 |
California | 208 |
Spain | 201 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 8 |
Meets WWC Standards with or without Reservations | 9 |
Does not meet standards | 6 |
Miller, M. David – 2002
In 1994 the State Collaborative on Assessment and Student Standards of the Council of Chief State School Officers began a study to examine the generalizability of performance-based assessments (PBAs) for state-mandated assessment programs. The intent was to examine the major sources of error associated with PBAs and the generalizability and…
Descriptors: Elementary Secondary Education, Error of Measurement, Generalizability Theory, Performance Based Assessment
Gardner, John; Cowan, Pamela – 2000
The Transfer Procedure Test is taken by children around 11 years of age who wish to attend grammar schools in Northern Ireland. It is a high stakes test in that children are only allowed one attempt and their performance determines their future schooling in a manner that is not of their choice or of their parents. Candidates usually take two test…
Descriptors: Admission (School), Elementary Secondary Education, Foreign Countries, High Stakes Tests
Dickinson, David K.; McCabe, Allyssa; Sprague, Kim – 2001
The Teacher Rating of Oral Language and Literacy (TROLL) is an instrument that measures skills identified as critical in the New Standards for Speaking and Listening. In 5 to 10 minutes and without prior training, teachers can assess an individual child's current standing with respect to skills that research has identified as critical for literary…
Descriptors: Early Childhood Education, Language Skills, Language Tests, Literacy
Basturk, Ramazan; Loadman, William E. – Online Submission, 2001
Purpose: The purpose of this study is to assess and evaluate the grant selection process for reading excellence program in Ohio. School districts in Ohio were given the opportunity to apply for funding to support district based reading programs through a request for proposal procedures. An effort was made to reliably and equitably score the…
Descriptors: Reading Programs, Tutors, Statistics, Grants
McQueen, Joy; Congdon, Peter J. – 1997
A study was conducted to investigate the stability of rater severity over an extended rating period. Multifaceted Rasch analysis was applied to ratings of writing performances of 8,285 primary school (elementary) students. Each performance was rated on two performance dimensions by two trained raters over a period of 7 rating days. Performances…
Descriptors: Educational Assessment, Elementary Education, Elementary School Students, Foreign Countries
Yorke, Mantz – 1997
This paper examines the purpose, validity, and reliability of performance indicators in higher education, focusing on the experience of higher education institutions in the United Kingdom. Specifically, it studies the role of student entry and exit performance, teaching and staff quality, retention and completion, and placement in employment as…
Descriptors: Educational Policy, Foreign Countries, Higher Education, Institutional Evaluation
Scheuren, Fritz; Li, Bonnie – 1996
This report provides empirical results of attempts to achieve consistency of estimates between two National Center for Education Statistics (NCES) surveys, the 1993-94 Private School Survey (PSS) and the Schools and Staffing Survey (SASS). Comparisons are made among statistical and computational procedures that may achieve the desired consistency…
Descriptors: Classification, Elementary Secondary Education, Estimation (Mathematics), Least Squares Statistics
Edman, Laird R. O.; Bart, William M.; Robey, Jennifer; Silverman, Jenzi – 2000
The Minnesota Test of Critical Thinking (MTCT) has been designed to measure both critical thinking (CT) skills and a key disposition of critical reasoning: the willingness to evaluate arguments that are congruent with one's own goals and beliefs critically. The MTCT uses a taxonomy of CT skills derived from the American Philosophical Association's…
Descriptors: Critical Thinking, Factor Analysis, Factor Structure, Higher Education
Sciutto, Mark J.; Terjesen, Mark D. – 2000
This study examined the psychometric and technical characteristics of various measures of attention deficit hyperactivity disorder (ADHD) that are commonly used with preschool-aged children. Information on reliability, validity, norms, and scale-specific features was gathered from the test manuals of four commonly used behavior rating scales: (1)…
Descriptors: Attention Deficit Disorders, Diagnostic Tests, Hyperactivity, Norms
Bastick, Tony – 1999
The purpose of this paper is to report a successful technique for assessing cooperative group work reliably and validly. The paper demonstrates a simple-to-use assessment procedure that tracks individual accountability, energizes student interaction, and rewards cooperative learning, even as it uses fewer administrative resources than traditional…
Descriptors: Accountability, Cooperative Learning, Criteria, Evaluation Methods
Bastick, Tony – 1999
This paper aims to make the techniques of cooperative learning more attractive to teachers by presenting a method of assessment that avoids the drawbacks associated with trying to extract valid and reliable individual marks from cooperative performances. The paper presents an easy-to-use method of assessing an individual's contribution to a…
Descriptors: Accountability, Cooperative Learning, Criteria, Evaluation Methods
Crehan, Kevin D.; Hess, Robert K.; D'Agostino, Jerome V. – 2000
This paper focuses on teacher testing issues related to job analysis, test specification development, reliability, and validity. It emphasizes the conceptualization and operational definition of appropriate validity evidence to assess the quality of licensure testing decisions. It is suggested that the process of job, or practice, analysis would…
Descriptors: Cognitive Processes, Job Analysis, Licensing Examinations (Professions), Reliability
Floreck, Lisa M.; De Champlain, Andre F.; Kaplan, David – 2001
The purpose of the current study was to use multilevel modeling to quantify and explain the sources of score variation in standardized patient (SP) encounters. Through laypersons trained to portray SPs and record medical student actions, SP examinations allow the measurement of examinees' clinical and interpersonal skills. In this study, the SP…
Descriptors: Clinical Experience, Computer Software, Licensing Examinations (Professions), Patients
Klein, Davina C. D.; Chung, Gregory K. W. K.; Osmundson, Ellen; Herl, Howard E.; O'Neil, Harold F., Jr. – 2002
Knowledge mapping is expected to measure deep conceptual understanding and allow students to characterize relationships among concepts in a domain visually. This research examined the validity of knowledge mapping as an assessment tool in science. The approach to investigating this validity was three-pronged. First, a model was outlined for the…
Descriptors: Comprehension, Elementary School Students, Intermediate Grades, Multitrait Multimethod Techniques
Manalo, Jonathan R.; Wolfe, Edward W. – 2000
Recently, the Test of English as a Foreign Language (TOEFL) changed by including a writing section that gives the examinee an option between computer and handwritten formats to compose their responses. Unfortunately, this may introduce several potential sources of error that might reduce the reliability and validity of the scores. The seriousness…
Descriptors: Computer Assisted Testing, Essay Tests, Evaluators, Handwriting