ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	7
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	48

Descriptor

Criterion Referenced Tests	378
Test Reliability	378
Test Validity	237
Test Construction	163
Norm Referenced Tests	92
Item Analysis	64
Test Interpretation	55
Elementary Secondary Education	54
Achievement Tests	50
Testing Problems	48
Cutting Scores	47
Mastery Tests	47
Statistical Analysis	47
Measurement Techniques	46
Test Items	42
Testing	39
Scores	38
Student Evaluation	38
Evaluation Methods	37
Higher Education	36
Standardized Tests	36
Reading Tests	34
Comparative Analysis	32
Mathematical Models	31
Error of Measurement	30

Education Level

Higher Education	19
Postsecondary Education	12
Early Childhood Education	7
Elementary Education	7
Secondary Education	6
Elementary Secondary Education	4
Grade 8	4
Grade 1	3
Grade 3	3
High Schools	3
Preschool Education	3
Grade 2	2
Grade 5	2
Junior High Schools	2
Kindergarten	2
Middle Schools	2
Primary Education	2
Grade 4	1
Grade 6	1
Grade 7	1
Grade 9	1
Intermediate Grades	1

Audience

Researchers	15
Practitioners	14
Teachers	7
Administrators	3
Parents	2
Counselors	1
Students	1
Support Staff	1

Location

Australia	8
Illinois	4
Florida	3
Georgia	3
Tennessee	3
Texas	3
Canada	2
Colorado	2
Iran	2
Michigan	2
Minnesota (Saint Paul)	2
New York	2
North Carolina	2
Taiwan	2
Arizona (Phoenix)	1
Arkansas	1
Costa Rica	1
Delaware	1
Hawaii	1
Iowa	1
Jordan	1
Kansas	1
Kentucky	1
Louisiana	1
Massachusetts	1

Laws, Policies, & Programs

Elementary and Secondary…	3
Elementary and Secondary…	2
No Child Left Behind Act 2001	2
Early Head Start	1
Every Student Succeeds Act…	1
Individuals with Disabilities…	1
Race to the Top	1

What Works Clearinghouse Rating

Filter: Test Reliability

Showing 31 to 45 of 378 results

Mathematics Curriculum Based Measurement to Predict State Test Performance: A Comparison of Measures and Methods

Stevens, Olinger; Leigh, Erika – ProQuest LLC, 2012

Scope and Method of Study: The purpose of the study is to use an empirical approach to identify a simple, economical, efficient, and technically adequate performance measure that teachers can use to assess student growth in mathematics. The current study has been designed to expand the body of research for math CBM to further examine technical…

Descriptors: Mathematics Instruction, Evaluation Methods, Student Evaluation, Measurement Techniques

Test Review: Beaver, J. M., & Carter, M. A. (2006). "The Developmental Reading Assessment--Second Edition" (DRA2). Upper Saddle River, NJ--Pearson

Peer reviewed

McCarty, Allison M.; Christ, Theodore J. – Assessment for Effective Intervention, 2010

This article reviews the "Developmental Reading Assessment--Second Edition" (DRA2), a teacher-administered assessment that identifies students' instructional level, along with their strengths and weaknesses in reading. Once teachers calculate and interpret scores, the data can purportedly be used to inform, and possibly individualize,…

Descriptors: Reading Tests, Oral Reading, Reading Fluency, Criterion Referenced Tests

Potential Bias in Predictive Validity of Universal Screening Measures across Disaggregation Subgroups

Peer reviewed

Hosp, John L.; Hosp, Michelle A.; Dole, Janice K. – School Psychology Review, 2011

Universal screening measures are an integral component of any tiered system of instructional delivery. Recent studies of screening measures have often excluded examinations of bias in predictive validity. The present study examined a common screening instrument for evidence of bias in predictive validity across the four disaggregation categories…

Descriptors: Evidence, Reading Fluency, Federal Legislation, Predictive Validity

On the Validity of Student Evaluation of Teaching: The State of the Art

Peer reviewed

Spooren, Pieter; Brockx, Bert; Mortelmans, Dimitri – Review of Educational Research, 2013

This article provides an extensive overview of the recent literature on student evaluation of teaching (SET) in higher education. The review is based on the SET meta-validation model, drawing upon research reports published in peer-reviewed journals since 2000. Through the lens of validity, we consider both the more traditional research themes in…

Descriptors: Student Evaluation of Teacher Performance, Teacher Evaluation, Test Validity, Educational Research

Investigating the Value of Section Scores for the "TOEFL iBT"® Test. "TOEFL iBT"® Research Report. TOEFL iBT-21. ETS Research Report RR-13-35

Peer reviewed
PDF on ERIC

Sawaki, Yasuyo; Sinharay, Sandip – ETS Research Report Series, 2013

This study investigates the value of reporting the reading, listening, speaking, and writing section scores for the "TOEFL iBT"® test, focusing on 4 related aspects of the psychometric quality of the TOEFL iBT section scores: reliability of the section scores, dimensionality of the test, presence of distinct score profiles, and the…

Descriptors: Scores, Computer Assisted Testing, Factor Analysis, Correlation

Test Review: ACCESS for ELLs[R]

Peer reviewed

Fox, Janna; Fairbairn, Shelley – Language Testing, 2011

This article reviews Assessing Comprehension and Communication in English State-to-State for English Language Learners ("ACCESS for ELLs"[R]), which is a large-scale, high-stakes, standards-based, and criterion-referenced English language proficiency test administered in the USA annually to more than 840,000 English Language Learners (ELLs), in…

Descriptors: Test Preparation, Feedback (Response), Instructional Design, Testing Accommodations

Authentic Assessment for Infants and Toddlers: Exploring the Reliability and Validity of the Ounce Scale

Peer reviewed

Meisels, Samuel J.; Wen, Xiaoli; Beachy-Quick, Kristy – Applied Developmental Science, 2010

This study used a mixed methods methodology to investigate the reliability and validity of the Ounce Scale, an authentic, observational assessment of infants' and toddlers' development from birth through 42 months of age. Quantitative cross-sectional data were collected from 287 children and 124 teachers in seven urban Early Head Start programs;…

Descriptors: Performance Based Assessment, Measures (Individuals), Infants, Toddlers

Reliability and Validity Evidence for the GED[R] English as a Second Language Test. GED Testing Service[R] Research Studies, 2009-4

Setzer, J. Carl – GED Testing Service, 2009

The GED[R] English as a Second Language (GED ESL) Test was designed to serve as an adjunct to the GED test battery when an examinee takes either the Spanish- or French-language version of the tests. The GED ESL Test is a criterion-referenced, multiple-choice instrument that assesses the functional, English reading skills of adults whose first…

Descriptors: Language Tests, High School Equivalency Programs, Psychometrics, Reading Skills

Criterion Referenced Assessment: Establishing Content Validity of Complex Skills Related to Specific Tasks

Peer reviewed
PDF on ERIC

MacQuarrie, David; Applegate, Brooks; Lacefield, Warren – Journal of Career and Technical Education, 2008

Career and Technical Education (CTE) is a nationwide program that emphasizes training for primary, secondary, and post secondary educational stages for the career and workforce needs of today and tomorrow's society. Mandated indicators of success have been set in place and secondary schools are expected to improve students' skill levels in…

Descriptors: Criterion Referenced Tests, Content Validity, Test Validity, Test Reliability

Practical Guidelines for Valid and Reliable Youth Fitness Testing

Peer reviewed

Mahar, Matthew T.; Rowe, David A. – Measurement in Physical Education and Exercise Science, 2008

Accurate measures of youth fitness are needed by researchers and practitioners. Evidence of validity and reliability are essential before results of youth fitness tests can be used to make sound decisions. This article describes a three-stage paradigm for validation research and provides guidance for conducting and understanding norm-referenced…

Descriptors: Test Reliability, Test Validity, Guidelines, Physical Education Teachers

Assessor Training: Its Effects on Criterion-Based Assessment in a Medical Context

Pell, Godfrey; Homer, Matthew S.; Roberts, Trudie E. – International Journal of Research & Method in Education, 2008

Increasingly, academic institutions are being required to improve the validity of the assessment process; unfortunately, often this is at the expense of reliability. In medical schools (such as Leeds), standardized tests of clinical skills, such as "Objective Structured Clinical Examinations" (OSCEs) are widely used to assess clinical…

Descriptors: Medical Education, Standardized Tests, Clinical Experience, Criterion Referenced Tests

Achievement Testing in the No Child Left Behind Era: The Arkansas Benchmark

Peer reviewed

Hall, John D.; Howerton, D. Lynn; Jones, Craig H. – Research in the Schools, 2008

The No Child Left Behind Act and the accountability movement in public education caused many states to develop criterion-referenced academic achievement tests. Scores from these tests are often used to make high stakes decisions. Even so, these tests typically do not receive independent psychometric scrutiny. We evaluated the 2005 Arkansas…

Descriptors: Criterion Referenced Tests, Achievement Tests, High Stakes Tests, Public Education

How Does One Assess the Accuracy of Academic Success Predictors? ROC Analysis Applied to University Entrance Factors

Peer reviewed

Vivo, Juana-Maria; Franco, Manuel – International Journal of Mathematical Education in Science and Technology, 2008

This article attempts to present a novel application of a method of measuring accuracy for academic success predictors that could be used as a standard. This procedure is known as the receiver operating characteristic (ROC) curve, which comes from statistical decision techniques. The statistical prediction techniques provide predictor models and…

Descriptors: Academic Achievement, Item Response Theory, Criterion Referenced Tests, Predictor Variables

Assessing Language, Literacy, and Mathematics Skills with "Work Sampling for Head Start"

Peer reviewed

Meisels, Samuel J.; Xue, Yange; Shamblott, Melissa – Early Education and Development, 2008

Research Findings: We examined the reliability and validity of the language, literacy, and mathematics domains of "Work Sampling for Head Start" (WSHS), an observational assessment designed for 3- and 4-year-olds. Participants included 112 children who were enrolled over a two-year period in Head Start and a number of other programs…

Descriptors: Preschool Children, Preschool Education, Early Intervention, Criterion Referenced Tests

The Single Administration Estimate of the Proportion of Agreement of a Proficiency Test Scored with a Latent Structure Model.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1981

This paper describes and compares procedures for estimating the reliability of proficiency tests that are scored with latent structure models. Results suggest that the predictive estimate is the most accurate of the procedures. (Author/BW)

Descriptors: Criterion Referenced Tests, Scoring, Test Reliability

Source

Journal of Educational…	27
Educational and Psychological…	9
Online Submission	6
Assessment for Effective…	4
Journal of Psychoeducational…	4
Language Testing	4
Psychometrika	4
Research Quarterly for…	4
Educational Leadership	3
Educational Researcher	3
Evaluation and the Health…	3
Reading Teacher	3
Educational Measurement:…	2
English Language Teaching	2
Journal of Clinical Psychology	2
Journal of Educational…	2
Journal of Learning Design	2
Learning Disability Quarterly	2
Performance and Instruction	2
Review of Educational Research	2
School Psychology Review	2
American Journal of Education	1
American Journal of…	1
American Psychologist	1
American Vocational Journal	1

Author

Hambleton, Ronald K.	13
Livingston, Samuel A.	8
Brennan, Robert L.	6
Wilcox, Rand R.	5
Huynh, Huynh	4
Kane, Michael T.	4
Roid, Gale	4
Roudabush, Glenn E.	4
Subkoviak, Michael J.	4
Tindal, Gerald	4
Baker, Eva L.	3
Berk, Ronald A.	3
Ebel, Robert L.	3
Eignor, Daniel R.	3
Haladyna, Tom	3
Kriewall, Thomas E.	3
Lovett, Hubert T.	3
Popham, W. James	3
Winnick, Joseph P.	3
Algina, James	2
Bashaw, W. L.	2
Blatchford, Charles H.	2
Bloch, Barbara	2
Bormuth, John R.	2

Publication Type

Reports - Research	175
Journal Articles	105
Speeches/Meeting Papers	53
Reports - Evaluative	43
Guides - Non-Classroom	23
Tests/Questionnaires	22
Opinion Papers	21
Information Analyses	19
Reports - Descriptive	17
Guides - Classroom - Teacher	5
Books	3
Collected Works - Proceedings	3
Collected Works - Serials	3
Numerical/Quantitative Data	3
Reports - General	3
Guides - General	2
Book/Product Reviews	1
Collected Works - General	1
Dissertations/Theses -…	1
Dissertations/Theses -…	1
Reference Materials -…	1
Reference Materials - General	1

Assessments and Surveys

National Assessment of…	3
SRA Achievement Series	3
California Achievement Tests	2
Adaptive Behavior Scale	1
Adult Performance Level	1
Battelle Developmental…	1
College Level Academic Skills…	1
Comprehensive Tests of Basic…	1
Cornell Critical Thinking Test	1
Dynamic Indicators of Basic…	1
General Educational…	1
Georgia Criterion Referenced…	1
Infant Toddler Environment…	1
Kaufman Test of Educational…	1
Metropolitan Readiness Tests	1
New York State Regents…	1
Preschool Language Scale	1
Test of English as a Foreign…	1
Texas Essential Knowledge and…	1
Watson Glaser Critical…	1
Wechsler Adult Intelligence…	1
Wechsler Individual…	1
Wechsler Intelligence Scales…	1
Woodcock Reading Mastery Test	1