ERIC - Search Results

Showing 76 to 90 of 378 results

A Pragmatic Approach to Criterion-Referenced Measures.

Ivens, Stephen H. – 1972

A discussion of criterion-referenced measures is presented. Two characteristics define the criterion-referenced measure: the presence of a performance criterion, and test items keyed to a set of behavioral objectives. The performance criterion, in an educational setting, is usually a relative standard of performance. There are two ways of…

Descriptors: Behavioral Objectives, Criterion Referenced Tests, Item Analysis, Performance Criteria

The Effect of Violating the Assumption of Equal Item Means in Estimating the Livingston Coefficient.

Peer reviewed

Lovett, Hubert T. – Educational and Psychological Measurement, 1978

The validity of five methods of estimating the reliability of criterion-referenced tests was evaluated across nine conditions of variability among item means. The results were analyzed by analysis of variance, the Newman-Keuls test, and a nonparametric procedure. There was a tendency for all of the methods to be conservative. (Author/JKS)

Descriptors: Analysis of Variance, Criterion Referenced Tests, Item Analysis, Nonparametric Statistics

Estimating True Score in the Compound Binomial Error Model

Peer reviewed

Wilcox, Rand R. – Psychometrika, 1978

Several Bayesian approaches to the simultaneous estimation of the means of k binomial populations are discussed. This has particular applicability to criterion-referenced or mastery testing. (Author/JKS)

Descriptors: Bayesian Statistics, Criterion Referenced Tests, Mastery Tests, Probability

Reply to Shavelson, Block, and Ravitch's "Criterion-Referenced Testing: Comments on Reliability"

Peer reviewed

Livingston, Samuel A. – Journal of Educational Measurement, 1972

Author replies to article TM 500 559. (MB)

Descriptors: Criterion Referenced Tests, Measurement Techniques, Norm Referenced Tests, Scoring

The Reliability of a Criterion-Referenced Composite with the Parts of the Composite Having Different Cutting Scores.

Peer reviewed

Raju, Nambury S. – Educational and Psychological Measurement, 1982

Rajaratnam, Cronbach and Gleser's generalizability formula for stratified-parallel tests and Raju's coefficient beta are generalized to estimate the reliability of a composite of criterion-referenced tests, where the parts have different cutting scores. (Author/GK)

Descriptors: Criterion Referenced Tests, Cutting Scores, Mathematical Formulas, Scoring Formulas

The Reliability of Criterion-Referenced Tests and Special Education: Assumed versus Demonstrated.

Peer reviewed

Goodstein, H. A. – Journal of Special Education, 1982

A review of alternative methodologies and a conceptual framework for the study of reliability of criterion-referenced tests are presented. The possibility of aptitude-x-assessment interactions is considered and implications are discussed. (Author)

Descriptors: Criterion Referenced Tests, Disabilities, Elementary Secondary Education, Research Methodology

Estimating the Reliability of Criterion-Referenced Tests before Administration.

Peer reviewed

Chase, Clint – Mid-Western Educational Researcher, 1996

Classical procedures for calculating the two indices of decision consistency (P and Kappa) for criterion-referenced tests require two testings on each child. Huynh, Peng, and Subkoviak have presented one-testing procedures for these indices. These indices can be estimated without any test administration using Ebel's estimates of the mean, standard…

Descriptors: Criterion Referenced Tests, Educational Research, Educational Testing, Estimation (Mathematics)

A Comparison of Kuder-Richardson Formula 20 and Kappa as Estimates of the Reliability of Criterion-Referenced Tests.

Moyer, Judith E.; Fishbein, Ronald L. – 1977

The problem that this research addressed was one of decision making. Given three sets of criterion-referenced tests which were designed to be parallel in content, would a traditional reliability coefficient produce different decisions about the reliability of those tests than would kappa? The procedure used collected statewide results on 136 test…

Descriptors: Analysis of Variance, Comparative Analysis, Criterion Referenced Tests, Measurement Techniques

Measurement Problems and Issues Related to Applied Performance Testing.

Sanders, James R. – 1976

Applied Performance Tests (APT) are defined as instruments designed to measure performance in an actual or simulated setting. They require at least a close approximation of the setting (if not the actual setting) to which the performance is expected to be transferred. This paper outlines measurement problems and issues that are unique to APT. It…

Descriptors: Criterion Referenced Tests, Elementary Secondary Education, Measurement, Performance Tests

Preparation of a Filmstrip Unit on Basic Measurement Principles. Final Report.

Educational Testing Service, Princeton, NJ. – 1973

A filmstrip with associated audio track has been developed to cover the major planning steps in the development of a measurement instrument such as a test or questionnaire. The filmstrip addresses the following six questions: Why am I testing? What should I test? Whom am I testing? What kinds of questions should I use? How long should my test be?…

Descriptors: Criterion Referenced Tests, Filmstrips, Guides, Instructional Films

Classical Test Theory and Criterion-Referenced Scales.

Woodson, M. I. Charles E.

The item (difficulty and discrimination) and test (reliability and validity) statistics in classical test theory are highly dependent upon the calibration sample of individuals used. The estimates of item and test parameters in classical test theory are valid within a range of interest along the characteristic measured. Generally, this range of…

Descriptors: Criterion Referenced Tests, Item Analysis, Research Reports, Statistics

Contrasting Norm Referenced and Criterion Referenced Measures.

Randall, Robert S. – 1972

Differences in design between norm referenced measures (NRM) and criterion referenced measures (CRM) are reviewed, and some of the procedures proposed on designing and evaluating CRM are examined. Differences in design of NRM and CRM are said to arise from the different purposes that underlie each measure. In addition, there are differences among…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Test Construction

A Comparison of Domain-Referenced and Classic Psychometric Test Construction Methods.

Willoughby, Lee; And Others – 1976

This study compared a domain-referenced approach with a traditional psychometric approach in the construction of a test. Results of the December 1975 Quarterly Profile Exam (QPE) administered to 400 examinees at a university were the source of data. The 400-item QPE is a five-alternative multiple-choice test of information a "safe"…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Statistical Analysis

The Role of Reliability in Criterion-Referenced Tests.

Peer reviewed

Kane, Michael T. – Journal of Educational Measurement, 1986

These analyses suggest that if a criterion-referenced test had a reliability (defined in terms of internal consistency) below 0.5, a simple a priori procedure would provide better estimates of students' universe scores than would individual observed scores. (Author/LMO)

Descriptors: Criterion Referenced Tests, Educational Research, Error of Measurement, Generalizability Theory

The Features of Good Language Tests

Paradowski, Michal B. – Online Submission, 2002

The paper discusses the key criteria of good language tests: practicality, validity, and reliability.

Descriptors: Language Tests, Criterion Referenced Tests, Test Reliability, Test Validity
