Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 34 |
Descriptor
Criterion Referenced Tests | 656 |
Test Construction | 656 |
Test Validity | 209 |
Test Reliability | 163 |
Norm Referenced Tests | 139 |
Test Items | 134 |
Elementary Secondary Education | 126 |
Achievement Tests | 112 |
Item Analysis | 106 |
Test Interpretation | 90 |
Testing | 90 |
More ▼ |
Source
Author
Hambleton, Ronald K. | 26 |
Roid, Gale | 11 |
Haladyna, Tom | 9 |
Popham, W. James | 9 |
Baker, Eva L. | 7 |
Cheek, Jimmy G. | 7 |
McGhee, Max B. | 7 |
Millman, Jason | 7 |
Nitko, Anthony J. | 7 |
Berk, Ronald A. | 6 |
Eignor, Daniel R. | 4 |
More ▼ |
Publication Type
Education Level
Higher Education | 15 |
Postsecondary Education | 11 |
Elementary Education | 6 |
Elementary Secondary Education | 6 |
Secondary Education | 4 |
Early Childhood Education | 2 |
Grade 1 | 1 |
Grade 10 | 1 |
Grade 3 | 1 |
Grade 4 | 1 |
Grade 5 | 1 |
More ▼ |
Audience
Practitioners | 55 |
Researchers | 29 |
Teachers | 25 |
Administrators | 7 |
Parents | 2 |
Students | 2 |
Counselors | 1 |
Support Staff | 1 |
Location
Florida | 13 |
Australia | 9 |
New York | 6 |
Georgia | 5 |
Massachusetts | 5 |
Oklahoma | 5 |
Canada | 4 |
Missouri | 4 |
Wisconsin | 4 |
California | 3 |
Japan | 3 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Wilcox, Rand R. – Psychometrika, 1978
Several Bayesian approaches to the simultaneous estimation of the means of k binomial populations are discussed. This has particular applicability to criterion-referenced or mastery testing. (Author/JKS)
Descriptors: Bayesian Statistics, Criterion Referenced Tests, Mastery Tests, Probability

Schott, Franz; And Others – Studies in Educational Evaluation, 1984
The relationship between instruction and a test is defined as a parallel content-valid relation. This article describes the PLANA procedure, which approaches the problem of content validity by applying constructional rules for producing or judging the content validity relationship between the instructional objective and the items. (BW)
Descriptors: Criterion Referenced Tests, Educational Objectives, Instructional Design, Teacher Education

Haladyna, Tom; Roid, Gale – Journal of Educational Measurement, 1981
The rationale for use of instructional sensitivity in the empirical review of test items is examined, and the results of a study that distinguishes instructional sensitivity from other item concepts are presented. Research is reviewed which indicates the existence of instructional sensitivity as a unique criterion-referenced test item concept. (RL)
Descriptors: Criterion Referenced Tests, Difficulty Level, Evaluation Criteria, Pretests Posttests
Popham, W. James – Phi Delta Kappan, 1998
The author began his education career committed to curriculum, not measurement. After years of using teacher-made and standardized tests as instructional afterthoughts, he recognized the superiority of criterion-referenced testing and the influence of high-stakes testing on instruction. He then became involved with developing such tests on a…
Descriptors: Behavioral Objectives, Criterion Referenced Tests, Educational Objectives, Educational Testing
Blachford, Jean S. – Today's Education, 1975
Descriptors: Achievement Tests, Criterion Referenced Tests, Diagnostic Tests, Educational Objectives
Hambleton, Ronald K. – 1986
Criterion-referenced tests (CRTs) are constructed to permit the interpretation of examinee tests performance in relation to a set of well-defined competencies. CRTs are currently used extensively in schools, industry, and the armed services because they provide valuable and different information from norm-referenced tests. Test publishers, school…
Descriptors: Behavioral Objectives, Criterion Referenced Tests, Decision Making, Evaluation Criteria
Popham, W. James; Lindheim, Elaine – NCME Measurement in Education, 1980
Attention is drawn to the dynamics of criterion-referenced test (CRT) construction in this report. How CRT's are developed at the Instructional Objectives Exchange is described through a series of three steps. The procedures pertain to the construction of "off the shelf" as well as "customized" tests. Step one, isolating the…
Descriptors: Criterion Referenced Tests, Guidelines, Item Banks, Skill Analysis
Educational Testing Service, Princeton, NJ. – 1973
A filmstrip with associated audio track has been developed to cover the major planning steps in the development of a measurement instrument such as a test or questionnaire. The filmstrip addresses the following six questions: Why am I testing? What should I test? Whom am I testing? What kinds of questions should I use? How long should my test be?…
Descriptors: Criterion Referenced Tests, Filmstrips, Guides, Instructional Films
Woodson, M. I. Charles E.
The item (difficulty and discrimination) and test (reliability and validity) statistics in classical test theory are highly dependent upon the calibration sample of individuals used. The estimates of item and test parameters in classical test theory is valid within a range of interest along the characteristic measured. Generally, this range of…
Descriptors: Criterion Referenced Tests, Item Analysis, Research Reports, Statistics
Durnin, John H.; Scandura, Joseph M. – 1972
For individualized or computer assisted instruction, norm referenced testing is inadequate to determine each individual's mastery on specific kinds of tasks. Hively's item forms and Ferguson's stratified item forms, both based on observable characteristics of the problems, and Scandura's algorithmic technology, positing that persons use rules to…
Descriptors: Branching, Computer Assisted Instruction, Criterion Referenced Tests, Individualized Instruction
Randall, Robert S. – 1972
Differences in design between norm referenced measures (NRM) and criterion referenced measures (CRM) are reviewed, and some of the procedures proposed on designing and evaluating CRM are examined. Differences in design of NRM and CRM are said to arise from the different purposes that underlie each measure. In addition, there are differences among…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Test Construction
Millman, Jason – 1977
A unique system is described for creating tests by computer. It is unique because, instead of storing items in the computer, item algorithms similar to Hively's notion of item forms are banked. Every item, and thus every test, represents a sample from domains consisting of thousands of items. The paper contains a discussion of the special…
Descriptors: Computer Assisted Testing, Computer Programs, Criterion Referenced Tests, Item Banks
Willoughby, Lee; And Others – 1976
This study compared a domain referenced approach with a traditional psychometric approach in the construction of a test. Results of the December, 1975 Quarterly Profile Exam (QPE) administered to 400 examinees at a university were the source of data. The 400 item QPE is a five alternative multiple choice test of information a "safe"…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Statistical Analysis

Guion, Robert M. – Personnel Psychology, 1978
"Content validity" has been widely but unwisely hailed as a solution to many problems in employee selection. The author argues that sampling from content domains cannot logically be substituted for criterion related validity. He suggests that evaluations of scores be based on the principle of construct validation. (Editor/RK)
Descriptors: Criterion Referenced Tests, Evaluation Methods, Personnel Evaluation, Personnel Selection

Hambleton, Ronald K. – Journal of Research in Science Teaching, 1977
The construction of test items, the purposes of testing and criterion-referenced testing are topics debated by two researchers. (CP)
Descriptors: Criterion Referenced Tests, Curriculum Development, Curriculum Evaluation, Evaluation