Oakland, Thomas – 1972
New strategies for evaluating criterion-referenced measures (CRM) are discussed. These strategies examine the following issues: (1) the use of norm-referenced measures (NRM) as CRM and then estimating the reliability and validity of such measures in terms of variance from an arbitrarily specified criterion score, (2) estimation of the…
Descriptors: Criterion Referenced Tests, Evaluation Criteria, Evaluation Methods, Item Analysis
Livingston, Samuel A. – 1976
A distinction is made between reliability of measurement and reliability of classification; the "criterion-referenced reliability coefficient" describes the former. Application of this coefficient to the probability distribution of possible scores for a single student yields a meaningful way to describe the reliability of a single score. (Author)
Descriptors: Classification, Criterion Referenced Tests, Error of Measurement, Measurement

Hambleton, Ronald K. – Journal of Educational Measurement, 1978
The use of cut-off scores with criterion referenced tests is defended in this response to two papers by Gene Glass and Nancy Burton. Suggestions for setting cut-off scores are made. (JKS)
Descriptors: Academic Standards, Criterion Referenced Tests, Cutting Scores, Decision Making

Popham, W. James – Educational Leadership, 1978
A well-constructed criterion-referenced test has an unambiguous descriptive scheme, an adequate number of items, a sufficiently limited focus, reliability, validity, and comparative data. (Author/MLF)
Descriptors: Achievement Tests, Competency Based Education, Criterion Referenced Tests, Elementary Secondary Education

Tindal, Gerald; And Others – Journal of Educational Research, 1985
This study examined the test-retest reliability and criterion validity of basal mastery tests of three commercial reading series. Results indicated that reliability and validity of the tests varied among and within instruments. Implications for developing and using basal mastery tests are discussed. (Author/MT)
Descriptors: Basal Reading, Criterion Referenced Tests, Elementary Education, Mastery Tests

Chang, S. Tai; Bashaw, W. L. – Journal of Clinical Psychology, 1984
Investigated the reliability of the McCarthy Screening Test (MST) from a criterion-referenced standpoint, using a sample of 1,323 children. Suggested that dependability indices be adopted for clinical instruments such as the MST. (JAC)
Descriptors: Criterion Referenced Tests, Elementary School Students, Learning Problems, Preschool Children

Blatchford, Charles H. – TESOL Quarterly, 1971
Much of the content was presented at the TESOL Convention, New Orleans, Louisiana, March 1971, and is derived from the author's dissertation, "Experimental Steps to Ascertain Reliability of Diagnostic Tests in English as a Second Language" (Columbia University). (VM)
Descriptors: Criterion Referenced Tests, Diagnostic Tests, Educational Experiments, English (Second Language)

Wilcox, Rand R. – Educational and Psychological Measurement, 1979
For some situations the beta-binomial distribution might be used to describe the marginal distribution of test scores for a particular population of examinees. Several different methods of approximating the maximum likelihood estimate were investigated, and it was found that the Newton-Raphson method should be used when it yields admissible…
Descriptors: Criterion Referenced Tests, Maximum Likelihood Statistics, Measurement, Monte Carlo Methods
Kane, Michael; Wilson, Jennifer – 1982
This paper evaluates the magnitude of the total error in estimates of the difference between an examinee's domain score and the cutoff score. An observed score based on a random sample of items from the domain, and an estimated cutoff score derived from a judgmental standard setting procedure are assumed. The work of Brennan and Lockwood (1980) is…
Descriptors: Criterion Referenced Tests, Cutting Scores, Error of Measurement, Mastery Tests
MacFarland, Thomas W. – 1985
Criterion-referenced evaluation (CRE) describes achievement in performance terms, whereas norm-referenced evaluation (NRE) compares the performance of one individual to that of others with respect to a given evaluation instrument. Vocational educators who base their programs on behaviorism commonly evaluate student performance from a CRE…
Descriptors: Comparative Analysis, Criterion Referenced Tests, Norm Referenced Tests, Secondary Education
Bunch, Michael B. – 1978
The validity and dependability of functional competency tests for adults are examined as they relate to the information needs of instructional decision makers. Test data from the Adult Performance Level (APL) Program (funded by the U.S. Office of Education at the University of Texas at Austin) is used to illustrate key points. In the discussion of…
Descriptors: Adults, Basic Skills, Criterion Referenced Tests, Cutting Scores
Reid, Jerry B.; Roberts, Dennis M. – 1978
Comparisons of corresponding values of phi and kappa coefficients were made for 270 instances of data generated by a Monte Carlo technique to simulate a test-retest situation. Data were generated for distributions with the same mean but three different levels of standard deviation, standard error of measurement and cutting score. Ten samples of…
Descriptors: Comparative Analysis, Correlation, Criterion Referenced Tests, Cutting Scores
Divgi, D. R. – 1978
One aim of criterion-referenced testing is to classify an examinee without reference to a norm group; therefore, any statements about the dependability of such classification ought to be group-independent also. A population-independent index is proposed in terms of the probability of incorrect classification near the cutoff true score. The…
Descriptors: Criterion Referenced Tests, Cutting Scores, Difficulty Level, Error of Measurement

Lovett, Hubert T. – 1975
The reliability of a criterion referenced test was defined as a measure of the degree to which the test discriminates between an individual's level of performance and a predetermined criterion level. The variances of observed and true scores were defined as the squared deviation of the score from the criterion. Based on these definitions and the…
Descriptors: Career Development, Comparative Analysis, Criterion Referenced Tests, Mathematical Models
Brennan, Robert L. – 1974
An attempt is made to explore the use of subjective probabilities in the analysis of item data, especially criterion-referenced item data. Two assumptions are implicit: (1) one wants to obtain a maximum amount of information with respect to an item using a minimum number of subjects; and (2) once the item is validated, it may well be administered…
Descriptors: Confidence Testing, Criterion Referenced Tests, Guessing (Tests), Item Analysis