Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 14 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 60 |
Descriptor
Source
Author
Ediger, Marlow | 10 |
Hambleton, Ronald K. | 5 |
Sousa, Ronald L. | 5 |
Popham, W. James | 4 |
Baker, Eva L. | 3 |
Bielinski, John | 3 |
Gillis, Shelley | 3 |
Bricker, Diane | 2 |
Busch, John Christian | 2 |
Edmonston, Leon P. | 2 |
Glass, Gene V. | 2 |
More ▼ |
Publication Type
Education Level
Location
Australia | 20 |
Canada | 6 |
California | 3 |
Florida | 3 |
Massachusetts | 3 |
Delaware | 2 |
Georgia | 2 |
Michigan | 2 |
New York | 2 |
New Zealand | 2 |
North Carolina | 2 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 9 |
No Child Left Behind Act 2001 | 3 |
Education Consolidation… | 1 |
Social Security Act Title XX | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Mahar, Matthew T.; Rowe, David A. – Measurement in Physical Education and Exercise Science, 2008
Accurate measures of youth fitness are needed by researchers and practitioners. Evidence of validity and reliability are essential before results of youth fitness tests can be used to make sound decisions. This article describes a three-stage paradigm for validation research and provides guidance for conducting and understanding norm-referenced…
Descriptors: Test Reliability, Test Validity, Guidelines, Physical Education Teachers
Tatsuoka, Curtis – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the author addresses what is referred to as the deterministic input, noisy "and" gate (DINA) model. The author mentions concerns with how this model has been formulated and presented. In particular, the author points out that there is a lack of recognition of the confounding of profiles that generally arises and then discusses…
Descriptors: Test Items, Classification, Psychometrics, Item Response Theory
Maris, Gunter; Bechger, Timo – Measurement: Interdisciplinary Research and Perspectives, 2009
Rupp and Templin (2008) do a good job at describing the ever expanding landscape of Diagnostic Classification Models (DCM). In many ways, their review article clearly points to some of the questions that need to be answered before DCMs can become part of the psychometric practitioners toolkit. Apart from the issues mentioned in this article that…
Descriptors: Factor Analysis, Classification, Psychometrics, Item Response Theory
Petscher, Yaacov; Foorman, Barbara – Society for Research on Educational Effectiveness, 2009
The current study will examine possible contextual effects relative to differences in reading comprehension performance in the state of Florida. While the Reading First (RF) Impact study examined such difference using a regression discontinuity design, the authors are primarily interested in other analytic methods that might answer different…
Descriptors: Reading Comprehension, Criterion Referenced Tests, Comparative Analysis, Reading Programs
Sinharay, Sandip; Haberman, Shelby J. – Measurement: Interdisciplinary Research and Perspectives, 2009
In this commentary, the authors discuss some of the issues regarding the use of diagnostic classification models that practitioners should keep in mind. In the authors experience, these issues are not as well known as they should be. The authors then provide recommendations on diagnostic scoring.
Descriptors: Scoring, Reliability, Validity, Classification
Amrein-Beardsley, Audrey – Educational Researcher, 2008
Value-added models help to evaluate the knowledge that school districts, schools, and teachers add to student learning as students progress through school. In this article, the well-known Education Value-Added Assessment System (EVAAS) is examined. The author presents a practical investigation of the methodological issues associated with the…
Descriptors: Validity, School Districts, Academic Achievement, Measurement Techniques
Frey, Andreas; Carstensen, Claus H. – Measurement: Interdisciplinary Research and Perspectives, 2009
On a general level, the objective of diagnostic classifications models (DCMs) lies in a classification of individuals regarding multiple latent skills. In this article, the authors show that this objective can be achieved by multidimensional adaptive testing (MAT) as well. The authors discuss whether or not the restricted applicability of DCMs can…
Descriptors: Adaptive Testing, Test Items, Classification, Psychometrics
Black, David R.; Routson, Sue; Spight, Damon L.; Tindall, Judith A.; Wegner, Carolyn – Perspectives in Peer Programs, 2007
The Indiana Department of Education, at the direction of Phyllis Lewis, commissioned the National Association of Peer Programs (NAPP: formerly known as the National Peer Helpers Association) and the authors listed above to develop a rubric for peer helping programs. Development of the rubric began with a review of the NAPP Programmatic Standards…
Descriptors: Peer Teaching, Peer Influence, Helping Relationship, Scoring Rubrics
Edmonston, Leon P. – 1972
Attempts by the Southwest Educational Development Laboratory to arrive at a comprehensive evaluation model are reviewed. Problems that arose from using classical procedures and measures are discussed. The emphasis of the Lab was to develop an evaluation model related to criterion referenced measures that provide decision-making information. (DB)
Descriptors: Criterion Referenced Tests, Decision Making, Evaluation Methods, Models
Kosecoff, Jacqueline; And Others – 1976
There are, at present, a number of tests that are labeled criterion referenced. These tests vary considerably in format, design, analysis, and function. In order to provide an efficient and objective procedure for describing, assessing, and comparing these measures, the Criterion Referenced Test Description and Evaluation (CRTDE) rating system was…
Descriptors: Criterion Referenced Tests, Evaluation, Evaluation Criteria, Evaluation Methods
Roebuck, Martyn – Programmed Learning and Educational Technology, 1972
This paper reviews some of the suggested indices of learning and of the problems inherent in gain measures. It then discusses the relevance of criterion-referenced testing and of operationally-defined testing to the measurement of achievement. (Editor)
Descriptors: Achievement Tests, Criterion Referenced Tests, Evaluation Methods, Norm Referenced Tests
Grosges, Thomas; Barchiesi, Dominique – Higher Education in Europe, 2007
The European Credit Transfer and Accumulation System (ECTS) has been developed and instituted to facilitate student mobility and academic recognition. This paper presents, discusses, and illustrates the pertinence and the limitation of the current statistical distribution of the ECTS grades, and we propose an alternative way to calculate the ECTS…
Descriptors: Grades (Scholastic), Statistical Distributions, Statistical Analysis, Student Mobility
Holowinsky, Ivan Z. – 1978
The paper describes the emergence of the variety of modified approaches toward the assessment of cognitive skills. A change in emphasis away from norm-referenced assessment toward criterion-referenced assessment is noted. For such approaches as emphasis on the process rather than the product, intense detailed behavioral observations, and…
Descriptors: Cognitive Development, Criterion Referenced Tests, Evaluation Methods, Handicapped Children
Kennedy, Beth T. – 1972
Issues related to the evaluation of instructional programs developed under the auspices of the Southwest Educational Development Laboratory are briefly discussed. The Laboratory develops criterion-referenced tests which form an integral part of each instructional program. The importance of examining the reliability and validity of these tests is…
Descriptors: Criterion Referenced Tests, Evaluation Methods, Instructional Programs, Test Reliability

Zieky, Michael – Studies in Educational Evaluation, 1989
Problems inherent in setting standards/passing scores for criterion-referenced tests are discussed; and traditional methods of setting standards are reviewed. Three acceptable methods based on judgments of questions are discussed; their authors include, respectively: (1) W. H. Angoff (1971); (2) R. L. Ebel (1972); and (3) L. Nedelsky (1954). (SLD)
Descriptors: Criterion Referenced Tests, Cutting Scores, Evaluation Methods, Standard Setting (Scoring)