Showing all 5 results
Peer reviewed
Clauser, Jerome C.; Hambleton, Ronald K.; Baldwin, Peter – Educational and Psychological Measurement, 2017
The Angoff standard setting method relies on content experts to review exam items and make judgments about the performance of the minimally proficient examinee. Unfortunately, at times content experts may have gaps in their understanding of specific exam content. These gaps are particularly likely to occur when the content domain is broad and/or…
Descriptors: Scores, Item Analysis, Classification, Decision Making
Peer reviewed
Wendt, Heike; Kasper, Daniel – Large-scale Assessments in Education, 2016
Background: In 2011 the Progress in International Reading Literacy Study (PIRLS) and the Trends in International Mathematics and Science Study (TIMSS) were conducted at fourth grade in a number of participating countries with a shared representative sample. In this article we investigate whether there are multidimensional proficiency patterns…
Descriptors: Achievement Tests, Elementary Secondary Education, Foreign Countries, International Assessment
Peer reviewed
Riley, Barth B.; Dennis, Michael L.; Conrad, Kendon J. – Applied Psychological Measurement, 2010
This simulation study sought to compare four different computerized adaptive testing (CAT) content-balancing procedures designed for use in a multidimensional assessment with respect to measurement precision, symptom severity classification, validity of clinical diagnostic recommendations, and sensitivity to atypical responding. The four…
Descriptors: Simulation, Computer Assisted Testing, Adaptive Testing, Comparative Analysis
Melnick, Steven A.; Henk, William A. – 1997
This paper compares two methods of establishing content validity, forced-choice judgmental review and a latent category judgmental review. It also compares content validity evidence with the results of a scale reliability analysis and makes recommendations of the two content validity procedures. Two different groups of graduate students enrolled…
Descriptors: Classification, Comparative Analysis, Content Validity, Graduate Students
Dorans, Neil J. – College Entrance Examination Board, 2000
Distinctions were made between three classes of statistical linkage: equivalence, concordance, and prediction. These distinctions were based on rational content considerations and empirical statistical relationships. A large database involving SAT I and ACT scores was used to determine which type of linkage was best suited for different scores and…
Descriptors: Statistical Analysis, Prediction, Scores, Standardized Tests