Showing all 9 results
Peer reviewed
Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014
The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…
Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness
Peer reviewed
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Peer reviewed
Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010
In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…
Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias
Peer reviewed
Wu, Margaret – Educational Measurement: Issues and Practice, 2010
In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…
Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness
Peer reviewed
Ross, John A.; Gray, Peter – Alberta Journal of Educational Research, 2008
We examined how much agreement there was between scores from large-scale mandated assessments and report-card grades for 14,776 students in grades 3, 6, and 9 of a district in which conditions were conducive to alignment of assessments. We found significant mean differences between internal and external assessments: effect sizes were 0.29 to 0.63…
Descriptors: Student Evaluation, Grades (Scholastic), Measures (Individuals), Effect Size
Phillips, S. E.; Anderson, A. E. – 1983
The LOGTRUE program can be used to obtain a scale of equated raw scores for two tests with parameter estimates on a common item response theory scale. The program derives its name from the method of logistic true score equating described by Lord (1980). The method can be applied to two tests with overlapping items administered to different groups…
Descriptors: Computer Programs, Equated Scores, Group Testing, Latent Trait Theory
Eisenberg, Eric M.; Book, Cassandra L. – 1980
Guidelines are described for setting up an item bank under latent trait theory which may be applied to the achievement testing system of multi-section, large-enrollment, college survey courses. The enrollment for the course is typically heterogeneous: students may be majors or non-majors, any one section may contain honors college students and…
Descriptors: Achievement Tests, Course Content, Equated Scores, Goodness of Fit
Barber, Max; Beighley, K. C. – 1960
To determine how effectively the Cooperative English Test predicts student success in freshman English at Stockton College, this study was conducted (1) to obtain statistical comparisons between final semester grades of a sample of the college transfer population with the whole test and each of its parts, (2) to analyze by percentages the…
Descriptors: Academic Achievement, College Freshmen, Curriculum Research, Educational Testing
Brennan, Robert L., Ed. – Praeger, 2006
"Educational Measurement" has been the bible in its field since the first edition was published by ACE in 1951. This fourth edition of "Educational Measurement" extensively updates and extends the topics treated in the previous three editions. As such, the fourth edition documents progress in the field and…
Descriptors: Educational Testing, Educational Assessment, Test Validity, Test Reliability