ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	6

Descriptor

Equated Scores	9
Group Testing	9
Academic Achievement	3
Educational Testing	3
Measures (Individuals)	3
Test Construction	3
Testing Programs	3
Comparative Testing	2
Computer Assisted Testing	2
Educational Assessment	2
Error of Measurement	2
Evaluation Methods	2
Evaluation Problems	2
Higher Education	2
Item Analysis	2
Item Response Theory	2
Latent Trait Theory	2
Raw Scores	2
Sampling	2
Scaling	2
Scoring	2
Test Bias	2
Test Format	2
Test Items	2
Test Reliability	2
More ▼

Source

Alberta Journal of…	1
Applied Measurement in…	1
ETS Research Report Series	1
Educational Measurement:…	1
Journal of Educational…	1
Praeger	1

Author

Kim, Sooyeon	2
Anderson, A. E.	1
BARBER, MAX	1
BEIGHLEY, K.C.	1
Book, Cassandra L.	1
Brennan, Robert L., Ed.	1
Eisenberg, Eric M.	1
Gray, Peter	1
McHale, Frederick	1
Moses, Tim	1
Phillips, Gary W.	1
Phillips, S. E.	1
Ross, John A.	1
Walker, Michael E.	1
Wu, Margaret	1
More ▼

Publication Type

Journal Articles	5
Reports - Research	3
Reports - Descriptive	2
Books	1
Collected Works - General	1
Guides - Classroom - Teacher	1
Reports - Evaluative	1
Speeches/Meeting Papers	1

Education Level

Elementary Secondary Education	3
Adult Education	1
Grade 3	1
Grade 6	1
Grade 9	1
Higher Education	1

Audience

Location

Canada

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 9 results Save | Export

An Investigation of the Impact of Misrouting under Two-Stage Multistage Testing: A Simulation Study. Research Report. ETS RR-14-01

Peer reviewed
PDF on ERIC

Download full text

Kim, Sooyeon; Moses, Tim – ETS Research Report Series, 2014

The purpose of this study was to investigate the potential impact of misrouting under a 2-stage multistage test (MST) design, which includes 1 routing and 3 second-stage modules. Simulations were used to create a situation in which a large group of examinees took each of the 3 possible MST paths (high, middle, and low). We compared differences in…

Descriptors: Comparative Analysis, Difficulty Level, Scores, Test Wiseness

Impact of Design Effects in Large-Scale District and State Assessments

Peer reviewed

Direct link

Phillips, Gary W. – Applied Measurement in Education, 2015

This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…

Descriptors: State Programs, Sampling, Research Design, Error of Measurement

Comparisons among Designs for Equating Mixed-Format Tests in Large-Scale Assessments

Peer reviewed

Direct link

Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010

In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…

Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias

Measurement, Sampling, and Equating Errors in Large-Scale Assessments

Peer reviewed

Direct link

Wu, Margaret – Educational Measurement: Issues and Practice, 2010

In large-scale assessments, such as state-wide testing programs, national sample-based assessments, and international comparative studies, there are many steps involved in the measurement and reporting of student achievement. There are always sources of inaccuracies in each of the steps. It is of interest to identify the source and magnitude of…

Descriptors: Testing Programs, Educational Assessment, Measures (Individuals), Program Effectiveness

Alignment of Scores on Large-Scale Assessments and Report-Card Grades

Peer reviewed

Direct link

Ross, John A.; Gray, Peter – Alberta Journal of Educational Research, 2008

We examined how much agreement there was between scores from large-scale mandated assessments and report-card grades for 14,776 students in grades 3, 6, and 9 of a district in which conditions were conducive to alignment of assessments. We found significant mean differences between internal and external assessments: effect sizes were 0.29 to 0.63…

Descriptors: Student Evaluation, Grades (Scholastic), Measures (Individuals), Effect Size

LOGTRUE: A Computer Program for Test Equating with Item Response Theory.

Phillips, S. E.; Anderson, A. E. – 1983

The LOGTRUE program can be used to obtain a scale of equated raw scores for two tests with parameter estimates on a common item response theory scale. The program derives its name from the method of logistic true score equating described by Lord (1980). The method can be applied to two tests with overlapping items administered to different groups…

Descriptors: Computer Programs, Equated Scores, Group Testing, Latent Trait Theory

Applying Latent Trait Theory to a Course Examination System: Administration, Maintenance, and Training.

Download full text

Eisenberg, Eric M.; Book, Cassandra L. – 1980

Guidelines are described for setting up an item bank under latent trait theory which may be applied to the achievement testing system of multi-section, large-enrollment, college survey courses. The enrollment for the course is typically heterogeneous: students may be majors or non-majors, any one section may contain honors college students and…

Descriptors: Achievement Tests, Course Content, Equated Scores, Goodness of Fit

AN ANALYSIS OF THE COOPERATIVE ENGLISH TEST AS A PREDICTOR OF SUCCESS IN ENGLISH 1A AND ENGLISH 1A71 AT STOCKTON COLLEGE.

Download full text

BARBER, MAX; BEIGHLEY, K.C. – 1960

TO DETERMINE HOW EFFECTIVELY THE COOPERATIVE ENGLISH TEST PREDICTS STUDENT SUCCESS IN FRESHMAN ENGLISH AT STOCKTON COLLEGE, THIS STUDY WAS CONDUCTED (1) TO OBTAIN STATISTICAL COMPARISONS BETWEEN FINAL SEMESTER GRADES OF A SAMPLE OF THE COLLEGE TRANSFER POPULATION WITH THE WHOLE TEST AND EACH OF ITS PARTS, (2) TO ANALYZE BY PERCENTAGES THE…

Descriptors: Academic Achievement, College Freshmen, Curriculum Research, Educational Testing

Educational Measurement. Fourth Edition. ACE/Praeger Series on Higher Education

Direct link

Brennan, Robert L., Ed. – Praeger, 2006

"Educational Measurement" has been the bible in its field since the first edition was published by ACE in 1951. The importance of this fourth edition of "Educational Measurement" is to extensively update and extend the topics treated in the previous three editions. As such, the fourth edition documents progress in the field and…

Descriptors: Educational Testing, Educational Assessment, Test Validity, Test Reliability