ERIC - Search Results

Showing all 9 results

Predicting Operational Rater-Type Classifications Using Rasch Measurement Theory and Random Forests: A Music Performance Assessment Perspective

Peer reviewed

Wesolowski, Brian C. – Journal of Educational Measurement, 2019

The purpose of this study was to build a Random Forest supervised machine learning model in order to predict musical rater-type classifications based upon a Rasch analysis of raters' differential severity/leniency related to item use. Raw scores (N = 1,704) from 142 raters across nine high school solo and ensemble festivals (grades 9-12) were…

Descriptors: Item Response Theory, Prediction, Classification, Artificial Intelligence

Equating Minimum-Competency Tests: Comparison of Methods.

Peer reviewed

Hills, John R.; And Others – Journal of Educational Measurement, 1988

Five methods of equating minimum-competency tests were compared using the Florida Statewide Student Assessment Test, Part II, for 1984 and 1986. Four of five methods yielded essentially comparable results for the highest scoring 84% of the students. Different lengths of anchor items were compared, using the concurrent item response theory equating…

Descriptors: Comparative Analysis, Equated Scores, Evaluation Methods, Graduation Requirements

A Comparison of Angoff's Design I and Design II for Vertical Equating Using Traditional and IRT Methodology.

Peer reviewed

Harris, Deborah J. – Journal of Educational Measurement, 1991

Two data collection designs, counterbalanced and spiraling (Angoff's Design I and Angoff's Design II), were compared using item response theory and equipercentile equating methodology in the vertical equating of two mathematics achievement tests using 1,000 eleventh graders and 1,000 twelfth graders. The greater stability of Design II is discussed.…

Descriptors: Achievement Tests, College Entrance Examinations, Comparative Analysis, Data Collection

A Comparative Study of the Effects of Recency of Instruction on the Stability of IRT and Conventional Item Parameter Estimates.

Peer reviewed

Cook, Linda L.; And Others – Journal of Educational Measurement, 1988

Three administrations of two forms of a biology achievement test were conducted with high school sophomores and seniors. The 58 common items between the two tests were used for score equating purposes. (TJH)

Descriptors: Achievement Tests, Biology, Comparative Analysis, Grade 10

An Empirical Investigation of Thurstone and IRT Methods of Scaling Achievement Tests.

Peer reviewed

Becker, Douglas F.; Forsyth, Robert A. – Journal of Educational Measurement, 1992

Measurement scales developed using Thurstone and item-response theory (IRT) methods of scaling achievement tests for the same single-level data were compared for approximately 4,000 high school students taking the Iowa Tests of Educational Development in 1975. Results of both approaches indicate that variability increases as grade level increases.…

Descriptors: Achievement Tests, Age Differences, High School Students, High Schools

Comparison of Item Preequating and Random Groups Equating Using IRT and Equipercentile Methods.

Peer reviewed

Kolen, Michael J.; Harris, Deborah J. – Journal of Educational Measurement, 1990

Item-preequating and random groups designs were used to equate forms of the American College Testing Assessment Mathematics Test for over 36,000 students. Equipercentile and three-parameter logistic model item response theory (IRT) procedures were used for both designs. The pretest methods did not compare well with the random groups method. (SLD)

Descriptors: College Entrance Examinations, Comparative Analysis, Equated Scores, High School Students

On the Reliability of Testlet-Based Tests.

Peer reviewed

Sireci, Stephen G.; And Others – Journal of Educational Measurement, 1991

Calculating the reliability of a testlet-based test is demonstrated using data from 1,812 males and 2,216 females taking the Scholastic Aptitude Test verbal section and 3,866 examinees taking another reading test. Traditional reliabilities calculated on reading comprehension tests constructed of four testlets provided substantial overestimates.…

Descriptors: College Entrance Examinations, Equations (Mathematics), Estimation (Mathematics), High School Students

Use of Restricted Item Response Models for Examining Item Difficulty Ordering and Slope Uniformity.

Peer reviewed

Lane, Suzanne – Journal of Educational Measurement, 1991

The use of restricted item response models to test hypotheses regarding item difficulty ordering and slope uniformity was demonstrated in a study in which 597 algebra students were asked to solve word problems reflecting various types of cognitive processing. Benefits and limitations of the procedures are discussed. (SLD)

Descriptors: Algebra, Cognitive Ability, Cognitive Processes, Cognitive Tests

Differential Item Functioning for Minority Examinees on the SAT.

Peer reviewed

Schmitt, Alicia P.; Dorans, Neil J. – Journal of Educational Measurement, 1990

Recent findings on differential item functioning (DIF) for minority examinees (Asian Americans, Hispanics, and Blacks) taking the Scholastic Aptitude Test (SAT) are presented, and the standardization approach to assessing DIF is described. Item characteristics related to DIF that generalize across ethnic groups are discussed. (SLD)

Descriptors: Asian Americans, Black Students, College Applicants, College Entrance Examinations