ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Source

Journal of Educational…

Publication Type

Journal Articles	16
Reports - Research	10
Reports - Evaluative	5
Reports - Descriptive	1

Education Level

Elementary Secondary Education

Audience

Location

Israel

Laws, Policies, & Programs

Assessments and Surveys

Graduate Record Examinations	4
SAT (College Admission Test)	3

What Works Clearinghouse Rating

Showing 1 to 15 of 16 results Save | Export

Is It Necessary to Make Anchor Tests Mini-Versions of the Tests Being Equated or Can Some Restrictions Be Relaxed?

Peer reviewed

Direct link

Sinharay, Sandip; Holland, Paul W. – Journal of Educational Measurement, 2007

It is a widely held belief that anchor tests should be miniature versions (i.e., "minitests"), with respect to content and statistical characteristics, of the tests being equated. This article examines the foundations for this belief regarding statistical characteristics. It examines the requirement of statistical representativeness of…

Descriptors: Test Items, Comparative Testing

Comparisons among Designs for Equating Mixed-Format Tests in Large-Scale Assessments

Peer reviewed

Direct link

Kim, Sooyeon; Walker, Michael E.; McHale, Frederick – Journal of Educational Measurement, 2010

In this study we examined variations of the nonequivalent groups equating design for tests containing both multiple-choice (MC) and constructed-response (CR) items to determine which design was most effective in producing equivalent scores across the two tests to be equated. Using data from a large-scale exam, this study investigated the use of…

Descriptors: Measures (Individuals), Scoring, Equated Scores, Test Bias

A Comparison of Item Sampling Plans in the Application of Multiple Matrix Sampling.

Peer reviewed

Gressard, Risa P.; Loyd, Brenda H. – Journal of Educational Measurement, 1991

A Monte Carlo study, which simulated 10,000 examinees' responses to four tests, investigated the effect of item stratification on parameter estimation in multiple matrix sampling of achievement data. Practical multiple matrix sampling is based on item stratification by item discrimination and a sampling plan with moderate number of subtests. (SLD)

Descriptors: Achievement Tests, Comparative Testing, Computer Simulation, Estimation (Mathematics)

A Comparison of the Performance of Simulated Hierarchical and Linear Testlets.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1992

Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation

Relationships among Multiple-Choice and Open-Ended Analytical Questions.

Peer reviewed

Bridgeman, Brent; Rock, Donald A. – Journal of Educational Measurement, 1993

Exploratory and confirmatory factor analyses were used to explore relationships among existing item types and three new computer-administered item types for the analytical scale of the Graduate Record Examination General Test. Results with 349 students indicate constructs the item types are measuring. (SLD)

Descriptors: College Entrance Examinations, College Students, Comparative Testing, Computer Assisted Testing

Effects of Practical Constraints on Item Selection Rules at the Early Stages of Computerized Adaptive Testing

Peer reviewed

Direct link

Chen, Shu-Ying; Ankenman, Robert D. – Journal of Educational Measurement, 2004

The purpose of this study was to compare the effects of four item selection rules--(1) Fisher information (F), (2) Fisher information with a posterior distribution (FP), (3) Kullback-Leibler information with a posterior distribution (KP), and (4) completely randomized item selection (RN)--with respect to the precision of trait estimation and the…

Descriptors: Test Length, Adaptive Testing, Computer Assisted Testing, Test Selection

Building Algebra Testlets: A Comparison of Hierarchical and Linear Structures.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1991

Hierarchical (adaptive) and linear methods of testlet construction were compared. The performance of 2,080 ninth and tenth graders on a 4-item testlet was used to predict performance on the entire test. The adaptive test was slightly superior as a predictor, but the cost of obtaining that superiority was considerable. (SLD)

Descriptors: Adaptive Testing, Algebra, Comparative Testing, High School Students

Applications of the Analytically Derived Asymptotic Standard Errors of Item Response Theory Item Parameter Estimates

Peer reviewed

Direct link

Li, Yuan H.; Lissitz, Robert W. – Journal of Educational Measurement, 2004

The analytically derived asymptotic standard errors (SEs) of maximum likelihood (ML) item estimates can be approximated by a mathematical function without examinees' responses to test items, and the empirically determined SEs of marginal maximum likelihood estimation (MMLE)/Bayesian item estimates can be obtained when the same set of items is…

Descriptors: Test Items, Computation, Item Response Theory, Error of Measurement

Item Difficulty of Four Verbal Item Types and an Index of Differential Functioning for Black and White Examinees.

Peer reviewed

Freedle, Roy; Kostin, Irene – Journal of Educational Measurement, 1990

The importance of item difficulty (equated delta) was explored as a predictor of differential item functioning of Black versus White examinees for 4 verbal item types using 13 Graduate Record Examination forms and 11 Scholastic Aptitude Test forms. Several significant racial differences were found. (TJH)

Descriptors: Black Students, College Bound Students, College Entrance Examinations, Comparative Testing

The Standardization Approach to Assessing Comprehensive Differential Item Functioning.

Peer reviewed

Dorans, Neil J.; And Others – Journal of Educational Measurement, 1992

The standardization approach to comprehensive differential item functioning is described and contrasted with the log-linear approach to differential distractor functioning and the item-response-theory-based approach to differential alternative functioning. Data from an edition of the Scholastic Aptitude Test illustrate application of the approach…

Descriptors: Black Students, College Entrance Examinations, Comparative Testing, Distractors (Tests)

Gender Differences in Multiple-Choice Tests: The Role of Differential Guessing Tendencies.

Peer reviewed

Ben-Shakhar, Gershon; Sinai, Yakov – Journal of Educational Measurement, 1991

Gender differences in omitting items and guessing on multiple-choice tests were studied in Israel for 302 male and 302 female ninth graders and 150 male and 150 female university applicants. Females tended to omit more items and guess less often than did males. Implications for scoring are discussed. (SLD)

Descriptors: Aptitude Tests, Cognitive Ability, College Applicants, Comparative Testing

A Comparison of Quantitative Questions in Open-Ended and Multiple-Choice Formats.

Peer reviewed

Bridgeman, Brent – Journal of Educational Measurement, 1992

Examinees in a regular administration of the quantitative portion of the Graduate Record Examination responded to particular items in a machine-scannable multiple-choice format. Volunteers (n=364) used a computer to answer open-ended counterparts of these items. Scores for both formats demonstrated similar correlational patterns. (SLD)

Descriptors: Answer Sheets, College Entrance Examinations, College Students, Comparative Testing

A Comparison of Multiple-Choice and Constructed Figural Response Items.

Peer reviewed

Martinez, Michael E. – Journal of Educational Measurement, 1991

Figural response items (FRIs) in science were administered to 347 fourth graders, 365 eighth graders, and 322 twelfth graders. Item and test statistics from parallel FRIs and multiple-choice questions illustrate FRIs' more difficult and more discriminating nature. Relevance of guessing to FRIs and diagnostic value of the item type are highlighted.…

Descriptors: Comparative Testing, Constructed Response, Elementary School Students, Elementary Secondary Education

The Performance of the Mantel-Haenszel Procedure across Samples and Matching Criteria.

Peer reviewed

Ryan, Katherine E. – Journal of Educational Measurement, 1991

The reliability of Mantel-Haenszel (MH) indexes across samples of examinees and sample sizes and their robustness to item context effects were investigated with data for 670 African-American and 5,015 white students from the Second International Mathematics Study. MH procedures can be used to detect differential item functioning. (SLD)

Descriptors: Black Students, Comparative Testing, Context Effect, Evaluation Criteria

A Comparison of Self-Adapted and Computerized Adaptive Tests.

Peer reviewed

Wise, Steven L.; And Others – Journal of Educational Measurement, 1992

Performance of 156 undergraduate and 48 graduate students on a self-adapted test (SFAT)--students choose the difficulty level of their test items--was compared with performance on a computer-adapted test (CAT). Those taking the SFAT obtained higher ability scores and reported lower posttest state anxiety than did CAT takers. (SLD)

Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Difficulty Level

Previous Page | Next Page »

Pages: 1 | 2

Comparative Testing	16
Test Items	16
Multiple Choice Tests	6
College Entrance Examinations	5
Computer Assisted Testing	5
Higher Education	5
Adaptive Testing	4
Black Students	4
Difficulty Level	4
High School Students	4
Item Bias	4
Mathematics Tests	4
Test Construction	4
Test Format	4
White Students	4
College Students	3
High Schools	3
Item Response Theory	3
Racial Differences	3
Computer Simulation	2
Equated Scores	2
Estimation (Mathematics)	2
Evaluation Criteria	2
Grade 8	2
Guessing (Tests)	2
More ▼

Bridgeman, Brent	2
Wainer, Howard	2
Ankenman, Robert D.	1
Ben-Shakhar, Gershon	1
Chen, Shu-Ying	1
Dorans, Neil J.	1
Freedle, Roy	1
Gerritz, Kalle	1
Gressard, Risa P.	1
Holland, Paul W.	1
Kim, Sooyeon	1
Kostin, Irene	1
Li, Yuan H.	1
Lissitz, Robert W.	1
Loyd, Brenda H.	1
Martinez, Michael E.	1
McHale, Frederick	1
Rock, Donald A.	1
Ryan, Katherine E.	1
Scheuneman, Janice Dowd	1
Sinai, Yakov	1
Sinharay, Sandip	1
Walker, Michael E.	1
Wise, Steven L.	1
More ▼