ERIC - Search Results

Publication Date

In 2025	4
Since 2024	60

Descriptor

Test Items	60
Item Response Theory	58
Models	17
Foreign Countries	16
Item Analysis	16
Test Construction	15
Difficulty Level	10
Simulation	10
Accuracy	9
Error of Measurement	9
Mathematics Tests	9
Comparative Analysis	8
Computation	8
Goodness of Fit	8
Correlation	7
Monte Carlo Methods	7
Scores	7
Test Format	7
Achievement Tests	6
Multiple Choice Tests	6
Psychometrics	6
Science Tests	6
Test Validity	6
Bayesian Statistics	5
English (Second Language)	5
More ▼

Publication Type

Journal Articles	53
Reports - Research	53
Dissertations/Theses -…	3
Information Analyses	2
Reports - Descriptive	2
Reports - Evaluative	2
Tests/Questionnaires	1

Education Level

Secondary Education	12
Elementary Education	9
Higher Education	7
Postsecondary Education	7
Junior High Schools	5
Middle Schools	5
Elementary Secondary Education	4
Early Childhood Education	3
Grade 8	3
Primary Education	3
Grade 2	2
Grade 4	2
High Schools	2
Intermediate Grades	2
Grade 1	1
Grade 11	1
Grade 3	1
Kindergarten	1
More ▼

Audience

Location

Indonesia	3
Nigeria	2
Uzbekistan	2
Germany	1
Iran	1
Switzerland	1
United Kingdom (Edinburgh)	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

Trends in International…	3
Big Five Inventory	1
International English…	1
Program for International…	1
Progress in International…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 60 results Save | Export

Generalizing beyond the Test: Permutation-Based Profile Analysis for Explaining DIF Using Item Features

Peer reviewed

Direct link

Maria Bolsinova; Jesper Tijmstra; Leslie Rutkowski; David Rutkowski – Journal of Educational and Behavioral Statistics, 2024

Profile analysis is one of the main tools for studying whether differential item functioning can be related to specific features of test items. While relevant, profile analysis in its current form has two restrictions that limit its usefulness in practice: It assumes that all test items have equal discrimination parameters, and it does not test…

Descriptors: Test Items, Item Analysis, Generalizability Theory, Achievement Tests

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed
PDF on ERIC

Download full text

Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimations of Item Response Theory according to Maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the priori distribution type, sample size, test length, and logistics model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

Are the Steps on Likert Scales Equidistant? Responses on Visual Analog Scales Allow Estimating Their Distances

Peer reviewed

Direct link

Miguel A. García-Pérez – Educational and Psychological Measurement, 2024

A recurring question regarding Likert items is whether the discrete steps that this response format allows represent constant increments along the underlying continuum. This question appears unsolvable because Likert responses carry no direct information to this effect. Yet, any item administered in Likert format can identically be administered…

Descriptors: Likert Scales, Test Construction, Test Items, Item Analysis

Information Functions of Rank-2PL Models for Forced-Choice Questionnaires

Peer reviewed

Direct link

Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024

This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…

Descriptors: Questionnaires, Test Items, Item Response Theory, Models

The Impact of Measurement Noninvariance across Time and Group in Longitudinal Item Response Modeling

Peer reviewed

Direct link

In-Hee Choi – Asia Pacific Education Review, 2024

Longitudinal item response data often exhibit two types of measurement noninvariance: the noninvariance of item parameters between subject groups and that of item parameters across multiple time points. This study proposes a comprehensive approach to the simultaneous modeling of both types of measurement noninvariance in terms of longitudinal item…

Descriptors: Longitudinal Studies, Item Response Theory, Growth Models, Error of Measurement

Added Value of Subscores for Tests with Polytomous Items

Peer reviewed

Direct link

Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025

Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…

Descriptors: Scores, Test Theory, Test Items, Testing

Detecting Differential Item Functioning with Multiple Causes: A Comparison of Three Methods

Peer reviewed

Direct link

Xiaowen Liu – International Journal of Testing, 2024

Differential item functioning (DIF) often arises from multiple sources. Within the context of multidimensional item response theory, this study examined DIF items with varying secondary dimensions using the three DIF methods: SIBTEST, Mantel-Haenszel, and logistic regression. The effect of the number of secondary dimensions on DIF detection rates…

Descriptors: Item Analysis, Test Items, Item Response Theory, Correlation

Guesses and Slips as Proficiency-Related Phenomena and Impacts on Parameter Invariance

Peer reviewed

Direct link

Xiangyi Liao; Daniel M Bolt – Educational Measurement: Issues and Practice, 2024

Traditional approaches to the modeling of multiple-choice item response data (e.g., 3PL, 4PL models) emphasize slips and guesses as random events. In this paper, an item response model is presented that characterizes both disjunctively interacting guessing and conjunctively interacting slipping processes as proficiency-related phenomena. We show…

Descriptors: Item Response Theory, Test Items, Error Correction, Guessing (Tests)

The Impact of Non-Effortful Responding on Item and Person Parameters in Item-Pool Scaling Linking

Peer reviewed

Direct link

Yue Liu; Zhen Li; Hongyun Liu; Xiaofeng You – Applied Measurement in Education, 2024

Low test-taking effort of examinees has been considered a source of construct-irrelevant variance in item response modeling, leading to serious consequences on parameter estimation. This study aims to investigate how non-effortful response (NER) influences the estimation of item and person parameters in item-pool scale linking (IPSL) and whether…

Descriptors: Item Response Theory, Computation, Simulation, Responses

Modeling Dimensions Converging at the Upper Anchor in Learning Progressions: An Example of Micro-Evolution

Peer reviewed

Direct link

Mingfeng Xue; Mark Wilson – Applied Measurement in Education, 2024

Multidimensionality is common in psychological and educational measurements. This study focuses on dimensions that converge at the upper anchor (i.e. the highest acquisition status defined in a learning progression) and compares different ways of dealing with them using the multidimensional random coefficients multinomial logit model and scale…

Descriptors: Learning Trajectories, Educational Assessment, Item Response Theory, Evolution

The Development of a Standardized Effect Size for the SIBTEST Procedure

Peer reviewed

Direct link

James D. Weese; Ronna C. Turner; Allison Ames; Xinya Liang; Brandon Crawford – Journal of Experimental Education, 2024

In this study a standardized effect size was created for use with the SIBTEST procedure. Using this standardized effect size, a single set of heuristics was developed that are appropriate for data fitting different item response models (e.g., 2-parameter logistic, 3-parameter logistic). The standardized effect size rescales the raw beta-uni value…

Descriptors: Test Bias, Test Items, Item Response Theory, Effect Size

The Impact of Insufficient Effort Responses on the Order of Category Thresholds in the Polytomous Rasch Model

Peer reviewed

Direct link

Kuan-Yu Jin; Thomas Eckes – Educational and Psychological Measurement, 2024

Insufficient effort responding (IER) refers to a lack of effort when answering survey or questionnaire items. Such items typically offer more than two ordered response categories, with Likert-type scales as the most prominent example. The underlying assumption is that the successive categories reflect increasing levels of the latent variable…

Descriptors: Item Response Theory, Test Items, Test Wiseness, Surveys

A Multidimensional Partially Compensatory Response Time Model on Basis of the Log-Normal Distribution

Peer reviewed

Direct link

Jochen Ranger; Christoph König; Benjamin W. Domingue; Jörg-Tobias Kuhn; Andreas Frey – Journal of Educational and Behavioral Statistics, 2024

In the existing multidimensional extensions of the log-normal response time (LNRT) model, the log response times are decomposed into a linear combination of several latent traits. These models are fully compensatory as low levels on traits can be counterbalanced by high levels on other traits. We propose an alternative multidimensional extension…

Descriptors: Models, Statistical Distributions, Item Response Theory, Response Rates (Questionnaires)

Correcting for Extreme Response Style: Model Choice Matters

Peer reviewed

Direct link

Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024

Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…

Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales

Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches

Peer reviewed

Direct link

Güler Yavuz Temel – Journal of Educational Measurement, 2024

The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…

Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Educational and Psychological…	10
Applied Measurement in…	7
Journal of Educational and…	6
Grantee Submission	4
International Journal of…	3
Journal of Educational…	3
ProQuest LLC	3
Language Testing in Asia	2
Sociological Methods &…	2
Asia Pacific Education Review	1
CBE - Life Sciences Education	1
Chemistry Education Research…	1
Education and Information…	1
Educational Assessment,…	1
Educational Measurement:…	1
Interchange: A Quarterly…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Baltic Science…	1
Journal of Education and…	1
Journal of Experimental…	1
Language Testing	1
Large-scale Assessments in…	1
Measurement:…	1
More ▼

Amanda Goodwin	2
Benjamin W. Domingue	2
Chun Wang	2
Diyorjon Abdullaev	2
Gongjun Xu	2
Jesper Tijmstra	2
Jianbin Fu	2
Joshua B. Gilbert	2
Luke W. Miratrix	2
Maria Bolsinova	2
Matthew Naveiras	2
Patrick C. Kyllonen	2
Paul De Boeck	2
Sun-Joo Cho	2
Weicong Lyu	2
Xuan Tan	2
Adekunle Ibrahim Oladejo	1
Agus Santoso	1
Alicia A. Stoltenberg	1
Allan S. Cohen	1
Allison Ames	1
Amber Dudley	1
Anastasia Sofroniou	1
Andreas Frey	1
Andreas Kurz	1
More ▼