ERIC - Search Results

Publication Date

In 2025	12
Since 2024	187
Since 2021 (last 5 years)	818
Since 2016 (last 10 years)	1951
Since 2006 (last 20 years)	4074

Descriptor

Item Response Theory	5553
Test Items	1817
Foreign Countries	1196
Models	1148
Psychometrics	918
Scores	782
Comparative Analysis	761
Test Construction	750
Simulation	740
Statistical Analysis	659
Difficulty Level	570
Computer Assisted Testing	542
Test Validity	537
Test Reliability	532
Factor Analysis	513
Computation	512
Evaluation Methods	512
Item Analysis	504
Goodness of Fit	503
Correlation	481
Error of Measurement	440
Test Bias	427
Measures (Individuals)	423
Mathematics Tests	377
Measurement Techniques	373

Author

Sinharay, Sandip	48
Wilson, Mark	45
Cohen, Allan S.	43
Meijer, Rob R.	43
Tindal, Gerald	42
Wang, Wen-Chung	40
Alonzo, Julie	37
Ferrando, Pere J.	36
Cai, Li	35
van der Linden, Wim J.	35
Glas, Cees A. W.	34
Engelhard, George, Jr.	33
Sijtsma, Klaas	33
Kim, Seock-Ho	32
von Davier, Matthias	32
Mislevy, Robert J.	29
Lee, Won-Chan	28
Haberman, Shelby J.	25
Kolen, Michael J.	25
Wind, Stefanie A.	25
De Boeck, Paul	24
DeMars, Christine E.	24
Hambleton, Ronald K.	23
Andrich, David	22

Education Level

Higher Education	690
Secondary Education	564
Elementary Education	518
Postsecondary Education	518
Middle Schools	294
Elementary Secondary Education	237
Junior High Schools	229
High Schools	193
Early Childhood Education	160
Grade 8	158
Intermediate Grades	139
Grade 4	128
Grade 6	109
Grade 5	106
Primary Education	102
Grade 3	96
Grade 7	91
Kindergarten	61
Grade 9	48
Preschool Education	46
Grade 1	43
Grade 2	42
Adult Education	29
Grade 10	28
Grade 12	26

Audience

Researchers	32
Practitioners	15
Teachers	7
Students	4
Administrators	2
Counselors	2
Policymakers	1

Location

Turkey	94
Australia	89
Germany	79
United States	74
Netherlands	68
Taiwan	59
Indonesia	53
China	51
Canada	49
Japan	38
Florida	37
Hong Kong	37
United Kingdom (England)	34
South Korea	33
Malaysia	32
Singapore	31
Spain	29
United Kingdom	29
California	28
Iran	25
Italy	24
Brazil	21
Texas	21
Belgium	19
Nigeria	19

Laws, Policies, & Programs

Individuals with Disabilities…	13
No Child Left Behind Act 2001	12
Every Student Succeeds Act…	2
American Recovery and…	1
Education Consolidation…	1
Education Consolidation and…	1
Education for All Handicapped…	1
Elementary and Secondary…	1
Individuals with Disabilities…	1
Race to the Top	1
Reading Excellence Act	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	4
Meets WWC Standards with or without Reservations	4

Item Response Theory

Showing 76 to 90 of 5,553 results

Evaluating the Effects of Missing Data Handling Methods on Scale Linking Accuracy

Peer reviewed

Wu, Tong; Kim, Stella Y.; Westine, Carl – Educational and Psychological Measurement, 2023

For large-scale assessments, data are often collected with missing responses. Despite the wide use of item response theory (IRT) in many testing programs, however, the existing literature offers little insight into the effectiveness of various approaches to handling missing responses in the context of scale linking. Scale linking is commonly used…

Descriptors: Data Analysis, Responses, Statistical Analysis, Measurement

Constructing a Robust Score Scale from IRT Scores with Informed Boundaries

Peer reviewed

Choe, Edison M.; Han, Kyung T. – Journal of Educational Measurement, 2022

In operational testing, item response theory (IRT) models for dichotomous responses are popular for measuring a single latent construct θ, such as cognitive ability in a content domain. Estimates of θ, also called IRT scores or θ̂, can be computed using estimators based on the likelihood function, such as maximum likelihood…

Descriptors: Scores, Item Response Theory, Test Items, Test Format

Planning Missing Data Designs for Human Ratings in Creativity Research: A Practical Guide

Peer reviewed

Boris Forthmann; Benjamin Goecke; Roger E. Beaty – Creativity Research Journal, 2025

Human ratings are ubiquitous in creativity research. Yet, the process of rating responses to creativity tasks -- typically several hundred or thousands of responses, per rater -- is often time-consuming and expensive. Planned missing data designs, where raters only rate a subset of the total number of responses, have been recently proposed as one…

Descriptors: Creativity, Research, Researchers, Research Methodology

Correcting for Extreme Response Style: Model Choice Matters

Peer reviewed

Martijn Schoenmakers; Jesper Tijmstra; Jeroen Vermunt; Maria Bolsinova – Educational and Psychological Measurement, 2024

Extreme response style (ERS), the tendency of participants to select extreme item categories regardless of the item content, has frequently been found to decrease the validity of Likert-type questionnaire results. For this reason, various item response theory (IRT) models have been proposed to model ERS and correct for it. Comparisons of these…

Descriptors: Item Response Theory, Response Style (Tests), Models, Likert Scales

The Goodness of Fit Evaluation against Local Dependence in Polytomous IRT Models: What Global Fit Indices Can Tell Us?

Jiangqiong Li – ProQuest LLC, 2024

When measuring latent constructs, for example, language ability, we use statistical models to specify appropriate relationships between the latent construct and observed responses to test items. These models rely on theoretical assumptions to ensure accurate parameter estimates for valid inferences based on the test results. This dissertation…

Descriptors: Goodness of Fit, Item Response Theory, Models, Measurement Techniques

Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches

Peer reviewed

Güler Yavuz Temel – Journal of Educational Measurement, 2024

The purpose of this study was to investigate multidimensional DIF with a simple and nonsimple structure in the context of multidimensional Graded Response Model (MGRM). This study examined and compared the performance of the IRT-LR and Wald test using MML-EM and MHRM estimation approaches with different test factors and test structures in…

Descriptors: Computation, Multidimensional Scaling, Item Response Theory, Models

Item Response Theory Models for Difference-in-Difference Estimates (And Whether They Are Worth the Trouble)

Peer reviewed

James Soland – Journal of Research on Educational Effectiveness, 2024

When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…

Descriptors: Item Response Theory, Testing, Test Validity, Intervention

Latent Variable Forests for Latent Variable Score Estimation

Peer reviewed

Franz Classe; Christoph Kern – Educational and Psychological Measurement, 2024

We develop a "latent variable forest" (LV Forest) algorithm for the estimation of latent variable scores with one or more latent variables. LV Forest estimates unbiased latent variable scores based on "confirmatory factor analysis" (CFA) models with ordinal and/or numerical response variables. Through parametric model…

Descriptors: Algorithms, Item Response Theory, Artificial Intelligence, Factor Analysis

Rasch Modelling vs. Item Facility: Implications on the Validity of Assessments of Asian EFL/ESL Vocabulary Knowledge and Lexical Sophistication Modelling

Peer reviewed

Liang Ye Tan; Stuart McLean; Young Ae Kim; Joseph P. Vitta – Language Testing in Asia, 2024

This study examines how second/foreign language (L2) word difficulty estimates derived from item response theory (IRT) and classical test theory (CTT) frameworks are virtually identical in the context of vocabulary testing. This conclusion is reached via a two-stage process: (a) psychometric assessments of both approaches and (b) L2 word…

Descriptors: Vocabulary, English (Second Language), Test Validity, Second Language Learning

A Practical Guide to Power Analyses of Moderation Effects in Multisite Individual and Cluster Randomized Trials

Peer reviewed

Nianbo Dong; Benjamin Kelcey; Jessaca Spybrook; Yanli Xie; Dung Pham; Peilin Qiu; Ning Sui – Grantee Submission, 2024

Multisite trials that randomize individuals (e.g., students) within sites (e.g., schools) or clusters (e.g., teachers/classrooms) within sites (e.g., schools) are commonly used for program evaluation because they provide opportunities to learn about treatment effects as well as their heterogeneity across sites and subgroups (defined by moderating…

Descriptors: Statistical Analysis, Randomized Controlled Trials, Educational Research, Effect Size

Exploration of the Linear and Nonlinear Relationships between Learning Strategies and Mathematics Achievement in South Korea Using the Nominal Response Model: PISA 2012

Peer reviewed

Jiyoun Kim; Chia-Wen Chen; Yi-Jhen Wu – Large-scale Assessments in Education, 2024

Learning strategies have been recognized as important predictors of mathematical achievement. In recent studies, it has been found that Asian students use combined learning strategies, primarily including metacognitive strategies, rather than rote memorization. To the best of the authors' knowledge, there is only one prior study including South…

Descriptors: Achievement Tests, Foreign Countries, Learning Strategies, Mathematics Achievement

Reliability of the Commonly Used and Newly-Developed Autism Measures

Peer reviewed

Thomas W. Frazier; Andrew J. O. Whitehouse; Susan R. Leekam; Sarah J. Carrington; Gail A. Alvares; David W. Evans; Antonio Y. Hardan; Mirko Uljarevic – Journal of Autism and Developmental Disorders, 2024

Purpose: The aim of the present study was to compare scale and conditional reliability derived from item response theory analyses among the most commonly used, as well as several newly developed, observation, interview, and parent-report autism instruments. Methods: When available, data sets were combined to facilitate large sample evaluation.…

Descriptors: Test Reliability, Item Response Theory, Autism Spectrum Disorders, Clinical Diagnosis

Wald χ² Test for Differential Item Functioning Detection with Polytomous Items in Multilevel Data

Peer reviewed

Sijia Huang; Dubravka Svetina Valdivia – Educational and Psychological Measurement, 2024

Identifying items with differential item functioning (DIF) in an assessment is a crucial step for achieving equitable measurement. One critical issue that has not been fully addressed with existing studies is how DIF items can be detected when data are multilevel. In the present study, we introduced a Lord's Wald χ² test-based…

Descriptors: Item Analysis, Item Response Theory, Algorithms, Accuracy

Latent Class Analysis with Measurement Invariance Testing: Simulation Study to Compare Overall Likelihood Ratio vs Residual Fit Statistics Based Model Selection

Peer reviewed

Zsuzsa Bakk – Structural Equation Modeling: A Multidisciplinary Journal, 2024

A standard assumption of latent class (LC) analysis is conditional independence, that is, the items of the LC are independent of the covariates given the LCs. Several approaches have been proposed for identifying violations of this assumption. The recently proposed likelihood ratio approach is compared to residual statistics (bivariate residuals…

Descriptors: Goodness of Fit, Error of Measurement, Comparative Analysis, Models

Item Parameter Recovery: Sensitivity to Prior Distribution

Peer reviewed

Christine E. DeMars; Paulius Satkus – Educational and Psychological Measurement, 2024

Marginal maximum likelihood, a common estimation method for item response theory models, is not inherently a Bayesian procedure. However, due to estimation difficulties, Bayesian priors are often applied to the likelihood when estimating 3PL models, especially with small samples. Little focus has been placed on choosing the priors for marginal…

Descriptors: Item Response Theory, Statistical Distributions, Error of Measurement, Bayesian Statistics

Source

Educational and Psychological…	390
Applied Psychological…	386
Journal of Educational…	264
ProQuest LLC	261
Psychometrika	206
Journal of Educational and…	172
Applied Measurement in…	169
ETS Research Report Series	120
Measurement:…	114
Grantee Submission	100
International Journal of…	90
Online Submission	76
Educational Measurement:…	69
International Journal of…	59
Journal of Applied Measurement	53
Language Testing	51
Multivariate Behavioral…	51
Psychological Assessment	50
Journal of Psychoeducational…	49
Journal of Outcome Measurement	43
Behavioral Research and…	41
Practical Assessment,…	39
International Educational…	38
Educational Assessment	37
Language Assessment Quarterly	35

Publication Type

Journal Articles	4332
Reports - Research	3349
Reports - Evaluative	1297
Speeches/Meeting Papers	520
Reports - Descriptive	491
Dissertations/Theses -…	263
Tests/Questionnaires	123
Numerical/Quantitative Data	115
Opinion Papers	83
Information Analyses	62
Book/Product Reviews	22
Books	15
Collected Works - General	15
Guides - Non-Classroom	13
Collected Works - Proceedings	8
Non-Print Media	6
Reports - General	6
Reference Materials - General	4
Collected Works - Serials	3
Guides - General	3
ERIC Publications	2
Guides - Classroom - Learner	2
Reference Materials -…	2
Creative Works	1
Dissertations/Theses -…	1

Assessments and Surveys

Program for International…	111
National Assessment of…	82
Trends in International…	81
Early Childhood Longitudinal…	41
SAT (College Admission Test)	34
Test of English as a Foreign…	33
Law School Admission Test	29
ACT Assessment	24
Graduate Record Examinations	21
Peabody Picture Vocabulary…	16
Progress in International…	16
Raven Progressive Matrices	15
Iowa Tests of Basic Skills	14
International English…	10
Armed Services Vocational…	9
Advanced Placement…	8
Gates MacGinitie Reading Tests	8
Stanford Achievement Tests	8
Measures of Academic Progress	7
Wechsler Individual…	7
Child Behavior Checklist	6
Force Concept Inventory	6
Woodcock Johnson Tests of…	6
Early Childhood Environment…	5
Florida Comprehensive…	5