Publication Date
In 2025: 12
Since 2024: 187
Since 2021 (last 5 years): 818
Since 2016 (last 10 years): 1951
Since 2006 (last 20 years): 4074
Descriptor
Item Response Theory: 5553
Test Items: 1817
Foreign Countries: 1196
Models: 1148
Psychometrics: 918
Scores: 782
Comparative Analysis: 761
Test Construction: 750
Simulation: 740
Statistical Analysis: 659
Difficulty Level: 570
Author
Sinharay, Sandip: 48
Wilson, Mark: 45
Cohen, Allan S.: 43
Meijer, Rob R.: 43
Tindal, Gerald: 42
Wang, Wen-Chung: 40
Alonzo, Julie: 37
Ferrando, Pere J.: 36
Cai, Li: 35
van der Linden, Wim J.: 35
Glas, Cees A. W.: 34
Location
Turkey: 94
Australia: 89
Germany: 79
United States: 74
Netherlands: 68
Taiwan: 59
Indonesia: 53
China: 51
Canada: 49
Japan: 38
Florida: 37
What Works Clearinghouse Rating
Meets WWC Standards without Reservations: 4
Meets WWC Standards with or without Reservations: 4
Philippe Goldammer; Peter Lucas Stöckli; Yannik Andrea Escher; Hubert Annen; Klaus Jonas – Educational and Psychological Measurement, 2024
Indirect indices for faking detection in questionnaires make use of respondents' deviant or unlikely response patterns over the course of the questionnaire to identify them as fakers. Compared with established direct faking indices (i.e., lying and social desirability scales), indirect indices have at least two advantages: First, they cannot be…
Descriptors: Identification, Deception, Psychological Testing, Validity
Joakim Wallmark; James O. Ramsay; Juan Li; Marie Wiberg – Journal of Educational and Behavioral Statistics, 2024
Item response theory (IRT) models the relationship between the possible scores on a test item and a test taker's attainment of the latent trait that the item is intended to measure. In this study, we compare two models for tests with polytomously scored items: the optimal scoring (OS) model, a nonparametric IRT model based on the principles of…
Descriptors: Item Response Theory, Test Items, Models, Scoring
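The item response function described in the abstract above can be made concrete with a small sketch. The two-parameter logistic (2PL) model shown here is a standard parametric baseline, not the OS or GS models the paper studies, and the parameter values are illustrative only.

```python
import math

def irt_2pl(theta, a, b):
    """Probability of a correct response under the 2PL model:
    P(X = 1 | theta) = 1 / (1 + exp(-a * (theta - b))),
    where a is item discrimination and b is item difficulty."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# An examinee whose ability equals the item difficulty answers
# correctly with probability 0.5, regardless of discrimination.
p = irt_2pl(theta=0.0, a=1.2, b=0.0)
print(round(p, 3))  # 0.5
```

The function is monotone in theta, which is the core IRT assumption: higher trait levels imply higher expected item scores.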
Daniel Murphy; Sarah Quesen; Matthew Brunetti; Quintin Love – Educational Measurement: Issues and Practice, 2024
Categorical growth models describe examinee growth in terms of performance-level category transitions, which implies that some percentage of examinees will be misclassified. This paper introduces a new procedure for estimating the classification accuracy of categorical growth models, based on Rudner's classification accuracy index for item…
Descriptors: Classification, Growth Models, Accuracy, Performance Based Assessment
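Rudner's index, on which the procedure above builds, can be sketched for a single examinee under a normal approximation. The cut scores and standard error below are hypothetical, and this is a simplified single-time-point version, not the paper's categorical growth extension.

```python
import math

def normal_cdf(x):
    """Standard normal CDF via the error function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def rudner_accuracy(theta, se, cuts):
    """Rudner-style classification accuracy for one examinee:
    probability that true ability falls in the same performance
    category as the point estimate, assuming the estimate is
    Normal(theta, se)."""
    bounds = [-math.inf] + list(cuts) + [math.inf]
    k = sum(theta >= c for c in cuts)  # category of the point estimate
    lo, hi = bounds[k], bounds[k + 1]
    return normal_cdf((hi - theta) / se) - normal_cdf((lo - theta) / se)

# An estimate far from both cut scores is classified reliably;
# one sitting near a cut score is not.
print(round(rudner_accuracy(0.0, 0.3, [-1.0, 1.0]), 3))
print(round(rudner_accuracy(0.95, 0.3, [-1.0, 1.0]), 3))
```

Averaging this quantity over examinees gives an overall accuracy estimate; misclassification risk concentrates around the cut scores.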
Justin L. Kern – Journal of Educational and Behavioral Statistics, 2024
Given the frequent presence of slipping and guessing in item responses, models for the inclusion of their effects are highly important. Unfortunately, the most common model for their inclusion, the four-parameter item response theory model, potentially has severe deficiencies related to its possible unidentifiability. With this issue in mind, the…
Descriptors: Item Response Theory, Models, Bayesian Statistics, Generalization
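A minimal sketch of the four-parameter logistic model discussed above, with hypothetical parameter values: the lower asymptote c captures guessing and the upper asymptote d captures slipping, so predicted probabilities are squeezed into (c, d), which is one source of the estimation difficulties the abstract mentions.

```python
import math

def irt_4pl(theta, a, b, c, d):
    """Four-parameter logistic IRT model: lower asymptote c (guessing)
    and upper asymptote d (slipping) bound the response probability."""
    return c + (d - c) / (1.0 + math.exp(-a * (theta - b)))

# Even a very able examinee tops out near d (slipping effect),
# and a very weak one stays near c (guessing effect).
print(round(irt_4pl(6.0, 1.0, 0.0, 0.2, 0.95), 3))
print(round(irt_4pl(-6.0, 1.0, 0.0, 0.2, 0.95), 3))
```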
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation
Jiayi Deng – ProQuest LLC, 2024
Test score comparability in international large-scale assessments (LSA) is of utmost importance in measuring the effectiveness of education systems and understanding the impact of education on economic growth. To effectively compare test scores on an international scale, score linking is widely used to convert raw scores from different linguistic…
Descriptors: Item Response Theory, Scoring Rubrics, Scoring, Error of Measurement
William R. Nugent – Measurement: Interdisciplinary Research and Perspectives, 2024
Symmetry considerations are important in science, and Group Theory is a theory of symmetry. Classical Measurement Theory is the most used measurement theory in the social and behavioral sciences. In this article, the author uses Matrix Lie (Lee) group theory to formulate a measurement model. Symmetry is defined and illustrated using symmetries of…
Descriptors: Item Response Theory, Measurement Techniques, Models, Simulation
Jianbin Fu; Patrick C. Kyllonen; Xuan Tan – Measurement: Interdisciplinary Research and Perspectives, 2024
Users of forced-choice questionnaires (FCQs) to measure personality commonly assume statement parameter invariance across contexts -- between Likert and forced-choice (FC) items and between different FC items that share a common statement. In this paper, an empirical study was designed to check these two assumptions for an FCQ assessment measuring…
Descriptors: Measurement Techniques, Questionnaires, Personality Measures, Interpersonal Competence
Wind, Stefanie A.; Jones, Eli – Educational Researcher, 2019
Teacher evaluation systems often include classroom observations in which raters use rating scales to evaluate teachers' effectiveness. Recently, researchers have promoted the use of multifaceted approaches to investigating reliability using Generalizability theory, instead of rater reliability statistics. Generalizability theory allows analysts to…
Descriptors: Teacher Evaluation, Observation, Generalizability Theory, Item Response Theory
May, Toni A.; Koskey, Kristin L. K.; Bostic, Jonathan D.; Stone, Gregory E.; Kruse, Lance M.; Matney, Gabriel – School Science and Mathematics, 2023
Determining the most appropriate method of scoring an assessment is based on multiple factors, including the intended use of results, the assessment's purpose, and time constraints. Both the dichotomous and partial credit models have their advantages, yet direct comparisons of assessment outcomes from each method are not typical with constructed…
Descriptors: Scoring, Evaluation Methods, Problem Solving, Student Evaluation
Combs, Adam – Journal of Educational Measurement, 2023
A common method of checking person-fit in Bayesian item response theory (IRT) is the posterior-predictive (PP) method. In recent years, more powerful approaches have been proposed that are based on resampling methods using the popular l*[subscript z] statistic. There has also been proposed a new Bayesian model checking method based on pivotal…
Descriptors: Bayesian Statistics, Goodness of Fit, Evaluation Methods, Monte Carlo Methods
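As a rough illustration of the kind of person-fit statistic involved, here is the classical l[subscript z] statistic (the non-resampled precursor of the starred variant), computed for hypothetical response patterns and model-implied probabilities; the Bayesian and resampling machinery of the paper is not reproduced here.

```python
import math

def lz_person_fit(responses, probs):
    """Classical l_z person-fit statistic: the standardized
    log-likelihood of a 0/1 response pattern given model-implied
    success probabilities for each item."""
    l0 = sum(x * math.log(p) + (1 - x) * math.log(1 - p)
             for x, p in zip(responses, probs))
    mean = sum(p * math.log(p) + (1 - p) * math.log(1 - p) for p in probs)
    var = sum(p * (1 - p) * math.log(p / (1 - p)) ** 2 for p in probs)
    return (l0 - mean) / math.sqrt(var)

# Items ordered easy to hard. A pattern consistent with the
# probabilities scores higher than an aberrant (reversed) pattern;
# large negative values flag misfit.
probs = [0.9, 0.8, 0.6, 0.4, 0.2]
print(lz_person_fit([1, 1, 1, 0, 0], probs))
print(lz_person_fit([0, 0, 0, 1, 1], probs))
```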
Toker, Türker; Seidel, Kent – International Journal of Contemporary Educational Research, 2023
Although the Rasch model is used to measure latent traits such as attitude or ability, when there are multiple latent structures within the dataset it is best to use a technique called the Mixture Rasch Model (MRM), which is a combination of a Rasch model and a latent class analysis (LCA). This study used data from a survey for teachers, teacher…
Descriptors: Item Response Theory, Beginning Teachers, Teacher Competencies, Teacher Effectiveness
Weese, James D.; Turner, Ronna C.; Liang, Xinya; Ames, Allison; Crawford, Brandon – Educational and Psychological Measurement, 2023
A study was conducted to implement the use of a standardized effect size and corresponding classification guidelines for polytomous data with the POLYSIBTEST procedure and compare those guidelines with prior recommendations. Two simulation studies were included. The first identifies new unstandardized test heuristics for classifying moderate and…
Descriptors: Effect Size, Classification, Guidelines, Statistical Analysis
He, Yinhong – Journal of Educational Measurement, 2023
Back random responding (BRR) behavior is one of the commonly observed careless response behaviors. Accurately detecting BRR behavior can improve test validities. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…
Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods
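A change point analysis of residuals can be sketched as a maximum-contrast search over candidate change points. This toy version contrasts unweighted mean residuals before and after each point, not the weighted-residual statistic of CPA-WR, and the residual values are made up.

```python
def cpa_max_statistic(residuals):
    """Change-point sketch: for each candidate point k, contrast the
    mean residual before and after k; the largest absolute contrast
    locates a possible onset of back random responding."""
    n = len(residuals)
    best_k, best_stat = 0, 0.0
    for k in range(1, n):
        before = sum(residuals[:k]) / k
        after = sum(residuals[k:]) / (n - k)
        stat = abs(before - after)
        if stat > best_stat:
            best_k, best_stat = k, stat
    return best_k, best_stat

# Residuals near zero early, then consistently large: the maximum
# contrast falls at item index 5, where responding changes.
res = [0.1, -0.2, 0.0, 0.1, -0.1, 2.0, 1.8, 2.2, 1.9, 2.1]
k, stat = cpa_max_statistic(res)
print(k)  # 5
```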
van der Linden, Wim J.; Belov, Dmitry I. – Journal of Educational Measurement, 2023
A test of item compromise is presented which combines the test takers' responses and response times (RTs) into a statistic defined as the number of correct responses on the item for test takers with RTs flagged as suspicious. The test has null and alternative distributions belonging to the well-known family of compound binomial distributions, is…
Descriptors: Item Response Theory, Reaction Time, Test Items, Item Analysis
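The logic of the statistic can be sketched with an ordinary binomial tail test: among test takers whose response times on the item were flagged as suspicious, count correct answers and ask how likely that count is under no compromise. The paper's actual null distribution is compound binomial with examinee-specific probabilities, so this is a simplification with made-up numbers.

```python
import math

def binom_sf(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p), computed exactly."""
    return sum(math.comb(n, i) * p**i * (1 - p)**(n - i)
               for i in range(k, n + 1))

def compromise_test(correct_flagged, n_flagged, p_correct):
    """Simplified item-compromise check: tail probability of observing
    at least correct_flagged correct responses among n_flagged
    RT-flagged test takers, given expected correct rate p_correct."""
    return binom_sf(correct_flagged, n_flagged, p_correct)

# 18 of 20 flagged examinees correct, on an item with a 50% expected
# correct rate, is very unlikely under the no-compromise null.
pval = compromise_test(18, 20, 0.5)
print(pval < 0.001)  # True
```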