Showing 1 to 15 of 38 results
Peer reviewed
Gerhard Tutz; Pascal Jordan – Journal of Educational and Behavioral Statistics, 2024
A general framework of latent trait item response models for continuous responses is given. In contrast to classical test theory (CTT) models, which traditionally distinguish between true scores and error scores, the responses are clearly linked to latent traits. It is shown that CTT models can be derived as special cases, but the model class is…
Descriptors: Item Response Theory, Responses, Scores, Models
Peer reviewed
PDF on ERIC
Kartianom Kartianom; Heri Retnawati; Kana Hidayati – Journal of Pedagogical Research, 2024
Conducting a fair test is important for educational research. Unfair assessments can lead to gender disparities in academic achievement, ultimately resulting in disparities in opportunities, wages, and career choices. Differential Item Functioning (DIF) analysis is presented to provide evidence of whether the test is truly fair, where it does not harm…
Descriptors: Foreign Countries, Test Bias, Item Response Theory, Test Theory
Peer reviewed
Joseph A. Rios; Jiayi Deng – Educational and Psychological Measurement, 2024
Rapid guessing (RG) is a form of non-effortful responding that is characterized by short response latencies. This construct-irrelevant behavior has been shown in previous research to bias inferences concerning measurement properties and scores. To mitigate these deleterious effects, a number of response time threshold scoring procedures have been…
Descriptors: Reaction Time, Scores, Item Response Theory, Guessing (Tests)
Peer reviewed
Jianbin Fu; Xuan Tan; Patrick C. Kyllonen – Journal of Educational Measurement, 2024
This paper presents the item and test information functions of the Rank two-parameter logistic models (Rank-2PLM) for items with two (pair) and three (triplet) statements in forced-choice questionnaires. The Rank-2PLM model for pairs is the MUPP-2PLM (Multi-Unidimensional Pairwise Preference) and, for triplets, is the Triplet-2PLM. Fisher's…
Descriptors: Questionnaires, Test Items, Item Response Theory, Models
Peer reviewed
Kylie Gorney; Sandip Sinharay – Educational and Psychological Measurement, 2025
Test-takers, policymakers, teachers, and institutions are increasingly demanding that testing programs provide more detailed feedback regarding test performance. As a result, there has been a growing interest in the reporting of subscores that potentially provide such detailed feedback. Haberman developed a method based on classical test theory…
Descriptors: Scores, Test Theory, Test Items, Testing
Rachel L. Schechter; Anna Robinson; Manvi Teki – Online Submission, 2024
This study investigates the impact of the MindPlay Reading program on student literacy achievement in Dayton City Schools, Ohio, during the 2021-2022 academic year. A correlational analysis was conducted in collaboration with LXD Research to examine the relationship between MindPlay usage and student outcomes on literacy assessments. The sample…
Descriptors: Achievement Tests, Cognitive Processes, Play, Theory of Mind
Huan Liu – ProQuest LLC, 2024
In many large-scale testing programs, examinees are frequently categorized into different performance levels. These classifications are then used to make high-stakes decisions about examinees in contexts such as licensure, certification, and educational assessment. Numerous approaches to estimating the consistency and accuracy of this…
Descriptors: Classification, Accuracy, Item Response Theory, Decision Making
Peer reviewed
James Soland – Journal of Research on Educational Effectiveness, 2024
When randomized control trials are not possible, quasi-experimental methods often represent the gold standard. One quasi-experimental method is difference-in-difference (DiD), which compares changes in outcomes before and after treatment across groups to estimate a causal effect. DiD researchers often use fairly exhaustive robustness checks to…
Descriptors: Item Response Theory, Testing, Test Validity, Intervention
Peer reviewed
Franz Classe; Christoph Kern – Educational and Psychological Measurement, 2024
We develop a "latent variable forest" (LV Forest) algorithm for the estimation of latent variable scores with one or more latent variables. LV Forest estimates unbiased latent variable scores based on "confirmatory factor analysis" (CFA) models with ordinal and/or numerical response variables. Through parametric model…
Descriptors: Algorithms, Item Response Theory, Artificial Intelligence, Factor Analysis
Peer reviewed
Sarah Alahmadi; Christine E. DeMars – Applied Measurement in Education, 2024
Large-scale educational assessments are sometimes considered low-stakes, increasing the possibility of confounding true performance level with low motivation. These concerns are amplified in remote testing conditions. To remove the effects of low effort levels in responses observed in remote low-stakes testing, several motivation filtering methods…
Descriptors: Multiple Choice Tests, Item Response Theory, College Students, Scores
Peer reviewed
Hojung Kim; Changkyung Song; Jiyoung Kim; Hyeyun Jeong; Jisoo Park – Language Testing in Asia, 2024
This study presents a modified version of the Korean Elicited Imitation (EI) test, designed to resemble natural spoken language, and validates its reliability as a measure of proficiency. The study assesses the correlation between average test scores and Test of Proficiency in Korean (TOPIK) levels, examining score distributions among beginner,…
Descriptors: Korean, Test Validity, Test Reliability, Imitation
Peer reviewed
Jianbin Fu; Patrick C. Kyllonen; Xuan Tan – Measurement: Interdisciplinary Research and Perspectives, 2024
Users of forced-choice questionnaires (FCQs) to measure personality commonly assume statement parameter invariance across contexts -- between Likert and forced-choice (FC) items and between different FC items that share a common statement. In this paper, an empirical study was designed to check these two assumptions for an FCQ assessment measuring…
Descriptors: Measurement Techniques, Questionnaires, Personality Measures, Interpersonal Competence
Peer reviewed
Jennifer Randall; Mya Poe; Maria Elena Oliveri; David Slomp – Educational Assessment, 2024
Traditional validation approaches fail to account for the ways oppressive systems (e.g., racism, radical nationalism) impact the test design and development process. To disrupt this legacy of white supremacy, we illustrate how a justice-oriented, antiracist validation (JAV) framework can be applied to construct articulation and validation, data…
Descriptors: Social Justice, Racism, Educational Assessment, Models
Peer reviewed
Kelly Edwards; James Soland – Educational Assessment, 2024
Classroom observational protocols, in which raters observe and score the quality of teachers' instructional practices, are often used to evaluate teachers for consequential purposes despite evidence that scores from such protocols are frequently driven by factors, such as rater and temporal effects, that have little to do with teacher quality. In…
Descriptors: Classroom Observation Techniques, Teacher Evaluation, Accuracy, Scores
Peer reviewed
Matthew J. Madison; Stefanie Wind; Lientje Maas; Kazuhiro Yamaguchi; Sergio Haab – Grantee Submission, 2024
Diagnostic classification models (DCMs) are psychometric models designed to classify examinees according to their proficiency or nonproficiency of specified latent characteristics. These models are well suited for providing diagnostic and actionable feedback to support intermediate and formative assessment efforts. Several DCMs have been developed…
Descriptors: Diagnostic Tests, Classification, Models, Psychometrics