ERIC - Search Results

Publication Date

In 2025	0
Since 2024	5
Since 2021 (last 5 years)	31
Since 2016 (last 10 years)	67
Since 2006 (last 20 years)	114

Descriptor

Test Length	163
Item Response Theory	126
Test Items	76
Sample Size	70
Simulation	42
Computer Assisted Testing	33
Models	33
Adaptive Testing	32
Error of Measurement	32
Comparative Analysis	31
Accuracy	26
Latent Trait Theory	26
Test Construction	26
Item Analysis	25
Monte Carlo Methods	25
Statistical Analysis	25
Goodness of Fit	24
Test Reliability	24
Test Bias	23
Computation	22
Correlation	22
Difficulty Level	20
Scores	19
Maximum Likelihood Statistics	15
Test Format	15
More ▼

Publication Type

Reports - Research	163
Journal Articles	130
Speeches/Meeting Papers	18
Numerical/Quantitative Data	2
Reports - Evaluative	2
Guides - Non-Classroom	1
Information Analyses	1
Tests/Questionnaires	1

Education Level

Higher Education	6
Postsecondary Education	6
Elementary Secondary Education	3
Secondary Education	3
Elementary Education	2
Early Childhood Education	1
Grade 3	1
Grade 8	1
High Schools	1
Junior High Schools	1
Middle Schools	1
Preschool Education	1
More ▼

Audience

Researchers

Location

Taiwan	2
Turkey	2
Alabama	1
Australia	1
Colombia	1
Germany	1
Illinois (Chicago)	1
Indiana	1
Indonesia	1
Israel	1
Japan	1
Jordan	1
Michigan	1
Netherlands	1
Peru	1
Qatar	1
United Kingdom	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

MacArthur Communicative…	2
Center for Epidemiologic…	1
Iowa Tests of Basic Skills	1
Law School Admission Test	1
Medical College Admission Test	1
Otis Lennon School Ability…	1
Program for International…	1
SAT (College Admission Test)	1
School and College Ability…	1
Test of English as a Foreign…	1
Texas Assessment of Basic…	1
Texas Educational Assessment…	1
Trends in International…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 163 results Save | Export

A Comparison of the Efficacies of Differential Item Functioning Detection Methods

Peer reviewed
PDF on ERIC

Download full text

Basman, Munevver – International Journal of Assessment Tools in Education, 2023

To ensure the validity of the tests is to check that all items have similar results across different groups of individuals. However, differential item functioning (DIF) occurs when the results of individuals with equal ability levels from different groups differ from each other on the same test item. Based on Item Response Theory and Classic Test…

Descriptors: Test Bias, Test Items, Test Validity, Item Response Theory

Number of Response Categories and Sample Size Requirements in Polytomous IRT Models

Peer reviewed

Direct link

Dubravka Svetina Valdivia; Shenghai Dai – Journal of Experimental Education, 2024

Applications of polytomous IRT models in applied fields (e.g., health, education, psychology) are abound. However, little is known about the impact of the number of categories and sample size requirements for precise parameter recovery. In a simulation study, we investigated the impact of the number of response categories and required sample size…

Descriptors: Item Response Theory, Sample Size, Models, Classification

An Exponentially Weighted Moving Average Procedure for Detecting Back Random Responding Behavior

Peer reviewed

Direct link

He, Yinhong – Journal of Educational Measurement, 2023

Back random responding (BRR) behavior is one of the commonly observed careless response behaviors. Accurately detecting BRR behavior can improve test validities. Yu and Cheng (2019) showed that the change point analysis (CPA) procedure based on weighted residual (CPA-WR) performed well in detecting BRR. Compared with the CPA procedure, the…

Descriptors: Test Validity, Item Response Theory, Measurement, Monte Carlo Methods

What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model

Peer reviewed

Direct link

Fellinghauer, Carolina; Debelak, Rudolf; Strobl, Carolin – Educational and Psychological Measurement, 2023

This simulation study investigated to what extent departures from construct similarity as well as differences in the difficulty and targeting of scales impact the score transformation when scales are equated by means of concurrent calibration using the partial credit model with a common person design. Practical implications of the simulation…

Descriptors: True Scores, Equated Scores, Test Items, Sample Size

A Randomization P-Value Test for Detecting Copying on Multiple-Choice Exams

Peer reviewed

Direct link

Lang, Joseph B. – Journal of Educational and Behavioral Statistics, 2023

This article is concerned with the statistical detection of copying on multiple-choice exams. As an alternative to existing permutation- and model-based copy-detection approaches, a simple randomization p-value (RP) test is proposed. The RP test, which is based on an intuitive match-score statistic, makes no assumptions about the distribution of…

Descriptors: Identification, Cheating, Multiple Choice Tests, Item Response Theory

Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses

Peer reviewed

Direct link

Guo, Wenjing; Choi, Youn-Jeng – Educational and Psychological Measurement, 2023

Determining the number of dimensions is extremely important in applying item response theory (IRT) models to data. Traditional and revised parallel analyses have been proposed within the factor analysis framework, and both have shown some promise in assessing dimensionality. However, their performance in the IRT framework has not been…

Descriptors: Item Response Theory, Evaluation Methods, Factor Analysis, Guidelines

Accuracy and Sensitivity of Coefficient Alpha and Its Alternatives with Unidimensional and Contaminated Scales

Peer reviewed

Direct link

Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023

We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…

Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length

Type I Error and Power Rates: A Comparative Analysis of Techniques in Differential Item Functioning

Peer reviewed
PDF on ERIC

Download full text

Ayse Bilicioglu Gunes; Bayram Bicak – International Journal of Assessment Tools in Education, 2023

The main purpose of this study is to examine the Type I error and statistical power ratios of Differential Item Functioning (DIF) techniques based on different theories under different conditions. For this purpose, a simulation study was conducted by using Mantel-Haenszel (MH), Logistic Regression (LR), Lord's [chi-squared], and Raju's Areas…

Descriptors: Test Items, Item Response Theory, Error of Measurement, Test Bias

IRT Characteristic Curve Linking Methods Weighted by Information for Mixed-Format Tests

Peer reviewed

Direct link

Shaojie Wang; Won-Chan Lee; Minqiang Zhang; Lixin Yuan – Applied Measurement in Education, 2024

To reduce the impact of parameter estimation errors on IRT linking results, recent work introduced two information-weighted characteristic curve methods for dichotomous items. These two methods showed outstanding performance in both simulation and pseudo-form pseudo-group analysis. The current study expands upon the concept of information…

Descriptors: Item Response Theory, Test Format, Test Length, Error of Measurement

A Note on Improving Variational Estimation for Multidimensional Item Response Theory

Peer reviewed

Direct link

Chenchen Ma; Jing Ouyang; Chun Wang; Gongjun Xu – Grantee Submission, 2024

Survey instruments and assessments are frequently used in many domains of social science. When the constructs that these assessments try to measure become multifaceted, multidimensional item response theory (MIRT) provides a unified framework and convenient statistical tool for item analysis, calibration, and scoring. However, the computational…

Descriptors: Algorithms, Item Response Theory, Scoring, Accuracy

Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models

Peer reviewed

Direct link

Paek, Insu; Lin, Zhongtian; Chalmers, Robert Philip – Educational and Psychological Measurement, 2023

To reduce the chance of Heywood cases or nonconvergence in estimating the 2PL or the 3PL model in the marginal maximum likelihood with the expectation-maximization (MML-EM) estimation method, priors for the item slope parameter in the 2PL model or for the pseudo-guessing parameter in the 3PL model can be used and the marginal maximum a posteriori…

Descriptors: Models, Item Response Theory, Test Items, Intervals

Evaluation of Factors Affecting the Performance of the "S - X[superscript 2]" Item-Fit Index

Peer reviewed

Direct link

Kim, Hyung Jin; Lee, Won-Chan – Journal of Educational Measurement, 2022

Orlando and Thissen (2000) introduced the "S - X[superscript 2]" item-fit index for testing goodness-of-fit with dichotomous item response theory (IRT) models. This study considers and evaluates an alternative approach for computing "S - X[superscript 2]" values and other factors associated with collapsing tables of observed…

Descriptors: Goodness of Fit, Test Items, Item Response Theory, Computation

A Comparison of Common IRT Model-Selection Methods with Mixed-Format Tests

Peer reviewed

Direct link

Luo, Yong – Measurement: Interdisciplinary Research and Perspectives, 2021

To date, only frequentist model-selection methods have been studied with mixed-format data in the context of IRT model-selection, and it is unknown how popular Bayesian model-selection methods such as DIC, WAIC, and LOO perform. In this study, we present the results of a comprehensive simulation study that compared the performances of eight…

Descriptors: Item Response Theory, Test Format, Selection, Methods

Multidimensional Forced-Choice CAT with Dominance Items: An Empirical Comparison with Optimal Static Testing under Different Desirability Matching

Peer reviewed

Direct link

Lin, Yin; Brown, Anna; Williams, Paul – Educational and Psychological Measurement, 2023

Several forced-choice (FC) computerized adaptive tests (CATs) have emerged in the field of organizational psychology, all of them employing ideal-point items. However, despite most items developed historically follow dominance response models, research on FC CAT using dominance items is limited. Existing research is heavily dominated by…

Descriptors: Measurement Techniques, Computer Assisted Testing, Adaptive Testing, Industrial Psychology

Application of Change Point Analysis of Response Time Data to Detect Test Speededness

Peer reviewed

Direct link

Cheng, Ying; Shao, Can – Educational and Psychological Measurement, 2022

Computer-based and web-based testing have become increasingly popular in recent years. Their popularity has dramatically expanded the availability of response time data. Compared to the conventional item response data that are often dichotomous or polytomous, response time has the advantage of being continuous and can be collected in an…

Descriptors: Reaction Time, Test Wiseness, Computer Assisted Testing, Simulation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11

Educational and Psychological…	32
Applied Psychological…	15
Applied Measurement in…	12
Journal of Educational…	12
ETS Research Report Series	9
International Journal of…	5
Measurement:…	5
Educational Sciences: Theory…	4
Journal of Educational and…	4
Eurasian Journal of…	3
International Journal of…	3
Journal of Experimental…	3
Journal of Speech, Language,…	3
Psychometrika	3
Educational Measurement:…	2
Grantee Submission	2
ACT, Inc.	1
AERA Online Paper Repository	1
Anatomical Sciences Education	1
Asia Pacific Education Review	1
Education Sciences	1
International Journal of…	1
Journal of Applied Measurement	1
Journal of Intellectual…	1
Journal of Outcome Measurement	1
More ▼

Hambleton, Ronald K.	7
Lee, Won-Chan	5
Weiss, David J.	5
Lee, Yi-Hsuan	3
Reckase, Mark D.	3
Stark, Stephen	3
Svetina, Dubravka	3
Wang, Chun	3
Wells, Craig S.	3
Zhang, Jinming	3
Baris Pekmezci, Fulya	2
Bergstrom, Betty A.	2
Bulut, Okan	2
Cheng, Ying	2
Chernyshenko, Oleksandr S.	2
Chon, Kyong Hee	2
Chun Wang	2
DeMars, Christine E.	2
Dorans, Neil J.	2
Douglas, Jeffrey A.	2
Drasgow, Fritz	2
Frick, Theodore W.	2
Guo, Hongwen	2
Haladyna, Tom	2
More ▼