ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	4

Descriptor

Item Response Theory	8
Test Length	8
Computation	4
Correlation	3
Error of Measurement	3
Evaluation Methods	3
Foreign Countries	3
Simulation	3
Test Items	3
Adaptive Testing	2
Classification	2
Computer Assisted Testing	2
Effect Size	2
Likert Scales	2
Models	2
Rating Scales	2
Sample Size	2
Statistical Analysis	2
Test Bias	2
Bayesian Statistics	1
Cognitive Style	1
Computer Software	1
Cutting Scores	1
Data Analysis	1
Error Patterns	1
More ▼

Source

Educational and Psychological…	6
Applied Psychological…	2

Author

Wang, Wen-Chung	8
Liu, Chen-Wei	2
Chen, Cheng-Te	1
Chen, Hsueh-Chu	1
Cheng, Ying-Yao	1
Chou, Yeh-Tai	1
Ho, Yi-Hui	1
Su, Ya-Hui	1
Wu, Shiu-Lien	1

Publication Type

Journal Articles	8
Reports - Evaluative	5
Reports - Research	2
Reports - Descriptive	1

Education Level

Audience

Location

Taiwan

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 8 results Save | Export

The Random-Threshold Generalized Unfolding Model and Its Application of Computerized Adaptive Testing

Peer reviewed

Direct link

Wang, Wen-Chung; Liu, Chen-Wei; Wu, Shiu-Lien – Applied Psychological Measurement, 2013

The random-threshold generalized unfolding model (RTGUM) was developed by treating the thresholds in the generalized unfolding model as random effects rather than fixed effects to account for the subjective nature of the selection of categories in Likert items. The parameters of the new model can be estimated with the JAGS (Just Another Gibbs…

Descriptors: Computer Assisted Testing, Adaptive Testing, Models, Bayesian Statistics

Checking Dimensionality in Item Response Models with Principal Component Analysis on Standardized Residuals

Peer reviewed

Direct link

Chou, Yeh-Tai; Wang, Wen-Chung – Educational and Psychological Measurement, 2010

Dimensionality is an important assumption in item response theory (IRT). Principal component analysis on standardized residuals has been used to check dimensionality, especially under the family of Rasch models. It has been suggested that an eigenvalue greater than 1.5 for the first eigenvalue signifies a violation of unidimensionality when there…

Descriptors: Test Length, Sample Size, Correlation, Item Response Theory

Computerized Classification Testing under the Generalized Graded Unfolding Model

Peer reviewed

Direct link

Wang, Wen-Chung; Liu, Chen-Wei – Educational and Psychological Measurement, 2011

The generalized graded unfolding model (GGUM) has been recently developed to describe item responses to Likert items (agree-disagree) in attitude measurement. In this study, the authors (a) developed two item selection methods in computerized classification testing under the GGUM, the current estimate/ability confidence interval method and the cut…

Descriptors: Computer Assisted Testing, Adaptive Testing, Classification, Item Response Theory

Multidimensional Rasch Analysis of a Psychological Test with Multiple Subtests: A Statistical Solution for the Bandwidth-Fidelity Dilemma

Peer reviewed

Direct link

Cheng, Ying-Yao; Wang, Wen-Chung; Ho, Yi-Hui – Educational and Psychological Measurement, 2009

Educational and psychological tests are often composed of multiple short subtests, each measuring a distinct latent trait. Unfortunately, short subtests suffer from low measurement precision, which makes the bandwidth-fidelity dilemma inevitable. In this study, the authors demonstrate how a multidimensional Rasch analysis can be employed to take…

Descriptors: Item Response Theory, Measurement, Correlation, Measures (Individuals)

Direct Estimation of Correlation as a Measure of Association Strength Using Multidimensional Item Response Models

Peer reviewed

Direct link

Wang, Wen-Chung – Educational and Psychological Measurement, 2004

The Pearson correlation is used to depict effect sizes in the context of item response theory. Amultidimensional Rasch model is used to directly estimate the correlation between latent traits. Monte Carlo simulations were conducted to investigate whether the population correlation could be accurately estimated and whether the bootstrap method…

Descriptors: Test Length, Sampling, Effect Size, Correlation

Factors Influencing the Mantel and Generalized Mantel-Haenszel Methods for the Assessment of Differential Item Functioning in Polytomous Items

Peer reviewed

Direct link

Wang, Wen-Chung; Su, Ya-Hui – Applied Psychological Measurement, 2004

Eight independent variables (differential item functioning [DIF] detection method, purification procedure, item response model, mean latent trait difference between groups, test length, DIF pattern, magnitude of DIF, and percentage of DIF items) were manipulated, and two dependent variables (Type I error and power) were assessed through…

Descriptors: Test Length, Test Bias, Simulation, Item Response Theory

Item Parameter Recovery, Standard Error Estimates, and Fit Statistics of the Winsteps Program for the Family of Rasch Models

Peer reviewed

Direct link

Wang, Wen-Chung; Chen, Cheng-Te – Educational and Psychological Measurement, 2005

This study investigates item parameter recovery, standard error estimates, and fit statistics yielded by the WINSTEPS program under the Rasch model and the rating scale model through Monte Carlo simulations. The independent variables were item response model, test length, and sample size. WINSTEPS yielded practically unbiased estimates for the…

Descriptors: Statistics, Test Length, Rating Scales, Item Response Theory

The Standardized Mean Difference within the Framework of Item Response Theory

Peer reviewed

Direct link

Wang, Wen-Chung; Chen, Hsueh-Chu – Educational and Psychological Measurement, 2004

As item response theory (IRT) becomes popular in educational and psychological testing, there is a need of reporting IRT-based effect size measures. In this study, we show how the standardized mean difference can be generalized into such a measure. A disattenuation procedure based on the IRT test reliability is proposed to correct the attenuation…

Descriptors: Test Reliability, Rating Scales, Sample Size, Error of Measurement