Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 26 |
Descriptor
Item Response Theory | 40 |
Models | 20 |
Simulation | 19 |
Test Items | 18 |
Computation | 13 |
Foreign Countries | 13 |
Evaluation Methods | 9 |
Test Bias | 9 |
Computer Software | 8 |
Correlation | 8 |
Test Length | 8 |
Author
Wang, Wen-Chung | 40 |
Jin, Kuan-Yu | 6 |
Wilson, Mark | 5 |
Huang, Hung-Yu | 4 |
Shih, Ching-Lin | 4 |
Chen, Po-Hsi | 3 |
Cheng, Ying-Yao | 3 |
Liu, Chen-Wei | 3 |
Chen, Cheng-Te | 2 |
Chen, Hsueh-Chu | 2 |
Qiu, Xue-Lan | 2 |
Publication Type
Journal Articles | 37 |
Reports - Research | 23 |
Reports - Evaluative | 14 |
Reports - Descriptive | 2 |
Speeches/Meeting Papers | 2 |
Education Level
Higher Education | 4 |
Postsecondary Education | 4 |
Junior High Schools | 3 |
Middle Schools | 3 |
Secondary Education | 3 |
Elementary Education | 2 |
Grade 4 | 1 |
Grade 8 | 1 |
Intermediate Grades | 1 |
Location
Taiwan | 8 |
Hong Kong | 2 |
China | 1 |
United States | 1 |
Assessments and Surveys
Trends in International… | 2 |
Graduate Record Examinations | 1 |
Program for International… | 1 |
Students Evaluation of… | 1 |
Wechsler Adult Intelligence… | 1 |
Shih, Ching-Lin; Wang, Wen-Chung – Applied Psychological Measurement, 2009
The multiple indicators, multiple causes (MIMIC) method with a pure short anchor was proposed to detect differential item functioning (DIF). A simulation study showed that the MIMIC method with an anchor of 1, 2, 4, or 10 DIF-free items yielded a well-controlled Type I error rate even when such tests contained as many as 40% DIF items. In general,…
Descriptors: Test Bias, Simulation, Methods, Factor Analysis
Wang, Wen-Chung; Jin, Kuan-Yu – Educational and Psychological Measurement, 2010
In this study, the authors extend the standard item response model with internal restrictions on item difficulty (MIRID) to fit polytomous items using cumulative logits and adjacent-category logits. Moreover, the new model incorporates discrimination parameters and is rooted in a multilevel framework. It is a nonlinear mixed model so that existing…
Descriptors: Difficulty Level, Test Items, Item Response Theory, Generalization
Wang, Wen-Chung; Liu, Chen-Wei – Educational and Psychological Measurement, 2011
The generalized graded unfolding model (GGUM) has been recently developed to describe item responses to Likert items (agree-disagree) in attitude measurement. In this study, the authors (a) developed two item selection methods in computerized classification testing under the GGUM, the current estimate/ability confidence interval method and the cut…
Descriptors: Computer Assisted Testing, Adaptive Testing, Classification, Item Response Theory
Huang, Hung-Yu; Wang, Wen-Chung – Educational and Psychological Measurement, 2014
In the social sciences, latent traits often have a hierarchical structure, and data can be sampled from multiple levels. Both hierarchical latent traits and multilevel data can occur simultaneously. In this study, we developed a general class of item response theory models to accommodate both hierarchical latent traits and multilevel data. The…
Descriptors: Item Response Theory, Hierarchical Linear Modeling, Computation, Test Reliability
Wang, Wen-Chung; Jin, Kuan-Yu – Applied Psychological Measurement, 2010
In this study, all the advantages of slope parameters, random weights, and latent regression are acknowledged when dealing with component and composite items by adding slope parameters and random weights into the standard item response model with internal restrictions on item difficulty and formulating this new model within a multilevel framework…
Descriptors: Test Items, Difficulty Level, Regression (Statistics), Generalization
Cheng, Ying-Yao; Wang, Wen-Chung; Ho, Yi-Hui – Educational and Psychological Measurement, 2009
Educational and psychological tests are often composed of multiple short subtests, each measuring a distinct latent trait. Unfortunately, short subtests suffer from low measurement precision, which makes the bandwidth-fidelity dilemma inevitable. In this study, the authors demonstrate how a multidimensional Rasch analysis can be employed to take…
Descriptors: Item Response Theory, Measurement, Correlation, Measures (Individuals)
Wang, Wen-Chung; Shih, Ching-Lin; Yang, Chih-Chien – Educational and Psychological Measurement, 2009
This study implements a scale purification procedure onto the standard MIMIC method for differential item functioning (DIF) detection and assesses its performance through a series of simulations. It is found that the MIMIC method with scale purification (denoted as M-SP) outperforms the standard MIMIC method (denoted as M-ST) in controlling…
Descriptors: Test Items, Measures (Individuals), Test Bias, Evaluation Research
Wang, Wen-Chung – Applied Psychological Measurement, 2008
Raju and Oshima (2005) proposed two prophecy formulas based on item response theory in order to predict the reliability of ability estimates for a test after change in its length. The first prophecy formula is equivalent to the classical Spearman-Brown prophecy formula. The second prophecy formula is misleading because of an underlying false…
Descriptors: Test Reliability, Item Response Theory, Computation, Evaluation Methods
Effects of Ignoring Item Interaction on Item Parameter Estimation and Detection of Interacting Items
Chen, Cheng-Te; Wang, Wen-Chung – Applied Psychological Measurement, 2007
This study explores the effects of ignoring item interaction on item parameter estimation and the efficiency of using the local dependence index Q3 and the SAS NLMIXED procedure to detect item interaction under the three-parameter logistic model and the generalized partial credit model. Through simulations, it was found that ignoring…
Descriptors: Models, Item Response Theory, Simulation, Generalization
Wang, Wen-Chung; Liu, Chih-Yu – Educational and Psychological Measurement, 2007
In this study, the authors develop a generalized multilevel facets model, which is not only a multilevel and two-parameter generalization of the facets model, but also a multilevel and facet generalization of the generalized partial credit model. Because the new model is formulated within a framework of nonlinear mixed models, no efforts are…
Descriptors: Generalization, Item Response Theory, Models, Equipment
Wang, Wen-Chung; Chen, Po-Hsi; Cheng, Ying-Yao – Psychological Methods, 2004
A conventional way to analyze item responses in multiple tests is to apply unidimensional item response models separately, one test at a time. This unidimensional approach, which ignores the correlations between latent traits, yields imprecise measures when tests are short. To resolve this problem, one can use multidimensional item response models…
Descriptors: Item Response Theory, Test Items, Testing, Test Validity
Wang, Wen-Chung – Journal of Experimental Education, 2004
Scale indeterminacy in analysis of differential item functioning (DIF) within the framework of item response theory can be resolved by imposing 3 anchor item methods: the equal-mean-difficulty method, the all-other anchor item method, and the constant anchor item method. In this article, applicability and limitations of these 3 methods are…
Descriptors: Test Bias, Models, Item Response Theory, Comparative Analysis
Wang, Wen-Chung; Wilson, Mark – Applied Psychological Measurement, 2005
The Rasch testlet model for both dichotomous and polytomous items in testlet-based tests is proposed. It can be viewed as a special case of the multidimensional random coefficients multinomial logit model (MRCMLM). Therefore, the estimation procedures for the MRCMLM can be directly applied. Simulations were conducted to examine parameter recovery…
Descriptors: Test Construction, Item Response Theory, Measurement Techniques, Models
Wang, Wen-Chung – 1998
The conventional two-group differential item functioning (DIF) analysis is extended to an analysis of variance-like (ANOVA-like) DIF analysis where multiple factors with multiple groups are compared simultaneously. Moreover, DIF is treated as a parameter to be estimated rather than simply a sign to be detected. This proposed approach allows the…
Descriptors: Analysis of Variance, Foreign Countries, Item Bias, Item Response Theory
Wang, Wen-Chung; Wilson, Mark – Applied Psychological Measurement, 2005
The random-effects facet model that deals with local item dependence in many-facet contexts is presented. It can be viewed as a special case of the multidimensional random coefficients multinomial logit model (MRCMLM) so that the estimation procedures for the MRCMLM can be directly applied. Simulations were conducted to examine parameter recovery…
Descriptors: Test Reliability, Item Response Theory, Interrater Reliability, Rating Scales