Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 26 |
Descriptor
Item Response Theory | 40 |
Models | 20 |
Simulation | 19 |
Test Items | 18 |
Computation | 13 |
Foreign Countries | 13 |
Evaluation Methods | 9 |
Test Bias | 9 |
Computer Software | 8 |
Correlation | 8 |
Test Length | 8 |
More ▼ |
Source
Author
Wang, Wen-Chung | 40 |
Jin, Kuan-Yu | 6 |
Wilson, Mark | 5 |
Huang, Hung-Yu | 4 |
Shih, Ching-Lin | 4 |
Chen, Po-Hsi | 3 |
Cheng, Ying-Yao | 3 |
Liu, Chen-Wei | 3 |
Chen, Cheng-Te | 2 |
Chen, Hsueh-Chu | 2 |
Qiu, Xue-Lan | 2 |
More ▼ |
Publication Type
Journal Articles | 37 |
Reports - Research | 23 |
Reports - Evaluative | 14 |
Reports - Descriptive | 2 |
Speeches/Meeting Papers | 2 |
Education Level
Higher Education | 4 |
Postsecondary Education | 4 |
Junior High Schools | 3 |
Middle Schools | 3 |
Secondary Education | 3 |
Elementary Education | 2 |
Grade 4 | 1 |
Grade 8 | 1 |
Intermediate Grades | 1 |
Audience
Location
Taiwan | 8 |
Hong Kong | 2 |
China | 1 |
United States | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Trends in International… | 2 |
Graduate Record Examinations | 1 |
Program for International… | 1 |
Students Evaluation of… | 1 |
Wechsler Adult Intelligence… | 1 |
What Works Clearinghouse Rating
Wang, Wen-Chung; Chyi-In, Wu – Educational and Psychological Measurement, 2004
Because of the requirement of reporting effect sizes and in the interest of measurement of change within the item response theory framework, their combination becomes a new issue. In the present study, repeated measures are decomposed as an initial ability and one or more modifiabilities (gain score) using a multidimensional Rasch model. The…
Descriptors: Simulation, Effect Size, Item Response Theory, Meta Analysis
Wang, Wen-Chung; Wilson, Mark; Shih, Ching-Lin – Journal of Educational Measurement, 2006
This study presents the random-effects rating scale model (RE-RSM) which takes into account randomness in the thresholds over persons by treating them as random-effects and adding a random variable for each threshold in the rating scale model (RSM) (Andrich, 1978). The RE-RSM turns out to be a special case of the multidimensional random…
Descriptors: Item Analysis, Rating Scales, Item Response Theory, Monte Carlo Methods
Wang, Wen-Chung – Educational and Psychological Measurement, 2004
The Pearson correlation is used to depict effect sizes in the context of item response theory. Amultidimensional Rasch model is used to directly estimate the correlation between latent traits. Monte Carlo simulations were conducted to investigate whether the population correlation could be accurately estimated and whether the bootstrap method…
Descriptors: Test Length, Sampling, Effect Size, Correlation
Su, Ya-Hui; Wang, Wen-Chung – Applied Measurement in Education, 2005
Simulations were conducted to investigate factors that influence the Mantel, generalized Mantel-Haenszel (GMH), and logistic discriminant function analysis (LDFA) methods in assessing differential item functioning (DIF) for polytomous items. The results show that the magnitude of DIF contamination in the matching score, as measured by the average…
Descriptors: Discriminant Analysis, Test Bias, Research Methodology, Test Items
Wang, Wen-Chung; Wilson, Mark – Educational and Psychological Measurement, 2005
This study presents a procedure for detecting differential item functioning (DIF) for dichotomous and polytomous items in testlet-based tests, whereby DIF is taken into account by adding DIF parameters into the Rasch testlet model. Simulations were conducted to assess recovery of the DIF and other parameters. Two independent variables, test type…
Descriptors: Test Format, Test Bias, Item Response Theory, Item Analysis
Wang, Wen-Chung; Cheng, Ying-Yao; Wilson, Mark – Educational and Psychological Measurement, 2005
A parallel design, in which items across different scales within an instrument share common stimuli and subjects respond to the common stimulus for each scale, is sometimes used in questionnaires or inventories. Because the items across scales share the same stimuli, the assumption of local item independence may not hold, thereby violating the…
Descriptors: Stimuli, Psychometrics, Test Items, Item Response Theory
Chen, Hsueh-Chu; Wang, Wen-Chung – 1999
Fifty core tasks that are generally performed by and important for secondary school beginning teachers are identified. Participants (n=297) were asked to judge the importance of each task. College students (n=476) were asked how confident they would be in doing these tasks as if they were beginning teachers. Rasch technique was used to scale the…
Descriptors: Beginning Teachers, College Students, Curriculum, Foreign Countries
Wang, Wen-Chung; Su, Ya-Hui – Applied Psychological Measurement, 2004
Eight independent variables (differential item functioning [DIF] detection method, purification procedure, item response model, mean latent trait difference between groups, test length, DIF pattern, magnitude of DIF, and percentage of DIF items) were manipulated, and two dependent variables (Type I error and power) were assessed through…
Descriptors: Test Length, Test Bias, Simulation, Item Response Theory
Wang, Wen-Chung; Chen, Cheng-Te – Educational and Psychological Measurement, 2005
This study investigates item parameter recovery, standard error estimates, and fit statistics yielded by the WINSTEPS program under the Rasch model and the rating scale model through Monte Carlo simulations. The independent variables were item response model, test length, and sample size. WINSTEPS yielded practically unbiased estimates for the…
Descriptors: Statistics, Test Length, Rating Scales, Item Response Theory
Wang, Wen-Chung; Chen, Hsueh-Chu – Educational and Psychological Measurement, 2004
As item response theory (IRT) becomes popular in educational and psychological testing, there is a need of reporting IRT-based effect size measures. In this study, we show how the standardized mean difference can be generalized into such a measure. A disattenuation procedure based on the IRT test reliability is proposed to correct the attenuation…
Descriptors: Test Reliability, Rating Scales, Sample Size, Error of Measurement