NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 31 to 44 of 44 results Save | Export
Peer reviewed Peer reviewed
Kim, Seock-Ho; Cohen, Allan S. – Applied Psychological Measurement, 1998
Investigated Type I error rates of the likelihood-ratio test for the detection of differential item functioning (DIF) using Monte Carlo simulations under the graded-response model. Type I error rates were within theoretically expected values for all six combinations of sample sizes and ability-matching conditions at each of the nominal alpha…
Descriptors: Ability, Item Bias, Item Response Theory, Monte Carlo Methods
Peer reviewed Peer reviewed
Cohen, Allan S.; And Others – Applied Psychological Measurement, 1996
Type I error rates for the likelihood ratio test for detecting differential item functioning (DIF) were investigated using Monte Carlo simulations. Type I error rates for the two-parameter model were within theoretically expected values at each alpha level, but those for the three-parameter model were not. (SLD)
Descriptors: Identification, Item Bias, Item Response Theory, Maximum Likelihood Statistics
Peer reviewed Peer reviewed
Cohen, Allan S.; And Others – Applied Psychological Measurement, 1993
Three measures of differential item functioning for the dichotomous response model are extended to include Samejima's graded response model. Two are based on area differences between item true score functions, and one is a chi-square statistic for comparing differences in item parameters. (SLD)
Descriptors: Chi Square, Comparative Analysis, Identification, Item Bias
Peer reviewed Peer reviewed
Cohen, Allan S.; Kim, Seock-Ho – Applied Psychological Measurement, 1993
The effectiveness of two statistical tests of the area between item response functions (exact signed area and exact unsigned area) estimated in different samples, a measure of differential item functioning (DIF), was compared with Lord's chi square. Lord's chi square was found the most effective in determining DIF. (SLD)
Descriptors: Chi Square, Comparative Analysis, Equations (Mathematics), Estimation (Mathematics)
Kim, Seock-Ho; Cohen, Allan S. – 1997
Applications of item response theory to practical testing problems including equating, differential item functioning, and computerized adaptive testing, require that item parameter estimates be placed onto a common metric. In this study, two methods for developing a common metric for the graded response model under item response theory were…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Equated Scores
Peer reviewed Peer reviewed
Kim, Seock-Ho; Cohen, Allan S. – Applied Measurement in Education, 1995
Three procedures for the detection of differential item functioning under item response theory were compared. Data for 2 forms of a mathematics test taken by 1,490 college students were analyzed through F. M. Lord's chi-square, N. S. Raju's area measures, and the likelihood ratio test. (SLD)
Descriptors: Chi Square, College Students, Comparative Analysis, Higher Education
Kim, Seock-Ho; Cohen, Allan S. – 1991
Studies of differential item functioning (DIF) under item response theory require that item parameter estimates be placed on the same metric before comparisons can be made. Evidence that methods for linking metrics may be influenced by the presence of differentially functioning items has been inconsistent. The effects of three methods for linking…
Descriptors: Chi Square, Comparative Analysis, Equations (Mathematics), Estimation (Mathematics)
Peer reviewed Peer reviewed
Kim, Seock-Ho; Cohen, Allan S. – Journal of Educational Measurement, 1992
Effects of the following methods for linking metrics on detection of differential item functioning (DIF) were compared: (1) test characteristic curve method (TCC); (2) weighted mean and sigma method; and (3) minimum chi-square method. With large samples, results were essentially the same. With small samples, TCC was most accurate. (SLD)
Descriptors: Chi Square, Comparative Analysis, Equations (Mathematics), Estimation (Mathematics)
Kim, Seock-Ho; Cohen, Allan S. – 1996
Applications of item response theory to practical testing problems including equating, differential item functioning, and computerized adaptive testing, require that item parameter estimates be placed onto a common metric. In this study, three methods for developing a common metric under item response theory are compared: (1) linking separate…
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Difficulty Level
Cohen, Allan S.; Kappy, Kathleen A. – 1980
The ability of the Rasch model to provide item difficulties and achievement test scores which are invariant is studied. Data for the study were obtained from students in grades 3 through 7 who took the Sequential Tests of Educational Progress (STEP III) Reading and Mathematics Concepts tests during a spring norming study. Each test contained 50…
Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Item Analysis
Cohen, Allan S.; Kim, Seock-Ho – 1993
Equating tests from different calibrations under item response theory (IRT) requires calculation of the slope and intercept of the appropriate linear transformation. Two methods have been proposed recently for equating graded response items under IRT, a test characteristic curve method and a minimum chi-square method. These two methods are…
Descriptors: Chi Square, Comparative Analysis, Computer Simulation, Equated Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Cohen, Allan S.; Gregg, Noel; Deng, Meng – Learning Disabilities Research & Practice, 2005
The premise of a great deal of current research guiding policy development has been that accommodations are the catalyst for student performance differences. Rather than accepting this premise, two studies were conducted to investigate the influence of extended time and content knowledge on the performance of ninth-grade students who took a…
Descriptors: Program Effectiveness, Mathematics Tests, Learning Disabilities, Testing Accommodations
Peer reviewed Peer reviewed
Kim, Seock-Ho; Cohen, Allan S. – Applied Psychological Measurement, 1991
The exact and closed-interval area measures for detecting differential item functioning are compared for actual data from 1,000 African-American and 1,000 white college students taking a vocabulary test with items intentionally constructed to favor 1 set of examinees. No real differences in detection of biased items were found. (SLD)
Descriptors: Black Students, College Students, Comparative Testing, Equations (Mathematics)
Peer reviewed Peer reviewed
Cohen, Allan S.; And Others – Journal of Educational Measurement, 1991
Detecting differential item functioning (DIF) on test items constructed to favor 1 group over another was investigated on parameter estimates from 2 item response theory-based computer programs--BILOG and LOGIST--using data for 1,000 White and 1,000 Black college students. Use of prior distributions and marginal-maximum a posteriori estimation is…
Descriptors: Black Students, College Students, Computer Assisted Testing, Equations (Mathematics)
« Previous Page | Next Page
Pages: 1  |  2  |  3