NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 12 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Stella Y.; Lee, Won-Chan – Journal of Educational Measurement, 2023
The current study proposed several variants of simple-structure multidimensional item response theory equating procedures. Four distinct sets of data were used to demonstrate feasibility of proposed equating methods for two different equating designs: a random groups design and a common-item nonequivalent groups design. Findings indicated some…
Descriptors: Item Response Theory, Equated Scores, Monte Carlo Methods, Research Methodology
Peer reviewed Peer reviewed
Direct linkDirect link
Sinharay, Sandip – Journal of Educational Measurement, 2016
De la Torre and Deng suggested a resampling-based approach for person-fit assessment (PFA). The approach involves the use of the [math equation unavailable] statistic, a corrected expected a posteriori estimate of the examinee ability, and the Monte Carlo (MC) resampling method. The Type I error rate of the approach was closer to the nominal level…
Descriptors: Sampling, Research Methodology, Error Patterns, Monte Carlo Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Cui, Ying; Leighton, Jacqueline P. – Journal of Educational Measurement, 2009
In this article, we introduce a person-fit statistic called the hierarchy consistency index (HCI) to help detect misfitting item response vectors for tests developed and analyzed based on a cognitive model. The HCI ranges from -1.0 to 1.0, with values close to -1.0 indicating that students respond unexpectedly or differently from the responses…
Descriptors: Test Length, Simulation, Correlation, Research Methodology
Peer reviewed Peer reviewed
Kane, Michael – Journal of Educational Measurement, 2002
Reviews the criticisms of sampling assumptions in generalizability theory (and in reliability theory) and examines the feasibility of using representative sampling, stratification, homogeneity assumptions, and replications to address these criticisms. Suggests some general outlines for the conduct of generalizability theory studies. (SLD)
Descriptors: Generalizability Theory, Reliability, Research Methodology, Sampling
Peer reviewed Peer reviewed
Stenner, A. Jackson; And Others – Journal of Educational Measurement, 1983
In an attempt to restore the symmetry and balance between the study of person and item variation, this paper presents a novel methodology construct specification equations, which allows one to ascertain from the lawful behavior of items what an instrument is measuring. (Author/PN)
Descriptors: Measurement Objectives, Measurement Techniques, Research Methodology, Test Construction
Peer reviewed Peer reviewed
Winne, Philip H.; Belfry, M. Joan – Journal of Educational Measurement, 1982
This review of issues about correcting for attenuation concludes that the basic difficulty lies in being able to identify and equate sources of variance in estimates of validity and reliability. Recommendations are proposed for cautious use of correction for attenuation. (Author/CM)
Descriptors: Correlation, Error of Measurement, Research Methodology, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Wang, Wen-Chung; Wilson, Mark; Shih, Ching-Lin – Journal of Educational Measurement, 2006
This study presents the random-effects rating scale model (RE-RSM) which takes into account randomness in the thresholds over persons by treating them as random-effects and adding a random variable for each threshold in the rating scale model (RSM) (Andrich, 1978). The RE-RSM turns out to be a special case of the multidimensional random…
Descriptors: Item Analysis, Rating Scales, Item Response Theory, Monte Carlo Methods
Peer reviewed Peer reviewed
Nandakumar, Ratna – Journal of Educational Measurement, 1994
Using simulated and real data, this study compares the performance of three methodologies for assessing unidimensionality: (1) DIMTEST; (2) the approach of Holland and Rosenbaum; and (3) nonlinear factor analysis. All three models correctly confirm unidimensionality, but they differ in their ability to detect the lack of unidimensionality.…
Descriptors: Ability, Comparative Analysis, Evaluation Methods, Factor Analysis
Peer reviewed Peer reviewed
Masters, Geofferey N. – Journal of Educational Measurement, 1984
This paper develops and illustrates a latent trait approach to constructing an item bank when responses are scored in several ordered categories. This approach is an extension of the methodology developed by Choppin, Wright and Stone, and Wright and Bell for the construction and maintenance of banks of dichotomously scored items. (Author/PN)
Descriptors: Equated Scores, Item Banks, Latent Trait Theory, Mathematical Models
Peer reviewed Peer reviewed
Beaton, Albert E.; Johnson, Eugene G. – Journal of Educational Measurement, 1992
The National Assessment of Educational Progress (NAEP) uses item response theory (IRT) based scaling methods to summarize information in complex data sets. The necessity of global scores or more detailed subscores, creation of developmental scales for different ages, and use of scale anchoring for scale interpretation are discussed. (SLD)
Descriptors: Age Differences, Educational Assessment, Elementary Secondary Education, Evaluation Methods
Peer reviewed Peer reviewed
Baker, Frank B.; Al-Karni, Ali – Journal of Educational Measurement, 1991
Two methods of computing test equating coefficients under item response theory by the following authors are compared: (1) B. H. Loyd and H. D. Hoover (1980); and (2) M. L. Stocking and F. M. Lord (1983). Conditions under which the method of Stocking and Lord is preferable are described. (SLD)
Descriptors: Ability, College Entrance Examinations, Comparative Analysis, Equated Scores
Peer reviewed Peer reviewed
Muthen, Bengt O.; And Others – Journal of Educational Measurement, 1991
A procedure is presented for examining the influence of instruction on responses to test items by extending item response theory to incorporate variables illustrating different amounts of opportunity to learn. Data from the Second International Mathematics Study (grade 8 scores for about 7,000 students) illustrate the discussion. (SLD)
Descriptors: Ability, Achievement Tests, Estimation (Mathematics), Grade 8