Publication Date
In 2025 | 12 |
Since 2024 | 187 |
Since 2021 (last 5 years) | 818 |
Since 2016 (last 10 years) | 1951 |
Since 2006 (last 20 years) | 4074 |
Descriptor
Item Response Theory | 5553 |
Test Items | 1817 |
Foreign Countries | 1196 |
Models | 1148 |
Psychometrics | 918 |
Scores | 782 |
Comparative Analysis | 761 |
Test Construction | 750 |
Simulation | 740 |
Statistical Analysis | 659 |
Difficulty Level | 570 |
More ▼ |
Source
Author
Sinharay, Sandip | 48 |
Wilson, Mark | 45 |
Cohen, Allan S. | 43 |
Meijer, Rob R. | 43 |
Tindal, Gerald | 42 |
Wang, Wen-Chung | 40 |
Alonzo, Julie | 37 |
Ferrando, Pere J. | 36 |
Cai, Li | 35 |
van der Linden, Wim J. | 35 |
Glas, Cees A. W. | 34 |
More ▼ |
Publication Type
Education Level
Location
Turkey | 94 |
Australia | 89 |
Germany | 79 |
United States | 74 |
Netherlands | 68 |
Taiwan | 59 |
Indonesia | 53 |
China | 51 |
Canada | 49 |
Japan | 38 |
Florida | 37 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 4 |
Meets WWC Standards with or without Reservations | 4 |
Sykes, Robert C.; Truskosky, Denise; White, Hillory – 2001
The purpose of this research was to study the effect of the three different ways of increasing the number of points contributed by constructed response (CR) items on the reliability of test scores from mixed-item-format tests. The assumption of unidimensionality that underlies the accuracy of item response theory model-based standard error…
Descriptors: Constructed Response, Elementary Education, Elementary School Students, Error of Measurement

Longford, Nicholas T. – Journal of Educational Statistics, 1990
A multilevel variance component analysis from the pilot year of pretesting an instrument--the GENED--designed to present information about general education outcomes is presented, using data from about 11,000 college students. The analysis addresses the discriminant validity of the subtests and statistical issues in test construction. (SLD)
Descriptors: College Students, Discriminant Analysis, General Education, Higher Education

Ansley, Timothy N.; Forsyth, Robert A. – Applied Measurement in Education, 1990
The interaction of reading and computational abilities in solving mathematics word problems was investigated with the responses of about 20,000 fourth graders and 20,000 eighth graders to items from the Iowa Tests of Basic Skills. Item responses exhibited different types of interaction, a finding with implications for item-response theory…
Descriptors: Computation, Elementary Education, Elementary School Students, Grade 4

Mislevy, Robert J.; And Others – Journal of Educational Measurement, 1992
Concepts behind plausible values in estimating population characteristics from sparse matrix samples of item responses are discussed. The use of marginal analyses is described in the context of the National Assessment of Educational Progress, and the approach is illustrated with Scholastic Aptitude Test data for 9,075 high school seniors. (SLD)
Descriptors: College Entrance Examinations, Educational Assessment, Equations (Mathematics), Estimation (Mathematics)

Forsyth, Robert A. – Educational Measurement: Issues and Practice, 1991
The scales of the National Assessment of Educational Progress (NAEP), as constructed, do not yield meaningful criterion-referenced interpretations. Poorly defined NAEP goals and the present knowledge base do not allow the measurement of what examinees can and cannot do. Inappropriate interpretations of NAEP data are discussed, with specific…
Descriptors: Achievement Tests, Criterion Referenced Tests, Educational Assessment, Item Response Theory

Chan, Jason C. – Educational and Psychological Measurement, 1991
A study involving 102 high school students (49 males and 53 females) from Taiwan revealed that the order of response scale labels had a primacy effect on subjects' choices of the alternatives in Likert-type attitude scales. Practical implications of the response-order effects for measurement are discussed. (SLD)
Descriptors: Correlation, Factor Analysis, Factor Structure, Foreign Countries

Stone, Clement A.; Lane, Suzanne – Applied Measurement in Education, 1991
A model-testing approach for evaluating the stability of item response theory item parameter estimates (IPEs) in a pretest-posttest design is illustrated. Nineteen items from the Head Start Measures Battery were used. A moderately high degree of stability in the IPEs for 5,510 children assessed on 2 occasions was found. (TJH)
Descriptors: Comparative Testing, Compensatory Education, Computer Assisted Testing, Early Childhood Education
Li, Yuan H.; Lissitz, Robert W. – Journal of Educational Measurement, 2004
The analytically derived asymptotic standard errors (SEs) of maximum likelihood (ML) item estimates can be approximated by a mathematical function without examinees' responses to test items, and the empirically determined SEs of marginal maximum likelihood estimation (MMLE)/Bayesian item estimates can be obtained when the same set of items is…
Descriptors: Test Items, Computation, Item Response Theory, Error of Measurement
Elder, Cathie; Knoch, Ute; Barkhuizen, Gary; von Randow, Janet – Language Assessment Quarterly, 2005
Research on the utility of feedback to raters in the form of performance reports has produced mixed findings (Lunt, Morton, & Wigglesworth, 1994; Wigglesworth, 1993) and has thus far been trialled only in oral assessment contexts. This article reports on a study investigating raters' attitudes and responsiveness to feedback on their ratings of…
Descriptors: Feedback (Response), Scripts, Test Bias, Writing Evaluation
Harlow, Ann; Jones, Alister – Research in Science Education, 2004
The purpose of this study was to explore how Year 8 students answered Third International Mathematics and Science Study (TIMSS) questions and whether the test questions represented the scientific understanding of these students. One hundred and seventy-seven students were tested using written test questions taken from the science test used in the…
Descriptors: Science Tests, Test Items, Grade 8, Scientific Concepts
Samuelsson, Stefan; Byrne, Brian; Quain, Peter; Wadsworth, Sally; Corley, Robin; DeFries, John C.; Willcutt, Eric; Olson, Richard – Journal of Educational Psychology, 2005
Individual differences in measures of prereading skills and in questionnaire measures of 4-5-year-old twins' print environments in Australia, Scandinavia, and the United States were explored with a behavioral-genetic design. Modest phenotypic correlations were found between environmental measures and the twins' print knowledge, general verbal…
Descriptors: Foreign Countries, Environmental Influences, Genetics, Prereading Experience
Doig, Brian – Mathematics Education Research Journal, 2005
The assessment of children in their years before school and their first years of school has been, traditionally, informal. Further, assessment of children's mathematical skills at this level has been infrequent compared to social, emotional and physical assessments. However, there are contexts where reliable, valid, standardised data from…
Descriptors: Mathematics Skills, Item Response Theory, Mathematics Achievement, Preschool Children
Hill, Heather C.; Ball, Deborah Loewenberg; Blunk, Merrie; Goffney, Imani Masters; Rowan, Brian – Measurement: Interdisciplinary Research and Perspectives, 2007
This paper provides a summary of the authors' attempts to uncover links between their measures, classroom mathematics instruction, and student learning. This paper also provides evidence regarding one central critique of their measures: that multiple-choice assessments cannot validly represent the knowledge, skills, and judgment involved in actual…
Descriptors: Teacher Characteristics, Teaching Methods, Correlation, Mathematics Achievement
Tang, Huixing – 1994
This paper describes an item response theory (IRT) based method of differential item functioning (DIF) detection that involves neither separate calibration nor ability grouping. IRT is used to generate residual scores, scores free of the effects of person or group ability and item difficulty. Analysis of variance is then used to test the group…
Descriptors: Ability Grouping, Analysis of Variance, Goodness of Fit, Identification
Muraki, Eiji; Carlson, James E. – 1993
A multidimensional item response theory (IRT) model for polytomously scored items, based on the graded response model and using the normal ogive, is developed; and an EM algorithm that may be used to estimate the parameters of the model is also discussed. The model is illustrated through a simulation study in which the polytomous item responses of…
Descriptors: Computer Graphics, Computer Simulation, Equations (Mathematics), Estimation (Mathematics)