Publication Date
In 2025 | 12 |
Since 2024 | 187 |
Since 2021 (last 5 years) | 818 |
Since 2016 (last 10 years) | 1951 |
Since 2006 (last 20 years) | 4074 |
Descriptor
Item Response Theory | 5553 |
Test Items | 1817 |
Foreign Countries | 1196 |
Models | 1148 |
Psychometrics | 918 |
Scores | 782 |
Comparative Analysis | 761 |
Test Construction | 750 |
Simulation | 740 |
Statistical Analysis | 659 |
Difficulty Level | 570 |
More ▼ |
Source
Author
Sinharay, Sandip | 48 |
Wilson, Mark | 45 |
Cohen, Allan S. | 43 |
Meijer, Rob R. | 43 |
Tindal, Gerald | 42 |
Wang, Wen-Chung | 40 |
Alonzo, Julie | 37 |
Ferrando, Pere J. | 36 |
Cai, Li | 35 |
van der Linden, Wim J. | 35 |
Glas, Cees A. W. | 34 |
More ▼ |
Publication Type
Education Level
Location
Turkey | 94 |
Australia | 89 |
Germany | 79 |
United States | 74 |
Netherlands | 68 |
Taiwan | 59 |
Indonesia | 53 |
China | 51 |
Canada | 49 |
Japan | 38 |
Florida | 37 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 4 |
Meets WWC Standards with or without Reservations | 4 |

Westers, Paul; Kelderman, Henk – Psychometrika, 1992
A method for analyzing test-item responses is proposed to examine differential item functioning (DIF) in multiple-choice items within the latent class framework. Different models for detection of DIF are formulated, defining the subgroup as a latent variable. An efficient estimation method is described and illustrated. (SLD)
Descriptors: Chi Square, Difficulty Level, Educational Testing, Equations (Mathematics)

Harris, Deborah J. – Applied Psychological Measurement, 1991
Effects of passage and item-scrambling on equipercentile and item-response theory equating were investigated using 2 scrambled versions of the American College Testing Program Assessment for approximately 25,000 examinees. Results indicate that using a base-form conversion table with a scrambled form affects the individual examinee level. (SLD)
Descriptors: College Entrance Examinations, Comparative Testing, Context Effect, Equated Scores

Sireci, Stephen G.; And Others – Journal of Educational Measurement, 1991
Calculating the reliability of a testlet-based test is demonstrated using data from 1,812 males and 2,216 females taking the Scholastic Aptitude Test verbal section and 3,866 examinees taking another reading test. Traditional reliabilities calculated on reading comprehension tests constructed of four testlets provided substantial overestimates.…
Descriptors: College Entrance Examinations, Equations (Mathematics), Estimation (Mathematics), High School Students

Zwick, Rebecca – Educational Measurement: Issues and Practice, 1991
Item parameter estimates derived through item response theory methods have been considered relatively robust to changes in item position and context, but the anomaly in reading scores from the 1986 National Assessment of Educational Progress (NAEP) illustrates problems with common population equating procedures when there are test form changes.…
Descriptors: Achievement Tests, Context Effect, Equated Scores, Estimation (Mathematics)

Tsutakawa, Robert K.; Johnson, Jane C. – Psychometrika, 1990
The conventional method of measuring ability--based on items with assumed true parameter values obtained from a pretest--is compared to a Bayesian method that deals with the uncertainties of such items. Data from a 1987 American College Testing Program mathematics test indicate that maximum likelihood/Bayesian techniques underestimate uncertainty.…
Descriptors: Ability Identification, Bayesian Statistics, College Entrance Examinations, Comparative Analysis

Leplege, Alain; Ecosse, Emmanuel – Journal of Applied Measurement, 2000
Describes the source instrument, the database, and the experimental methodology that have been used in developing the World Health Organization Quality of Life (WHOQOL) profiles to allow the comparison of quality of life findings from a variety of cultural settings. Illustrates the precautions necessary in interpreting scores from different…
Descriptors: Adults, Cross Cultural Studies, Cultural Differences, Culture Fair Tests

Seltzer, Michael H.; And Others – Educational Evaluation and Policy Analysis, 1994
A series of analyses based on a longitudinal study of reading achievement in Chicago (Illinois) illustrate that the conclusions drawn about academic growth and the decisions about types of interventions needed may be very sensitive to the metric that is used (i.e., grade equivalents or item response theory metrics). (SLD)
Descriptors: Academic Achievement, Achievement Gains, Decision Making, Educational Policy
Guzman, Eduardo; Conejo, Ricardo; Garcia-Hervas; Emilio – Educational Technology & Society, 2005
SIETTE is a web-based adaptive testing system. It implements Computerized Adaptive Tests. These tests are tailor-made, theory-based tests, where questions shown to students, finalization of the test, and student knowledge estimation is accomplished adaptively. To construct these tests, SIETTE has an authoring environment comprising a suite of…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Test Items
Myers, Nicholas D.; Wolfe, Edward W.; Feltz, Deborah L. – Measurement in Physical Education and Exercise Science, 2005
This study extends validity evidence for the Coaching Efficacy Scale (CES; Feltz, Chase, Moritz, & Sullivan, 1999) by providing an evaluation of the psychometric properties of the instrument from previously collected data on high school and college coaches from United States. Data were fitted to a multidimensional item response theory model.…
Descriptors: Self Efficacy, Test Validity, Rating Scales, Psychometrics
Sy, Susan R.; Schulenberg, John E. – International Journal of Behavioral Development, 2005
This study examines the predictive relationships among 309 Asian American and 9471 European American parents' beliefs, expectations, and involvement, and their children's math and reading achievement trajectories during children's transition to school. Data came from the Early Childhood Longitudinal Study-Kindergarten Cohort (ECLS-K), an ongoing…
Descriptors: Reading Achievement, Parent Participation, Kindergarten, Asian Americans
The Standardized Letter of Recommendation: Implications for Selection. Research Report. ETS RR-07-38
Liu, Ou Lydia; Minsky, Jennifer; Ling, Guangming; Kyllonen, Patrick – ETS Research Report Series, 2007
In an effort to standardize academic application procedures, the Standardized Letter of Recommendation (SLR) was developed to capture important cognitive and noncognitive qualities of graduate school candidates. The SLR consists of seven scales ("knowledge," "analytical skills," "communication skills,"…
Descriptors: Letters (Correspondence), Graduate Students, College Applicants, Cognitive Ability
Westers, Paul – 1993
The subject of this dissertation is the examination of differential item functioning (DIF) through the use of loglinear Rasch models with latent classes. DIF refers to the probability that a correct response among equally able test takers is different for various racial, ethnic, and gender groups. Because usual methods of detecting DIF give little…
Descriptors: Ability, Estimation (Mathematics), Ethnic Groups, Foreign Countries
Harvey-Beavis, Adrian – 1994
How teachers' judgments about student literacy behavior were analyzed under the Rasch model is described. The analysis was done to assist staff of the Western Australian Department of Education to revise aspects of a literacy program called "First Steps" for the early years of school. First Steps uses a developmental continuum of small…
Descriptors: Behavior Patterns, Child Development, Educational Assessment, Elementary Education
Lunz, Mary E.; And Others – 1990
This study explores the test-retest consistency of computer adaptive tests of varying lengths. The testing model used was designed as a mastery model to determine whether an examinee's estimated ability level is above or below a pre-established criterion expressed in the metric (logits) of the calibrated item pool scale. The Rasch model was used…
Descriptors: Ability Identification, Adaptive Testing, College Students, Comparative Testing

Jolly, S. J.; And Others – Florida Journal of Educational Research, 1987
This study examined irregularities in reading comprehension (RC) test results at both the individual and aggregate levels, focusing on the effect of test speededness on test scores. Data were taken from results of the Stanford Achievement Test--Seventh Edition (SAT/7), administered during April of 1983 to about 50,000 Palm Beach County (Florida)…
Descriptors: Elementary School Students, Grade 4, Intermediate Grades, Item Response Theory