Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 4 |
Descriptor
Test Length | 6 |
Item Response Theory | 5 |
Comparative Analysis | 2 |
Computation | 2 |
Maximum Likelihood Statistics | 2 |
Measurement Techniques | 2 |
Models | 2 |
Test Reliability | 2 |
Theories | 2 |
Ability | 1 |
Adaptive Testing | 1 |
More ▼ |
Source
Psychometrika | 6 |
Author
Chiu, Chia-Yi | 1 |
Doebler, Anna | 1 |
Doebler, Philipp | 1 |
Douglas, Jeffrey A. | 1 |
Eggen, Theo J. H. M. | 1 |
Holling, Heinz | 1 |
Huynh, Huynh | 1 |
Kim, Seock-Ho | 1 |
Li, Xiaodong | 1 |
Verelst, Norman D. | 1 |
Yao, Lihua | 1 |
More ▼ |
Publication Type
Journal Articles | 6 |
Reports - Research | 3 |
Reports - Evaluative | 2 |
Education Level
Audience
Location
Germany | 1 |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Yao, Lihua – Psychometrika, 2012
Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing
Doebler, Anna; Doebler, Philipp; Holling, Heinz – Psychometrika, 2013
The common way to calculate confidence intervals for item response theory models is to assume that the standardized maximum likelihood estimator for the person parameter [theta] is normally distributed. However, this approximation is often inadequate for short and medium test lengths. As a result, the coverage probabilities fall below the given…
Descriptors: Foreign Countries, Item Response Theory, Computation, Hypothesis Testing
Chiu, Chia-Yi; Douglas, Jeffrey A.; Li, Xiaodong – Psychometrika, 2009
Latent class models for cognitive diagnosis often begin with specification of a matrix that indicates which attributes or skills are needed for each item. Then by imposing restrictions that take this into account, along with a theory governing how subjects interact with items, parametric formulations of item response functions are derived and…
Descriptors: Test Length, Identification, Multivariate Analysis, Item Response Theory

Huynh, Huynh – Psychometrika, 1978
The use of Cohen's kappa index as a measure of the reliability of multiple classifications is developed. Special cases of the index as well as the effects of test length on the index are also explored. (JKS)
Descriptors: Career Development, Classification, Mastery Tests, Test Length
Eggen, Theo J. H. M.; Verelst, Norman D. – Psychometrika, 2006
In this paper, the efficiency of conditional maximum likelihood (CML) and marginal maximum likelihood (MML) estimation of the item parameters of the Rasch model in incomplete designs is investigated. The use of the concept of F-information (Eggen, 2000) is generalized to incomplete testing designs. The scaled determinant of the F-information…
Descriptors: Test Length, Computation, Maximum Likelihood Statistics, Models

Kim, Seock-Ho; And Others – Psychometrika, 1994
Hierarchical Bayes procedures for the two-parameter logistic item response model were compared for estimating item and ability parameters through two joint and two marginal Bayesian procedures. Marginal procedures yielded smaller root mean square differences for item and ability, but results for larger sample size and test length were similar.…
Descriptors: Ability, Bayesian Statistics, Computer Simulation, Estimation (Mathematics)