Meriac, John P.; Woehr, David J.; Gorman, C. Allen; Thomas, Amanda L. E. – Journal of Vocational Behavior, 2013
The multidimensional work ethic profile (MWEP) has become one of the most widely used inventories for measuring the work ethic construct. However, its length has been a potential barrier to even more widespread use. We developed a short form of the MWEP, the MWEP-SF. A subset of items from the original measure was identified, using item response…
Descriptors: Work Ethic, Profiles, Measures (Individuals), Test Construction
Tay, Louis; Drasgow, Fritz – Educational and Psychological Measurement, 2012
Two Monte Carlo simulation studies investigated the effectiveness of the mean adjusted X²/df statistic proposed by Drasgow and colleagues and, because of problems with the method, a new approach for assessing the goodness of fit of an item response theory model was developed. It has been previously recommended that mean adjusted…
Descriptors: Test Length, Monte Carlo Methods, Goodness of Fit, Item Response Theory

Sanders, Piet F.; Verschoor, Alfred J. – Applied Psychological Measurement, 1998
Presents minimization and maximization models for parallel test construction under constraints. The minimization model constructs weakly and strongly parallel tests of minimum length, while the maximization model constructs weakly and strongly parallel tests with maximum test reliability. (Author/SLD)
Descriptors: Algorithms, Models, Reliability, Test Construction
Burton, Richard F. – Assessment and Evaluation in Higher Education, 2005
Examiners seeking guidance on multiple-choice and true/false tests are likely to encounter various faulty or questionable ideas. Twelve of these are discussed in detail, having to do mainly with the effects on test reliability of test length, guessing and scoring method (i.e. number-right scoring or negative marking). Some misunderstandings could…
Descriptors: Guessing (Tests), Multiple Choice Tests, Objective Tests, Test Reliability
Wang, Wen-Chung; Chen, Hsueh-Chu – Educational and Psychological Measurement, 2004
As item response theory (IRT) becomes popular in educational and psychological testing, there is a need to report IRT-based effect size measures. In this study, we show how the standardized mean difference can be generalized into such a measure. A disattenuation procedure based on the IRT test reliability is proposed to correct the attenuation…
Descriptors: Test Reliability, Rating Scales, Sample Size, Error of Measurement
Cohen, Allan S.; Gregg, Noel; Deng, Meng – Learning Disabilities Research & Practice, 2005
The premise of a great deal of current research guiding policy development has been that accommodations are the catalyst for student performance differences. Rather than accepting this premise, two studies were conducted to investigate the influence of extended time and content knowledge on the performance of ninth-grade students who took a…
Descriptors: Program Effectiveness, Mathematics Tests, Learning Disabilities, Testing Accommodations

Mitchell, G.; And Others – Medical Teacher, 1986
Describes a study designed to determine if the amount of time allocated for answering multiple true/false type questions affects the grades of the medical students taking the tests. Students who had 2-1/4 minutes to answer each question scored significantly better than those who had 1-1/2 minutes or 3 minutes. (TW)
Descriptors: Biochemistry, College Science, Higher Education, Medical Education
Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004
The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…
Descriptors: Ability, Testing, Futures (of Society), Psychometrics