ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	2

Descriptor

Test Length	8
Item Response Theory	5
Test Construction	3
Test Items	3
Test Reliability	3
Evaluation Methods	2
Higher Education	2
Multiple Choice Tests	2
Objective Tests	2
Test Theory	2
Test Validity	2
Testing	2
Ability	1
Achievement Tests	1
Algorithms	1
Artificial Intelligence	1
Automation	1
Biochemistry	1
Cognitive Psychology	1
College Science	1
Effect Size	1
Error Patterns	1
Error of Measurement	1
Foreign Countries	1
Futures (of Society)	1
More ▼

Source

Educational and Psychological…	2
Applied Psychological…	1
Assessment and Evaluation in…	1
Journal of Vocational Behavior	1
Learning Disabilities…	1
Measurement:…	1
Medical Teacher	1

Author

Burton, Richard F.	1
Chen, Hsueh-Chu	1
Cohen, Allan S.	1
Deng, Meng	1
Drasgow, Fritz	1
Embretson, Susan E.	1
Gorman, C. Allen	1
Gregg, Noel	1
Meriac, John P.	1
Mitchell, G.	1
Sanders, Piet F.	1
Tay, Louis	1
Thomas, Amanda L. E.	1
Verschoor, Alfred J.	1
Wang, Wen-Chung	1
Woehr, David J.	1
More ▼

Publication Type

Journal Articles	8
Reports - Descriptive	8

Education Level

Higher Education

Audience

Practitioners

Location

Taiwan	1
United Kingdom (Great Britain)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 8 results Save | Export

Development and Validation of a Short Form for the Multidimensional Work Ethic Profile

Peer reviewed

Direct link

Meriac, John P.; Woehr, David J.; Gorman, C. Allen; Thomas, Amanda L. E. – Journal of Vocational Behavior, 2013

The multidimensional work ethic profile (MWEP) has become one of the most widely-used inventories for measuring the work ethic construct. However, its length has been a potential barrier to even more widespread use. We developed a short form of the MWEP, the MWEP-SF. A subset of items from the original measure was identified, using item response…

Descriptors: Work Ethic, Profiles, Measures (Individuals), Test Construction

Adjusting the Adjusted X[superscript 2]/df Ratio Statistic for Dichotomous Item Response Theory Analyses: Does the Model Fit?

Peer reviewed

Direct link

Tay, Louis; Drasgow, Fritz – Educational and Psychological Measurement, 2012

Two Monte Carlo simulation studies investigated the effectiveness of the mean adjusted X[superscript 2]/df statistic proposed by Drasgow and colleagues and, because of problems with the method, a new approach for assessing the goodness of fit of an item response theory model was developed. It has been previously recommended that mean adjusted…

Descriptors: Test Length, Monte Carlo Methods, Goodness of Fit, Item Response Theory

Parallel Test Construction Using Classical Item Parameters.

Peer reviewed

Sanders, Piet F.; Verschoor, Alfred J. – Applied Psychological Measurement, 1998

Presents minimization and maximization models for parallel test construction under constraints. The minimization model constructs weakly and strongly parallel tests of minimum length, while the maximization model constructs weakly and strongly parallel tests with maximum test reliability. (Author/SLD)

Descriptors: Algorithms, Models, Reliability, Test Construction

Multiple-Choice and True/False Tests: Myths and Misapprehensions

Peer reviewed

Direct link

Burton, Richard F. – Assessment and Evaluation in Higher Education, 2005

Examiners seeking guidance on multiple-choice and true/false tests are likely to encounter various faulty or questionable ideas. Twelve of these are discussed in detail, having to do mainly with the effects on test reliability of test length, guessing and scoring method (i.e. number-right scoring or negative marking). Some misunderstandings could…

Descriptors: Guessing (Tests), Multiple Choice Tests, Objective Tests, Test Reliability

The Standardized Mean Difference within the Framework of Item Response Theory

Peer reviewed

Direct link

Wang, Wen-Chung; Chen, Hsueh-Chu – Educational and Psychological Measurement, 2004

As item response theory (IRT) becomes popular in educational and psychological testing, there is a need of reporting IRT-based effect size measures. In this study, we show how the standardized mean difference can be generalized into such a measure. A disattenuation procedure based on the IRT test reliability is proposed to correct the attenuation…

Descriptors: Test Reliability, Rating Scales, Sample Size, Error of Measurement

The Role of Extended Time and Item Content on a High-Stakes Mathematics Test

Peer reviewed

Direct link

Cohen, Allan S.; Gregg, Noel; Deng, Meng – Learning Disabilities Research & Practice, 2005

The premise of a great deal of current research guiding policy development has been that accommodations are the catalyst for student performance differences. Rather than accepting this premise, two studies were conducted to investigate the influence of extended time and content knowledge on the performance of ninth-grade students who took a…

Descriptors: Program Effectiveness, Mathematics Tests, Learning Disabilities, Testing Accommodations

Optimising Marks Obtained in Multiple Choice Question Examinations.

Peer reviewed

Mitchell, G.; And Others – Medical Teacher, 1986

Describes a study designed to determine if the amount of time allocated for answering multiple true/false type questions affects the grades of the medical students taking the tests. Students who had 2-1/4 minutes to answer each question scored significantly better than those who had 1-1/2 minutes or 3 minutes. (TW)

Descriptors: Biochemistry, College Science, Higher Education, Medical Education

The Second Century of Ability Testing: Some Predictions and Speculations

Peer reviewed

Direct link

Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004

The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…

Descriptors: Ability, Testing, Futures (of Society), Psychometrics