Leighton, Jacqueline P. – Applied Measurement in Education, 2013
The Standards for Educational and Psychological Testing indicate that multiple sources of validity evidence should be used to support the interpretation of test scores. In the past decade, examinee response processes, as a source of validity evidence, have received increased attention. However, there have been relatively few methodological studies…
Descriptors: Psychological Testing, Standards, Interviews, Protocol Analysis

Klein, Stephen P.; Stecher, Brian M.; Shavelson, Richard J.; McCaffrey, Daniel; Ormseth, Tor; Bell, Robert M.; Comfort, Kathy; Othman, Abdul R. – Applied Measurement in Education, 1998
Two studies involving 368 elementary and high school students and 29 readers were conducted to investigate reader consistency, score reliability, and reader time requirements of three hands-on science performance tasks. Holistic scores were as reliable as analytic scores, and there was a high correlation between them after they were disattenuated…
Descriptors: Elementary School Students, Elementary Secondary Education, Hands on Science, High School Students

Johnson, Robert L.; Penny, James; Gordon, Belita – Applied Measurement in Education, 2000
Studied four forms of score resolution used by testing agencies and investigated the effect that each has on the interrater reliability associated with the resulting operational scores. Results, based on 120 essays from the Georgia High School Writing Test, show some forms of resolution to be associated with higher reliability and some associated…
Descriptors: Essay Tests, High School Students, High Schools, Interrater Reliability

Loyd, Brenda H. – Applied Measurement in Education, 1990
Four mathematics test-item types that may perform differently when calculators are used were assessed using data from 160 high school students attending a summer enrichment program. The effects of testing with and without calculators on testing time, test reliability, item difficulty, and item discrimination were also assessed. (TJH)
Descriptors: Calculators, Difficulty Level, High School Students, High Schools