Jensen, Nate; Rice, Andrew; Soland, James – Educational Evaluation and Policy Analysis, 2018
While most educators assume that not all students try their best on achievement tests, no current research examines if behaviors associated with low test effort, like rapidly guessing on test items, affect teacher value-added estimates. In this article, we examined the prevalence of rapid guessing to determine if this behavior varied by grade,…
Descriptors: Item Response Theory, Value Added Models, Achievement Tests, Test Items
Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Measurement: Issues and Practice, 2011
Over the past few decades, those who take tests in the United States have exhibited increasing diversity with respect to native language. Standard psychometric procedures for ensuring item and test fairness that have existed for some time were developed when test-taking groups were predominantly native English speakers. A better understanding of…
Descriptors: Test Bias, Testing Programs, Psychometrics, Language Proficiency
Beddow, Peter A. – International Journal of Disability, Development and Education, 2012
In the arena of educational testing, accessibility refers to the degree to which students are given the opportunity to participate in and engage a test. Accessibility theory is a model for examining the interactions between the test-taker and the test itself and defining how they may decrease some students' access to the test event, ultimately…
Descriptors: Test Results, Test Items, Educational Testing, Scores
Chen, Li-Ju; Ho, Rong-Guey; Yen, Yung-Chin – Educational Technology & Society, 2010
This study aimed to explore the effects of marking and metacognition-evaluated feedback (MEF) in computer-based testing (CBT) on student performance and review behavior. Marking is a strategy in which students place a question mark next to a test item to indicate an uncertain answer. The MEF provided students with feedback on test results…
Descriptors: Feedback (Response), Test Results, Test Items, Testing
US Citizenship and Immigration Services, 2008
"Naturalization Test Redesign Project: Civics Item Selection Analysis" provides an overview of the development of content items for the U.S. history and government (civics) portion of the redesigned naturalization test. This document also reviews the process used to gather and analyze data from multiple studies to determine which civics…
Descriptors: History, Test Items, Citizenship, Individual Testing

Mehrens, William A. – Educational Measurement: Issues and Practice, 1991
Cohen and Hyman's response contains several misunderstandings of the original article by Mehrens and Kaminski. One frequently wishes to make inferences to a domain from a test, but teaching a specific performance and testing for that performance does not allow for a domain inference. (SLD)
Descriptors: Cheating, Criterion Referenced Tests, Educational Assessment, Inferences

Albanese, Mark A. – Journal of Medical Education, 1979
Results of a study involving pathology students suggest that there is significant cluing in multiple-true-false test questions that use secondary responses to represent combinations of the primary response (e.g., "Mark B if only 1 and 3 are correct"). Thus test scores are artificially inflated and test reliability is lowered. (JMD)
Descriptors: Allied Health Occupations Education, Cues, Higher Education, Medical Education
Wise, Steven L. – 1996
In recent years, a controversy has arisen about the advisability of allowing examinees to review their test items and possibly change answers. Arguments for and against allowing item review are discussed, and issues that a test designer should consider when designing a Computerized Adaptive Test (CAT) are identified. Most CATs do not allow…
Descriptors: Achievement Gains, Adaptive Testing, Computer Assisted Testing, Error Correction
Talbot, Gilles L. – 1994
This paper offers college teachers guidelines for improving their teacher-made tests. It notes that teachers may focus on how well students have learned course objectives while being unaware of how the testing process itself contributes to the results obtained. The paper reports the results of a test-taking workshop designed to improve college…
Descriptors: College Students, Foreign Countries, Higher Education, Quality Control