Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 13 |
Since 2016 (last 10 years) | 24 |
Since 2006 (last 20 years) | 42 |
Descriptor
Source
Author
Levine, Michael V. | 3 |
McMorris, Robert F. | 3 |
Mercer, Maryann | 3 |
Harnisch, Delwyn L. | 2 |
Horst, Paul | 2 |
Liu, Ou Lydia | 2 |
Rubin, Donald B. | 2 |
Weiss, David J. | 2 |
Abela, John R. Z. | 1 |
Abranovic, Wynn A. | 1 |
Alderman, Donald L. | 1 |
More ▼ |
Publication Type
Reports - Research | 70 |
Journal Articles | 56 |
Speeches/Meeting Papers | 12 |
Reports - Evaluative | 9 |
Reports - Descriptive | 4 |
Dissertations/Theses -… | 1 |
Numerical/Quantitative Data | 1 |
Opinion Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 9 |
Postsecondary Education | 8 |
Elementary Education | 7 |
Grade 6 | 4 |
Intermediate Grades | 4 |
Middle Schools | 4 |
Grade 4 | 3 |
Secondary Education | 3 |
Elementary Secondary Education | 2 |
Grade 3 | 2 |
Grade 5 | 2 |
More ▼ |
Audience
Researchers | 6 |
Practitioners | 1 |
Teachers | 1 |
Location
Canada | 2 |
Germany | 2 |
New Zealand | 2 |
Australia | 1 |
Brunei | 1 |
Czech Republic | 1 |
Iowa | 1 |
Ireland | 1 |
Morocco | 1 |
New Hampshire | 1 |
Ohio | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Vale, C. David; Weiss, David J. – 1975
A conventional test and two forms of a stradaptive test were administered to thousands of simulated subjects by minicomputer. Characteristics of the three tests using several scoring techniques were investigated while varying the discriminating power of the items, the lengths of the tests, and the availability of prior information about the…
Descriptors: Ability, Branching, Comparative Analysis, Computer Oriented Programs
Abranovic, Wynn A.
Two basic issues are discussed that involve the Carver model of the relationship between test score and knowledge. The first of these concerns different types of tests which have different probabilities of obtaining the correct answer due to guessing. A derivation is shown which places different test types "on-a-par"--meaning that two tests are…
Descriptors: Guessing (Tests), Knowledge Level, Mathematical Models, Multiple Choice Tests

McMorris, Robert F.; And Others – Journal of Educational Measurement, 1987
Consistency of gain from changing test answers was tested for students instructed about answer-changing research results, and composition of the gain was analyzed by examining the students' reasons for changing. Mean gain remained positive and consistent with gain for previously studied uninstructed groups; amount of change was also stable.…
Descriptors: Difficulty Level, Graduate Students, Higher Education, Instruction

Fabrey, Lawrence J.; Case, Susan M. – Journal of Medical Education, 1985
The effect on test scores of changing answers to multiple-choice questions was studied and compared to earlier research. The current setting was a nationally administered, in-training, specialty examination for medical residents in obstetrics and gynecology. Both low and high scorers improved their scores when they changed answers. (SW)
Descriptors: Educational Testing, Graduate Medical Students, Guessing (Tests), Gynecology

Reiling, Eldon; Taylor, Ryland – Journal of Educational Measurement, 1972
The hypothesis that it is unwise to change answers to multiple choice questions was tested using multiple regression analysis. The hypothesis was rejected as results showed that there are gains to be made by changing responses. (Author/CK)
Descriptors: Guessing (Tests), Hypothesis Testing, Measurement Techniques, Multiple Choice Tests
Edens, John F. – Assessment, 2004
Recent evidence suggests that the Psychopathic Personality Inventory (PPI) is composed of two orthogonal factors, one representing traits such as dominance and low anxiety (PPI-I) and a second that reflects more socially deviant aspects of psychopathy (PPI-II). To assess whether response sets differentially affect these factors, the present…
Descriptors: Psychopathology, Response Style (Tests), Personality Measures, Psychometrics
Leite, Walter L.; Beretvas, S. Natasha – Educational and Psychological Measurement, 2005
The Marlowe-Crowne Social Desirability Scale (MCSDS), the most commonly used social desirability bias (SDB) assessment, conceptualizes SDB as an individual's need for approval. The Balanced Inventory of Desirable Responding (BIDR) measures SDB as two separate constructs: impression management and self-deception. Scores on SDB scales are commonly…
Descriptors: Measures (Individuals), Factor Analysis, Social Desirability, Scores
Rogers, Bruce G. – 1978
The manner in which instructions to respondents influenced the psychometric results of an affective instrument was examined. Undergraduate students enrolled in an educational measurement course were randomly assigned to either immediate response or careful response treatments. A fifteen-item attitude scale was administered to both groups. The…
Descriptors: Affective Measures, Higher Education, Overt Response, Performance Factors
Mercer, Maryann – 1977
In a 1977 review of the literature on test answer changing, Mueller and Wasser (EJ 163 236) cited 17 studies and concluded that students changing answers on objective tests gain more points than they lost by so doing. Higher scoring students tend to gain more than do the lower scoring students. Six additional studies not reported in the Mueller…
Descriptors: Guessing (Tests), Higher Education, Junior High Schools, Literature Reviews

Mueller, Daniel J.; Schwedel, Allan – Journal of Educational Measurement, 1975
Determines the relationship of sex, answer-changing incidence, and total score to net changes in total score resulting from changing answers, by examining the answer-changing behavior of graduate students responding to achievement test items. (Author/RC)
Descriptors: Achievement Gains, Achievement Tests, Graduate Students, Higher Education
Etherton, Joseph L.; Bianchini, Kevin J.; Ciota, Megan A.; Greve, Kevin W. – Assessment, 2005
Reliable Digit Span (RDS) is an indicator used to assess the validity of cognitive test performance. Scores of 7 or lower suggest poor effort or negative response bias. The possibility that RDS scores are also affected by pain has not been addressed thus potentially threatening RDS specificity. The current study used cold pressor-induced pain to…
Descriptors: Response Style (Tests), Simulation, Intelligence Tests, Pain
Frierson, Henry T., Jr. – 1984
Effects of test-taking instruction for University of North Carolina medical students who failed end-of-year examinations were studied. The exam is compensatory and includes a number of subtests, including National Board (NB) Part I standardized subtests in pathology, pharmacology, and physiology. Students who had to repeat NB Part I subtests were…
Descriptors: Academic Failure, Educational Testing, Higher Education, Intervention
Cross, Lawrence H.; Frary, Robert B. – 1976
It has been demonstrated that corrected-for-guessing scores will be superior to number-right scores in providing estimates of examinee standing on the trait measured by a multiple-choice test, if it can be assumed that examinees can and will comply with the appropriate directions. The purpose of the present study was to test the validity of that…
Descriptors: Achievement Tests, Guessing (Tests), Individual Characteristics, Multiple Choice Tests
Boldt, Robert F. – 1971
One formulation of confidence scoring requires the examinee to indicate as a number his personal probability of the correctness of each alternative in a multiple-choice test. For this formulation, a linear transformation of the logarithm of the correct response is maximized if the examinee reports accurately his personal probability. To equate…
Descriptors: Confidence Testing, Guessing (Tests), Multiple Choice Tests, Probability

Blixt, Sonya L.; Dinero, Thomas E. – Educational and Psychological Measurement, 1985
Sato's Caution Index is a method of analyzing item response patterns independently of total test score to provide diagnostic information about students. This research supports the validity of the diagnostic interpretations of the Index when compared to graduate students' self evaluation. (Author/BS)
Descriptors: Academic Achievement, Educational Diagnosis, Graduate Students, Higher Education