Powell, Janet L.; Gillespie, Cindy – 1990
Traditional tests fall into two categories, both of which have several advantages and disadvantages that need to be considered when determining the type of test to use. Constructed-response tests, such as essay tests, ask students to construct their own responses. Thus, students are required not only to recall but to organize and often apply…
Descriptors: Elementary Secondary Education, Essay Tests, Higher Education, Objective Tests
Carlson, Robert E. – 1986
Field tests of test items are useful in developing cutting scores for important tests, such as teacher certification examinations and high school graduation tests, because they indicate test item quality and estimate future examinee performance. However, field test data may provide a faulty indication of examinee performance on the real test.…
Descriptors: Cutting Scores, Graduation Requirements, High Schools, Item Analysis
Weber, Margaret B. – 1977
The effects of different choice formats on the reliability of teacher-made tests were examined for high and low achievers. The first study examined the effect of 3- and 5-choice items on the reliability of dichotomously scored teacher-made tests. The second study examined the effect of 3- and 4-choice items on the reliability of similarly designed…
Descriptors: Academic Achievement, Achievement Tests, Guessing (Tests), High Achievement
Schurr, K. Terry; Henriksen, L. W. – 1980
Five questionnaire forms containing 61 items specifying potential inservice topics for public school teachers were sent to a stratified random sample of Indiana public school administrators and curriculum supervisors. The five forms differed in that, for two forms, the items were ungrouped and appeared in different orders; and, for three forms,…
Descriptors: Administrator Attitudes, Correlation, Elementary Secondary Education, Factor Analysis
Research Triangle Inst., Durham, NC. Center for Educational Research and Evaluation. – 1976
After reviewing the research design, objectives, and basic conceptual model for the National Longitudinal Study of the High School Class of 1972 (NLS), this document describes the survey plan and issues for the third followup survey. Data analyses--both those in progress and those planned--are summarized. These activities include composite score…
Descriptors: Data Analysis, Followup Studies, Graduate Surveys, High School Graduates
Instructional Sensitivity Statistics Appropriate for Objectives-Based Test Items. CSE Report No. 91.
Kosecoff, Jacqueline B.; Klein, Stephen P. – 1974
Two types of sensitivity indices were developed in this paper, one internal to the total test and the second external. To evaluate the success of these statistics the three criteria suggested for a satisfactory index of item quality were considered. The Internal Sensitivity Index appears to meet these demands. Certainly it is easily computed. In…
Descriptors: Academic Achievement, Correlation, Criterion Referenced Tests, Evaluation Methods
Koos, Eugenia M.; Chan, James Y. – 1972
The development of a series of parallel single-topic tests for testing attainment of 14 objectives concerned with inquiry skill in biology is discussed. The series of eight two-part tests is called "Explorations in Biology" (EIB). (CK)
Descriptors: Biology, Cognitive Objectives, Correlation, Criterion Referenced Tests
Jakwerth, Pamela R.; Stancavage, Frances B.; Reed, Ellen D. – National Center for Education Statistics, 2003
Over the past decade, developers of the National Assessment of Educational Progress (NAEP) have substantially changed the mix of item types on the NAEP assessments by decreasing the numbers of multiple-choice questions and increasing the numbers of questions requiring short- or extended-constructed responses. These changes have been motivated…
Descriptors: National Competency Tests, Response Style (Tests), Test Validity, Qualitative Research

Friel, S.; Johnstone, A. H. – Education in Chemistry, 1979
Presents the results of an investigation to determine if the position of a distractor in a multiple choice question influences the degree of difficulty of an item. The data support the hypothesis that the placement of the distractor immediately before the key alters the difficulty of the item significantly. (Authors/SA)
Descriptors: Educational Research, Item Analysis, Multiple Choice Tests, Research
Allen, Thomas E. – 1984
In 1983, four screening tests for assigning students to the appropriate levels of the Stanford Achievement Test, Seventh Edition, were developed with a national sample of hearing impaired students. While students are normally assigned to one of six test level booklets according to grade, this is inappropriate for certain students. This paper…
Descriptors: Achievement Tests, Difficulty Level, Elementary Education, Hearing Impairments
Betz, Nancy E.; Weiss, David J. – 1974
Monte Carlo simulation procedures were used to study the psychometric characteristics of two two-stage adaptive tests and a conventional "peaked" ability test. Results showed that scores yielded by both two-stage tests better reflected the normal distribution of underlying ability. Ability estimates yielded by one of the two-stage tests…
Descriptors: Ability, Academic Ability, Adaptive Testing, Computers
Bond, Jack H. – 1974
The model developed by the Computer Based Project for the Evaluation of Media for the Handicapped in Syracuse, New York to evaluate the use of captioned films for the deaf with mentally handicapped and emotionally disturbed children is briefly described, followed by a review of recent research conducted by the project staff. Among the areas which…
Descriptors: Attention Span, Audiovisual Instruction, Captions, Color
Froman, Robin D. – 1976
To determine whether placement of items on a teacher rating scale affects the factor structure underlying the scale and to determine whether changing the item format alters the ratings given, a 12-item, high-inference student rating scale was developed containing two global items pertaining to overall teacher effectiveness and 10 evaluative items…
Descriptors: Analysis of Variance, Factor Analysis, Factor Structure, Higher Education

Budescu, David V.; Nevo, Baruch – Journal of Educational Measurement, 1985
The proportionality model assumes that total testing time is proportional to the number of test items and the number of options per multiple choice test item. This assumption was examined, using test items having from two to five options. The model was not supported. (Author/GDC)
Descriptors: College Entrance Examinations, Foreign Countries, Higher Education, Item Analysis
Goldberg, Gail Lynn; Kapinus, Barbara – 1992
The Maryland School Performance Assessment Program (MSPAP) is a relatively new, statewide performance assessment of students in grades 3, 5, and 8. When first administered in May of 1991, the MSPAP included a battery of performance assessment tasks designed to generate written or drawn responses to reading texts. This study evaluated selected…
Descriptors: Comparative Testing, Elementary Education, Elementary School Teachers, Evaluators