Showing 1 to 15 of 26 results
Peer reviewed
PDF on ERIC: Download full text
Bijsterbosch, Erik – Geographical Education, 2018
Geography teachers' school-based (internal) examinations in pre-vocational geography education in the Netherlands appear to be in line with the findings in the literature, namely that teachers' assessment practices tend to focus on the recall of knowledge. These practices are strongly influenced by national (external) examinations. This paper…
Descriptors: Foreign Countries, Instructional Effectiveness, National Competency Tests, Geography Instruction
Peer reviewed
Direct link
Slepkov, Aaron D.; Shiell, Ralph C. – Physical Review Special Topics - Physics Education Research, 2014
Constructed-response (CR) questions are a mainstay of introductory physics textbooks and exams. However, because of the time, cost, and scoring reliability constraints associated with this format, CR questions are being increasingly replaced by multiple-choice (MC) questions in formal exams. The integrated testlet (IT) is a recently developed…
Descriptors: Science Tests, Physics, Responses, Multiple Choice Tests
Peer reviewed
Direct link
Sparfeldt, Jorn R.; Kimmel, Rumena; Lowenkamp, Lena; Steingraber, Antje; Rost, Detlef H. – Educational Assessment, 2012
Multiple-choice (MC) reading comprehension test items comprise three components: text passage, questions about the text, and MC answers. The construct validity of this format has been repeatedly criticized. In three between-subjects experiments, fourth graders (N₁ = 230, N₂ = 340, N₃ = 194) worked on three…
Descriptors: Test Items, Reading Comprehension, Construct Validity, Grade 4
Peer reviewed
Direct link
Threlfall, John; Pool, Peter; Homer, Matthew; Swinnerton, Bronwen – Educational Studies in Mathematics, 2007
This article explores the effect on assessment of "translating" paper and pencil test items into their computer equivalents. Computer versions of a set of mathematics questions derived from the paper-based end of key stage 2 and 3 assessments in England were administered to age appropriate pupil samples, and the outcomes compared.…
Descriptors: Test Items, Student Evaluation, Foreign Countries, Test Validity
Peer reviewed
Direct link
Kong, Xiaojing J.; Wise, Steven L.; Bhola, Dennison S. – Educational and Psychological Measurement, 2007
This study compared four methods for setting item response time thresholds to differentiate rapid-guessing behavior from solution behavior. Thresholds were either (a) common for all test items, (b) based on item surface features such as the amount of reading required, (c) based on visually inspecting response time frequency distributions, or (d)…
Descriptors: Test Items, Reaction Time, Timed Tests, Item Response Theory
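The Kong, Wise, and Bhola abstract above describes several ways of setting a response-time threshold that separates rapid-guessing behavior from solution behavior. As a rough illustration of the simplest variant, a single threshold common to all items (method (a)), here is a minimal Python sketch; the 3-second cut-off and the example response times are hypothetical assumptions, not values from the study.

```python
# Minimal sketch of the "common threshold" approach (method (a) in the abstract):
# any response faster than a single fixed cut-off is flagged as rapid guessing,
# everything else is treated as solution behavior. Threshold and data are
# illustrative assumptions, not taken from the study.

THRESHOLD_SECONDS = 3.0  # one cut-off applied to every item (assumed value)

def classify_responses(response_times, threshold=THRESHOLD_SECONDS):
    """Label each response time as 'rapid_guess' or 'solution'."""
    return [
        "rapid_guess" if rt < threshold else "solution"
        for rt in response_times
    ]

if __name__ == "__main__":
    # Hypothetical response times (seconds) for one examinee across ten items.
    times = [1.2, 14.5, 2.8, 22.0, 0.9, 31.4, 5.6, 2.1, 18.3, 44.0]
    labels = classify_responses(times)
    flagged = sum(label == "rapid_guess" for label in labels)
    print(f"{flagged} of {len(times)} responses flagged as rapid guessing")
```

Methods (b) and (c) in the abstract would replace the single constant with per-item thresholds derived from item surface features, such as the amount of reading required, or from visual inspection of each item's response-time frequency distribution.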
Peer reviewed
Green, Kathy – Journal of Experimental Education, 1979
Reliabilities and concurrent validities of teacher-made multiple-choice and true-false tests were compared. No significant differences were found even when multiple-choice reliability was adjusted to equate testing time. (Author/MH)
Descriptors: Comparative Testing, Higher Education, Multiple Choice Tests, Test Format
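The Green abstract notes that multiple-choice reliability was "adjusted to equate testing time" but does not say how. One standard way to project reliability to a different test length, and by extension to a different amount of testing time if time is assumed proportional to the number of items, is the Spearman-Brown formula; the sketch below states it under that assumption, which may or may not match the adjustment actually used in the study.

```latex
% Spearman-Brown projection of reliability when a test is lengthened or
% shortened by a factor k. Here k is assumed to be the ratio of testing
% times, standing in for the ratio of test lengths.
\[
  \rho_{kk'} = \frac{k\,\rho_{xx'}}{1 + (k - 1)\,\rho_{xx'}}
\]
% \rho_{xx'} : reliability of the original multiple-choice test
% k          : lengthening factor (k > 1) or shortening factor (k < 1)
% \rho_{kk'} : projected reliability at the adjusted length
```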
Borich, Gary D.; Paver, Sydney W. – 1974
Eighty undergraduates were administered four self-report locus of control inventories, in order to evaluate the convergent and discriminant validity of four categories common to these inventories: chance, fate, personal control, and powerful others. The four inventories were: (1) Internal, Powerful Others and Chance scales; (2) James Internal…
Descriptors: Comparative Testing, Higher Education, Individual Differences, Locus of Control
Graham, Darol L. – 1974
The adequacy of a test developed for statewide assessment of basic mathematics skills was investigated. The test, comprised of multiple-choice items reflecting a series of behavioral objectives, was compared with a more extensive criterion measure generated from the same objectives by the application of a strict item sampling model. In many…
Descriptors: Comparative Testing, Criterion Referenced Tests, Educational Assessment, Item Sampling
Peer reviewed
Wainer, Howard; And Others – Journal of Educational Measurement, 1992
Computer simulations were run to measure the relationship between testlet validity and factors of item pool size and testlet length for both adaptive and linearly constructed testlets. Making a testlet adaptive yields only modest increases in aggregate validity because of the peakedness of the typical proficiency distribution. (Author/SLD)
Descriptors: Adaptive Testing, Comparative Testing, Computer Assisted Testing, Computer Simulation
Peer reviewed
Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1991
Effects of item wording on questionnaire reliability and validity were studied, using 280 undergraduate business students who completed a questionnaire comprising 4 item types: (1) regular; (2) polar opposite; (3) negated polar opposite; and (4) negated regular. Implications of results favoring regular and negated regular items are discussed. (SLD)
Descriptors: Business Education, Comparative Testing, Higher Education, Negative Forms (Language)
Peer reviewed
Wainer, Howard; And Others – Journal of Educational Measurement, 1991
Hierarchical (adaptive) and linear methods of testlet construction were compared. The performance of 2,080 ninth and tenth graders on a 4-item testlet was used to predict performance on the entire test. The adaptive test was slightly superior as a predictor, but the cost of obtaining that superiority was considerable. (SLD)
Descriptors: Adaptive Testing, Algebra, Comparative Testing, High School Students
Sykes, Robert C.; And Others – 1991
To investigate the psychometric feasibility of replacing a paper-and-pencil licensing examination with a computer-administered test, a validity study was conducted. The computer-administered test (Cadm) was a common set of items for all test takers, distinct from computerized adaptive testing, in which test takers receive items appropriate to…
Descriptors: Adults, Certification, Comparative Testing, Computer Assisted Testing
Dowd, Steven B. – 1992
An alternative to multiple-choice (MC) testing is suggested as it pertains to the field of radiologic technology education. General principles for writing MC questions are given and contrasted with a new type of MC question, the alternate-choice (AC) question, in which the answer choices are embedded in the question in a short form that resembles…
Descriptors: Comparative Testing, Difficulty Level, Evaluation Methods, Higher Education
Siskind, Theresa G.; And Others – 1992
The instructional validity of computer administered tests was studied with a focus on whether differences in test scores and item behavior are a function of instructional mode (computer versus non-computer). In the first of 3 studies, performance test scores for approximately 400 high school students in 1990-91 for tasks accomplished with the…
Descriptors: Comparative Testing, Comprehension, Computer Assisted Instruction, Computer Assisted Testing
Andrada, Gilbert N.; Linden, Kathryn W. – 1993
The psychometric properties of objective tests administered in two testing conditions were compared, using an experimental take-home testing condition and a traditional in-class testing condition. Subjects were 290 college students in a basic educational psychology course who took a test developed and tested the previous semester. Two equivalent…
Descriptors: Class Activities, Classroom Techniques, Cognitive Processes, College Students