Zhang, Susu; Li, Anqi; Wang, Shiyu – Educational Measurement: Issues and Practice, 2023
In computer-based tests allowing revision and reviews, examinees' sequence of visits and answer changes to questions can be recorded. The variable-length revision log data introduce new complexities to the collected data but, at the same time, provide additional information on examinees' test-taking behavior, which can inform test development and…
Descriptors: Computer Assisted Testing, Test Construction, Test Wiseness, Test Items
Semere Kiros Bitew; Amir Hadifar; Lucas Sterckx; Johannes Deleu; Chris Develder; Thomas Demeester – IEEE Transactions on Learning Technologies, 2024
Multiple-choice questions (MCQs) are widely used in digital learning systems, as they allow for automating the assessment process. However, owing to the increased digital literacy of students and the advent of social media platforms, MCQ tests are widely shared online, and teachers are continuously challenged to create new questions, which is an…
Descriptors: Multiple Choice Tests, Computer Assisted Testing, Test Construction, Test Items
Laura Laclede – ProQuest LLC, 2023
Because non-cognitive constructs can influence student success in education beyond academic achievement, it is essential that they are reliably conceptualized and measured. Within this context, there are several gaps in the literature related to correctly interpreting the meaning of scale scores when a non-standard response option like I do not…
Descriptors: High School Students, Test Wiseness, Models, Test Items
Thompson, Kathryn N. – ProQuest LLC, 2023
It is imperative to collect validity evidence prior to interpreting and using test scores. During the process of collecting validity evidence, test developers should consider whether test scores are contaminated by sources of extraneous information. This is referred to as construct irrelevant variance, or the "degree to which test scores are…
Descriptors: Test Wiseness, Test Items, Item Response Theory, Scores
DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023
A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness
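The classical item analysis (CIA) quantities that DeCarlo's signal-detection measures are compared against — item difficulty and item discrimination — are conventionally computed as the proportion correct and the corrected item-total correlation. A minimal sketch of that classical baseline (illustrative data; this is not DeCarlo's SDT formulation):

```python
import numpy as np

# Rows = examinees, columns = items; 1 = correct, 0 = incorrect.
responses = np.array([
    [1, 1, 0, 1],
    [1, 0, 0, 1],
    [1, 1, 1, 1],
    [0, 0, 0, 1],
    [1, 1, 0, 0],
])

# Classical item difficulty: proportion of examinees answering correctly.
difficulty = responses.mean(axis=0)

# Item discrimination: point-biserial correlation between each item and
# the total score on the remaining items (corrected item-total r).
def item_discrimination(data):
    discrim = []
    for j in range(data.shape[1]):
        rest = data.sum(axis=1) - data[:, j]  # total excluding item j
        discrim.append(np.corrcoef(data[:, j], rest)[0, 1])
    return np.array(discrim)

discrimination = item_discrimination(responses)
print("difficulty:", difficulty)
print("discrimination:", np.round(discrimination, 3))
```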
Jing Lu; Chun Wang; Ningzhong Shi – Grantee Submission, 2023
In high-stakes, large-scale, standardized tests with certain time limits, examinees are likely to engage in either one of the three types of behavior (e.g., van der Linden & Guo, 2008; Wang & Xu, 2015): solution behavior, rapid guessing behavior, and cheating behavior. Oftentimes examinees do not always solve all items due to various…
Descriptors: High Stakes Tests, Standardized Tests, Guessing (Tests), Cheating
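A common operationalization of the solution-behavior vs. rapid-guessing distinction is a response-time threshold: answers submitted faster than some cutoff are flagged as likely guesses. A minimal sketch of that generic idea (the 10-second cutoff and the data are illustrative assumptions, not the model from Lu, Wang, and Shi):

```python
# Flag likely rapid-guessing responses with a simple response-time threshold.
# The 10-second cutoff is an illustrative assumption, not a recommended value.

response_times = {  # seconds one examinee spent on each item
    "item1": 45.2,
    "item2": 3.1,
    "item3": 62.0,
    "item4": 1.8,
}

THRESHOLD = 10.0  # responses faster than this are flagged as rapid guesses

flags = {item: t < THRESHOLD for item, t in response_times.items()}
rapid = [item for item, is_rapid in flags.items() if is_rapid]
print("flagged as rapid guesses:", rapid)
```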
Sultana, Protiva; Rahman, Md. Mehadi – Online Submission, 2018
Creative question is generally considered as a tool to measure students' various levels of learning. The study focused on exploring the present situation of General Science test items/creative questions in Bangladesh. This descriptive study was conducted using a concurrent triangulation research design. To conduct this study both quantitative and…
Descriptors: Secondary School Students, Secondary School Science, Foreign Countries, Science Tests
Margolis, Melissa J.; Mee, Janet; Clauser, Brian E.; Winward, Marcia; Clauser, Jerome C. – Educational Measurement: Issues and Practice, 2016
Evidence to support the credibility of standard setting procedures is a critical part of the validity argument for decisions made based on tests that are used for classification. One area in which there has been limited empirical study is the impact of standard setting judge selection on the resulting cut score. One important issue related to…
Descriptors: Academic Standards, Standard Setting (Scoring), Cutting Scores, Credibility
Towns, Marcy H. – Journal of Chemical Education, 2014
Chemistry faculty members are highly skilled in obtaining, analyzing, and interpreting physical measurements, but often they are less skilled in measuring student learning. This work provides guidance for chemistry faculty from the research literature on multiple-choice item development in chemistry. Areas covered include content, stem, and…
Descriptors: Multiple Choice Tests, Test Construction, Psychometrics, Test Items
Beddow, Peter A. – International Journal of Disability, Development and Education, 2012
In the arena of educational testing, accessibility refers to the degree to which students are given the opportunity to participate in and engage with a test. Accessibility theory is a model for examining the interactions between the test-taker and the test itself and defining how they may decrease some students' access to the test event, ultimately…
Descriptors: Test Results, Test Items, Educational Testing, Scores
US Citizenship and Immigration Services, 2008
"Naturalization Test Redesign Project: Civics Item Selection Analysis" provides an overview of the development of content items for the U.S. history and government (civics) portion of the redesigned naturalization test. This document also reviews the process used to gather and analyze data from multiple studies to determine which civics…
Descriptors: History, Test Items, Citizenship, Individual Testing
A Closer Look at Using Judgments of Item Difficulty to Change Answers on Computerized Adaptive Tests
Vispoel, Walter P.; Clough, Sara J.; Bleiler, Timothy – Journal of Educational Measurement, 2005
Recent studies have shown that restricting review and answer change opportunities on computerized adaptive tests (CATs) to items within successive blocks reduces time spent in review, satisfies most examinees' desires for review, and controls against distortion in proficiency estimates resulting from intentional incorrect answering of items prior…
Descriptors: Mathematics, Item Analysis, Adaptive Testing, Computer Assisted Testing

Plake, Barbara S.; Huntley, Renee M. – Educational and Psychological Measurement, 1984
Two studies examined the effect of making the correct answer of a multiple-choice test item grammatically consistent with the item. American College Testing Assessment experimental items were constructed to investigate grammatical compliance for plural-singular and vowel-consonant agreement. Results suggest…
Descriptors: Grammar, Higher Education, Item Analysis, Multiple Choice Tests

Huck, Schuyler W. – Journal of Educational Measurement, 1978
Providing examinees with advance knowledge of the difficulty of an item led to an increase in test performance with no loss of reliability. This finding was consistent across several test formats. (Author/JKS)
Descriptors: Difficulty Level, Feedback, Higher Education, Item Analysis

Weiten, Wayne – Journal of Experimental Education, 1984
The effects of violating four item construction principles were examined to assess the validity of the principles and the importance of students' test wiseness. While flawed items were significantly less difficult than sound items, differences in item discrimination, test reliability, and concurrent validity were not observed. (Author/BW)
Descriptors: Difficulty Level, Higher Education, Item Analysis, Multiple Choice Tests