Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 3 |
Since 2016 (last 10 years) | 16 |
Since 2006 (last 20 years) | 37 |
Descriptor
Case Studies | 41 |
Item Response Theory | 41 |
Statistical Analysis | 13 |
Foreign Countries | 12 |
Test Items | 10 |
Comparative Analysis | 9 |
Models | 8 |
Psychometrics | 8 |
Reliability | 7 |
Student Evaluation | 7 |
Test Bias | 7 |
More ▼ |
Source
Author
Abdallah, Abdallah A. | 1 |
Acevedo, Daniela | 1 |
Adedoyin, O. O. | 1 |
Allevato, Anthony J. | 1 |
Almond, Russell G. | 1 |
Bao, Lei | 1 |
Batdi, Veli | 1 |
Black, Deborah | 1 |
Bos, Wilfried | 1 |
Boubonari, Theodora | 1 |
Breithaupt, Krista | 1 |
More ▼ |
Publication Type
Education Level
Higher Education | 13 |
Postsecondary Education | 9 |
Elementary Education | 7 |
Secondary Education | 7 |
Middle Schools | 4 |
Intermediate Grades | 3 |
Junior High Schools | 3 |
Elementary Secondary Education | 2 |
Grade 4 | 2 |
Grade 6 | 2 |
Grade 8 | 2 |
More ▼ |
Audience
Researchers | 1 |
Location
Australia | 2 |
Botswana | 2 |
California | 2 |
Chile | 1 |
Finland | 1 |
France | 1 |
Georgia | 1 |
Greece | 1 |
Honduras | 1 |
Hong Kong | 1 |
Russia | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Graduate Record Examinations | 2 |
Program for International… | 2 |
SAT (College Admission Test) | 2 |
Trends in International… | 2 |
Early Childhood Longitudinal… | 1 |
What Works Clearinghouse Rating
Yi-Hsuan Lee; Yue Jia – Applied Measurement in Education, 2024
Test-taking experience is a consequence of the interaction between students and assessment properties. We define a new notion, rapid-pacing behavior, to reflect two types of test-taking experience -- disengagement and speededness. To identify rapid-pacing behavior, we extend existing methods to develop response-time thresholds for individual items…
Descriptors: Adaptive Testing, Reaction Time, Item Response Theory, Test Format
The Choice between Cognitive Diagnosis and Item Response Theory: A Case Study from Medical Education
Youn Seon Lim; Catherine Bangeranye – International Journal of Testing, 2024
Feedback is a powerful instructional tool for motivating learning. But effective feedback, requires that instructors have accurate information about their students' current knowledge status and their learning progress. In modern educational measurement, two major theoretical perspectives on student ability and proficiency can be distinguished.…
Descriptors: Cognitive Measurement, Diagnostic Tests, Item Response Theory, Case Studies
Wolkowitz, Amanda A.; Foley, Brett; Zurn, Jared – Practical Assessment, Research & Evaluation, 2023
The purpose of this study is to introduce a method for converting scored 4-option multiple-choice (MC) items into scored 3-option MC items without re-pretesting the 3-option MC items. This study describes a six-step process for achieving this goal. Data from a professional credentialing exam was used in this study and the method was applied to 24…
Descriptors: Multiple Choice Tests, Test Items, Accuracy, Test Format
Meneses, Alejandra; Uccelli, Paola; Santelices, María Verónica; Ruiz, Marcela; Acevedo, Daniela; Figueroa, Javiera – Reading Research Quarterly, 2018
Although literacy achievement has improved in Chile, adolescents' underperformance in reading comprehension is still a serious concern. In English, core academic-language skills (CALS) have been found to significantly predict reading comprehension, even controlling for academic vocabulary knowledge. CALS are high-utility language skills that…
Descriptors: Reading Achievement, Foreign Countries, Academic Discourse, Reading Comprehension
Xiao, Yang; Han, Jing; Koenig, Kathleen; Xiong, Jianwen; Bao, Lei – Physical Review Physics Education Research, 2018
Assessment instruments composed of two-tier multiple choice (TTMC) items are widely used in science education as an effective method to evaluate students' sophisticated understanding. In practice, however, there are often concerns regarding the common scoring methods of TTMC items, which include pair scoring and individual scoring schemes. The…
Descriptors: Hierarchical Linear Modeling, Item Response Theory, Multiple Choice Tests, Case Studies
Wind, Stefanie A.; Engelhard, George, Jr. – Educational and Psychological Measurement, 2016
Mokken scale analysis is a probabilistic nonparametric approach that offers statistical and graphical tools for evaluating the quality of social science measurement without placing potentially inappropriate restrictions on the structure of a data set. In particular, Mokken scaling provides a useful method for evaluating important measurement…
Descriptors: Nonparametric Statistics, Statistical Analysis, Measurement, Psychometrics
Covitt, Beth A.; Gunckel, Kristin L.; Caplan, Bess; Syswerda, Sara – Applied Measurement in Education, 2018
While learning progressions (LPs) hold promise as instructional tools, researchers are still in the early stages of understanding how teachers use LPs in formative assessment practices. We report on a study that assessed teachers' proficiency in using a LP for student ideas about hydrologic systems. Research questions were: (a) what were teachers'…
Descriptors: Skill Development, Behavioral Objectives, Formative Evaluation, Student Evaluation
Martinková, Patricia; Drabinová, Adéla; Liaw, Yuan-Ling; Sanders, Elizabeth A.; McFarland, Jenny L.; Price, Rebecca M. – CBE - Life Sciences Education, 2017
We provide a tutorial on differential item functioning (DIF) analysis, an analytic method useful for identifying potentially biased items in assessments. After explaining a number of methodological approaches, we test for gender bias in two scenarios that demonstrate why DIF analysis is crucial for developing assessments, particularly because…
Descriptors: Test Bias, Test Items, Gender Bias, Science Tests
Abdallah, Abdallah A. – Online Submission, 2018
This study conducted to assess the effectiveness of peer observation on enhancing teacher competence among teacher trainees during teaching practice: A case study of Mwenge Catholic University. The study used a cross-sectional design whereby the target population included first, second and third year education students, lecturers, and TP from…
Descriptors: Preservice Teachers, Foreign Countries, Teacher Competencies, Case Studies
Kornilov, Sergey A.; Kornilova, Tatiana V.; Grigorenko, Elena L. – New Directions for Child and Adolescent Development, 2016
Unlike intelligence, creativity has rarely been investigated from the standpoint of cross-cultural invariance of the structure of the instruments used to measure it. In the study reported in this article, we investigated the cross-cultural invariance of expert ratings of creative stories written by undergraduate students from the Russian…
Descriptors: Creative Writing, Cross Cultural Studies, Case Studies, Undergraduate Students
Oliveri, Maria Elena; Lawless, Rene; Robin, Frederic; Bridgeman, Brent – Applied Measurement in Education, 2018
We analyzed a pool of items from an admissions test for differential item functioning (DIF) for groups based on age, socioeconomic status, citizenship, or English language status using Mantel-Haenszel and item response theory. DIF items were systematically examined to identify its possible sources by item type, content, and wording. DIF was…
Descriptors: Test Bias, Comparative Analysis, Item Banks, Item Response Theory
Linking Errors between Two Populations and Tests: A Case Study in International Surveys in Education
Hastedt, Dirk; Desa, Deana – Practical Assessment, Research & Evaluation, 2015
This simulation study was prompted by the current increased interest in linking national studies to international large-scale assessments (ILSAs) such as IEA's TIMSS, IEA's PIRLS, and OECD's PISA. Linkage in this scenario is achieved by including items from the international assessments in the national assessments on the premise that the average…
Descriptors: Case Studies, Simulation, International Programs, Testing Programs
Kortemeyer, Gerd – Journal of Science Education and Technology, 2016
Classroom response systems (often referred to as "clickers") have slowly gained adoption over the recent decade; however, critics frequently doubt their pedagogical value starting with the validity of the gathered responses: There is concern that students simply "click" random answers. This case study looks at different…
Descriptors: Audience Response Systems, Case Studies, Psychometrics, Reliability
Wagler, Amy; Wagler, Ron – International Journal of Science Education, 2013
The Measure of Acceptance of the Theory of Evolution (MATE) was constructed to be a single-factor instrument that assesses an individual's overall acceptance of evolutionary theory. The MATE was validated and the scores resulting from the MATE were found to be reliable for the population of inservice high school biology teachers. However, many…
Descriptors: Evolution, Theories, Measures (Individuals), Preservice Teachers
Markos, Angelos; Boubonari, Theodora; Mogias, Athanasios; Kevrekidis, Theodoros – Environmental Education Research, 2017
The aim of the present study was to respond to the increasing demand for comprehensive tools for the measurement of ocean literacy, by investigating the psychometric characteristics of a Greek version of the Survey of Ocean Literacy and Experience (SOLE), an instrument that assesses conceptual understanding of general ocean sciences content,…
Descriptors: Literacy, Oceanography, Measurement Techniques, Psychometrics