ERIC - Search Results

Publication Date

In 2025	2
Since 2024	11
Since 2021 (last 5 years)	39
Since 2016 (last 10 years)	124
Since 2006 (last 20 years)	393

Descriptor

Test Theory	393
Item Response Theory	139
Test Items	107
Foreign Countries	104
Psychometrics	93
Test Reliability	93
Test Validity	82
Scores	79
Comparative Analysis	63
Models	56
Test Construction	56
Correlation	53
Reliability	50
Statistical Analysis	49
Evaluation Methods	46
Difficulty Level	44
Error of Measurement	42
Item Analysis	40
Measures (Individuals)	39
Factor Analysis	34
Measurement Techniques	34
Testing	34
Educational Assessment	32
Student Evaluation	32
Computation	29
More ▼

Publication Type

Journal Articles	332
Reports - Research	206
Reports - Evaluative	84
Reports - Descriptive	53
Opinion Papers	31
Dissertations/Theses -…	26
Tests/Questionnaires	10
Numerical/Quantitative Data	8
Information Analyses	7
Speeches/Meeting Papers	7
Guides - Non-Classroom	2
Books	1
Collected Works - General	1
Dissertations/Theses -…	1
Guides - Classroom - Teacher	1
Reference Materials -…	1
Reports - General	1
More ▼

Education Level

Higher Education	90
Postsecondary Education	63
Secondary Education	47
Elementary Education	35
Elementary Secondary Education	29
Middle Schools	25
High Schools	24
Junior High Schools	21
Grade 8	17
Grade 7	14
Grade 4	11
Grade 6	11
Adult Education	10
Grade 5	9
Early Childhood Education	8
Intermediate Grades	8
Grade 3	7
Preschool Education	4
Primary Education	4
Grade 10	3
Grade 9	3
Kindergarten	3
Grade 1	2
Grade 12	2
Grade 2	2
More ▼

Audience

Practitioners	3
Teachers	3
Counselors	1
Researchers	1

Location

United States	16
Turkey	12
United Kingdom (England)	10
United Kingdom	7
Australia	6
Sweden	6
Taiwan	6
Texas	6
Tennessee	5
Canada	4
Colorado	4
Florida	4
Japan	4
New York	4
Spain	4
California	3
Chile	3
China	3
Germany	3
Illinois	3
Indonesia	3
Italy	3
Nigeria	3
Turkey (Ankara)	3
Hong Kong	2
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	4
Individuals with Disabilities…	3

What Works Clearinghouse Rating

Showing 1 to 15 of 393 results Save | Export

Latent Trait Item Response Models for Continuous Responses

Peer reviewed

Direct link

Gerhard Tutz; Pascal Jordan – Journal of Educational and Behavioral Statistics, 2024

A general framework of latent trait item response models for continuous responses is given. In contrast to classical test theory (CTT) models, which traditionally distinguish between true scores and error scores, the responses are clearly linked to latent traits. It is shown that CTT models can be derived as special cases, but the model class is…

Descriptors: Item Response Theory, Responses, Scores, Models

Comparison of Item Response Theory Ability and Item Parameters According to Classical and Bayesian Estimation Methods

Peer reviewed
PDF on ERIC

Download full text

Eray Selçuk; Ergül Demir – International Journal of Assessment Tools in Education, 2024

This research aims to compare the ability and item parameter estimations of Item Response Theory according to Maximum likelihood and Bayesian approaches in different Monte Carlo simulation conditions. For this purpose, depending on the changes in the priori distribution type, sample size, test length, and logistics model, the ability and item…

Descriptors: Item Response Theory, Item Analysis, Test Items, Simulation

A Dialectic on Validity: Explanation-Focused and the Many Ways of Being Human

Peer reviewed
PDF on ERIC

Download full text

Bruno D. Zumbo – International Journal of Assessment Tools in Education, 2023

In line with the journal volume's theme, this essay considers lessons from the past and visions for the future of test validity. In the first part of the essay, a description of historical trends in test validity since the early 1900s leads to the natural question of whether the discipline has progressed in its definition and description of test…

Descriptors: Test Theory, Test Validity, True Scores, Definitions

Examining Rating Quality in Rater-Mediated Activities for Standard-Item Alignment Research

Direct link

Yvette Jackson – ProQuest LLC, 2023

Rater-mediated activities in educational research occur when an expert judge or rater utilizes an instrument to judge persons or items and generates scale scores. Scale scores are from a subjective judgment and must undergo a quality control measure called rating quality. Rating quality in this study is broadly defined as the extent to which…

Descriptors: Educational Research, Evaluators, Test Theory, Item Response Theory

Comparison of the Results of the Generalizability Theory with the Inter-Rater Agreement Coefficients

Peer reviewed
PDF on ERIC

Download full text

Eser, Mehmet Taha; Aksu, Gökhan – International Journal of Curriculum and Instruction, 2022

The agreement between raters is examined within the scope of the concept of "inter-rater reliability". Although there are clear definitions of the concepts of agreement between raters and reliability between raters, there is no clear information about the conditions under which agreement and reliability level methods are appropriate to…

Descriptors: Generalizability Theory, Interrater Reliability, Evaluation Methods, Test Theory

Rasch Measurement v. Item Response Theory: Knowing When to Cross the Line

Peer reviewed
PDF on ERIC

Download full text

Stemler, Steven E.; Naples, Adam – Practical Assessment, Research & Evaluation, 2021

When students receive the same score on a test, does that mean they know the same amount about the topic? The answer to this question is more complex than it may first appear. This paper compares classical and modern test theories in terms of how they estimate student ability. Crucial distinctions between the aims of Rasch Measurement and IRT are…

Descriptors: Item Response Theory, Test Theory, Ability, Computation

Assessing the Fairness of Mathematical Literacy Test in Indonesia: Evidence from Gender-Based Differential Item Function Analysis

Peer reviewed
PDF on ERIC

Download full text

Kartianom Kartianom; Heri Retnawati; Kana Hidayati – Journal of Pedagogical Research, 2024

Conducting a fair test is important for educational research. Unfair assessments can lead to gender disparities in academic achievement, ultimately resulting in disparities in opportunities, wages, and career choice. Differential Item Function [DIF] analysis is presented to provide evidence of whether the test is truly fair, where it does not harm…

Descriptors: Foreign Countries, Test Bias, Item Response Theory, Test Theory

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

A Comparison of the Efficacies of Differential Item Functioning Detection Methods

Peer reviewed
PDF on ERIC

Download full text

Basman, Munevver – International Journal of Assessment Tools in Education, 2023

To ensure the validity of the tests is to check that all items have similar results across different groups of individuals. However, differential item functioning (DIF) occurs when the results of individuals with equal ability levels from different groups differ from each other on the same test item. Based on Item Response Theory and Classic Test…

Descriptors: Test Bias, Test Items, Test Validity, Item Response Theory

Programme Evaluation in Action: Theory to Practice from an Asian Educational Context

Peer reviewed

Direct link

Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024

Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…

Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria

Classical Item Analysis from a Signal Detection Perspective

Peer reviewed

Direct link

DeCarlo, Lawrence T. – Journal of Educational Measurement, 2023

A conceptualization of multiple-choice exams in terms of signal detection theory (SDT) leads to simple measures of item difficulty and item discrimination that are closely related to, but also distinct from, those used in classical item analysis (CIA). The theory defines a "true split," depending on whether or not examinees know an item,…

Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Test Wiseness

Further Validation of the Social Efficacy and Social Outcome Expectations Scale

Peer reviewed

Direct link

Stephen L. Wright; Michael A. Jenkins-Guarnieri – Journal of Psychoeducational Assessment, 2024

The current study sought out to advance the Social Self-Efficacy and Social Outcome Expectations scale using multiple approaches to scale development. Data from 583 undergraduate students were used in two scale development approaches: Classic Test Theory (CTT) and Item Response Theory (IRT). Confirmatory factor analysis suggested a 2-factor…

Descriptors: Measures (Individuals), Expectation, Self Efficacy, Item Response Theory

The PSI-20: Development of a Viable Short Form Alternative of the Problem Solving Inventory Using Item Response Theory

Peer reviewed

Direct link

Tyrone B. Pretorius; P. Paul Heppner; Anita Padmanabhanunni; Serena Ann Isaacs – SAGE Open, 2023

In previous studies, problem solving appraisal has been identified as playing a key role in promoting positive psychological well-being. The Problem Solving Inventory is the most widely used measure of problem solving appraisal and consists of 32 items. The length of the instrument, however, may limit its applicability to large-scale surveys…

Descriptors: Problem Solving, Measures (Individuals), Test Construction, Item Response Theory

Modeling Partial Knowledge in Multiple-Choice Cognitive Diagnostic Assessment

Peer reviewed

Direct link

Kentaro Fukushima; Nao Uchida; Kensuke Okada – Journal of Educational and Behavioral Statistics, 2025

Diagnostic tests are typically administered in a multiple-choice (MC) format due to their advantages of objectivity and time efficiency. The MC-deterministic input, noisy "and" gate (DINA) family of models, a representative class of cognitive diagnostic models for MC items, efficiently and parsimoniously estimates the mastery profiles of…

Descriptors: Diagnostic Tests, Cognitive Measurement, Multiple Choice Tests, Educational Assessment

Comparison of Classical Test Theory vs. Multi-Facet Rasch Theory

Peer reviewed
PDF on ERIC

Download full text

Polat, Murat; Turhan, Nihan S.; Toraman, Cetin – Pegem Journal of Education and Instruction, 2022

Testing English writing skills could be multi-dimensional; thus, the study aimed to compare students' writing scores calculated according to Classical Test Theory (CTT) and Multi-Facet Rasch Model (MFRM). The research was carried out in 2019 with 100 university students studying at a foreign language preparatory class and four experienced…

Descriptors: Comparative Analysis, Test Theory, Item Response Theory, Student Evaluation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 27

ProQuest LLC	26
Educational and Psychological…	19
Measurement:…	14
Online Submission	13
Assessment in Education:…	12
Applied Psychological…	11
International Journal of…	10
Journal of Educational…	10
International Journal of…	9
Educational Measurement:…	8
Journal of Educational and…	8
Educational Research and…	6
Practical Assessment,…	6
Applied Measurement in…	5
Astronomy Education Review	5
ETS Research Report Series	5
Educational Testing Service	5
Language Testing	5
Physical Review Physics…	5
Research Papers in Education	5
Behavioral Research and…	4
Educational Sciences: Theory…	4
Grantee Submission	4
Psychometrika	4
Advances in Health Sciences…	3
More ▼

Sinharay, Sandip	9
van der Linden, Wim J.	9
Prather, Edward E.	6
Andrich, David	5
Baird, Jo-Anne	5
Haberman, Shelby J.	5
Petscher, Yaacov	5
Mislevy, Robert J.	4
Tindal, Gerald	4
Wallace, Colin S.	4
Beddow, Peter A.	3
Kettler, Ryan J.	3
Lane, Kathleen Lynne	3
Lee, Young-Sun	3
Liu, Kimy	3
Marcoulides, George A.	3
Puhan, Gautam	3
Raykov, Tenko	3
Stobart, Gordon	3
Wiliam, Dylan	3
Bailey, Janelle M.	2
Bhatti, Muhammad Tariq	2
Bishop, Crystal Crowe	2
Bramley, Tom	2
Breivik, Einar	2
More ▼

National Assessment of…	6
ACT Assessment	5
Program for International…	5
SAT (College Admission Test)	5
Trends in International…	4
Strengths and Difficulties…	3
Advanced Placement…	2
Test of English as a Foreign…	2
Wechsler Intelligence Scale…	2
Armed Services Vocational…	1
Bayley Scales of Infant…	1
Center for Epidemiologic…	1
Defining Issues Test	1
Dyadic Adjustment Scale	1
English Proficiency Test	1
Eysenck Personality Inventory	1
Gates MacGinitie Reading Tests	1
General Aptitude Test Battery	1
Graduate Record Examinations	1
Kaufman Assessment Battery…	1
Kaufman Brief Intelligence…	1
Kaufman Test of Educational…	1
Law School Admission Test	1
Leadership Practices Inventory	1
Learning and Study Strategies…	1
More ▼