ERIC - Search Results

Publication Date

In 2025	103
Since 2024	950
Since 2021 (last 5 years)	3486
Since 2016 (last 10 years)	7671
Since 2006 (last 20 years)	14844

Descriptor

Test Reliability	14596
Test Validity	9898
Reliability	9570
Foreign Countries	6774
Test Construction	4627
Validity	4130
Measures (Individuals)	3759
Factor Analysis	3728
Psychometrics	3406
Interrater Reliability	3068
Correlation	3013
Evaluation Methods	2687
Statistical Analysis	2527
Higher Education	2480
Questionnaires	2419
Scores	2326
College Students	2146
Student Attitudes	2068
Comparative Analysis	1930
Factor Structure	1759
Student Evaluation	1654
Rating Scales	1582
Measurement Techniques	1543
Elementary Secondary Education	1480
Test Items	1470
More ▼

Author

Thompson, Bruce	44
Tindal, Gerald	41
Raykov, Tenko	39
Erford, Bradley T.	37
Marsh, Herbert W.	36
Feldt, Leonard S.	33
Fraser, Barry J.	33
Brennan, Robert L.	32
Alonzo, Julie	31
Matson, Johnny L.	29
Zimmerman, Donald W.	29
Epstein, Michael H.	26
Briesch, Amy M.	24
Tsai, Chin-Chung	24
Lane, Kathleen Lynne	23
Petscher, Yaacov	23
Anderson, Daniel	22
Hambleton, Ronald K.	22
Michael, William B.	22
Reckase, Mark D.	22
Huynh, Huynh	21
Livingston, Samuel A.	21
Attali, Yigal	19
Elliott, Stephen N.	19
More ▼

Publication Type

Journal Articles	18646
Reports - Research	16837
Reports - Evaluative	3313
Speeches/Meeting Papers	1852
Reports - Descriptive	1526
Tests/Questionnaires	1523
Information Analyses	925
Dissertations/Theses -…	652
Opinion Papers	645
Guides - Non-Classroom	323
Numerical/Quantitative Data	249
Books	124
Guides - Classroom - Teacher	80
Reports - General	70
Guides - General	56
Reference Materials -…	53
Collected Works - General	39
Book/Product Reviews	38
Collected Works - Serials	35
Collected Works - Proceedings	32
ERIC Publications	31
Multilingual/Bilingual…	26
Dissertations/Theses	21
ERIC Digests in Full Text	20
Guides - Classroom - Learner	15
More ▼

Education Level

Higher Education	4499
Postsecondary Education	3512
Secondary Education	2143
Elementary Education	2095
High Schools	1027
Middle Schools	985
Elementary Secondary Education	853
Early Childhood Education	834
Junior High Schools	682
Primary Education	404
Intermediate Grades	375
Preschool Education	375
Grade 5	326
Grade 8	322
Grade 4	305
Grade 6	291
Grade 7	273
Grade 3	263
Kindergarten	258
Adult Education	205
Grade 1	197
Grade 2	165
Grade 9	152
Grade 10	138
Grade 11	101
More ▼

Audience

Researchers	703
Practitioners	447
Teachers	204
Administrators	121
Policymakers	62
Counselors	42
Students	37
Parents	11
Community	7
Media Staff	5
Support Staff	5
More ▼

Location

Turkey	1249
Australia	428
Canada	371
China	332
United States	265
United Kingdom	246
Taiwan	222
Netherlands	217
Indonesia	215
California	208
Spain	204
United Kingdom (England)	188
Germany	187
Malaysia	164
Florida	159
Hong Kong	159
Iran	146
Nigeria	146
Texas	130
South Korea	124
India	117
New York	117
Pennsylvania	110
South Africa	107
Greece	103
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	8
Meets WWC Standards with or without Reservations	9
Does not meet standards	6

Showing 61 to 75 of 26,435 results Save | Export

An Exploration of "Real Time" Assessments as a Means to Better Understand Preceptors' Judgments of Student Performance

Peer reviewed

Direct link

Luu, Kimberly; Sidhu, Ravi; Chadha, Neil K.; Eva, Kevin W. – Advances in Health Sciences Education, 2023

Clinical supervisors are known to assess trainee performance idiosyncratically, causing concern about the validity of their ratings. The literature on this issue relies heavily on retrospective collection of decisions, resulting in the risk of inaccurate information regarding what actually drives raters' perceptions. Capturing in-the-moment…

Descriptors: Clinical Experience, Practicum Supervision, Student Evaluation, Evaluation Methods

Reliability and Validity of Representational Mind-Mindedness in Mothers of Infants

Peer reviewed

Direct link

Egmose, Ida; Skou, Mia; Madsen, Eva Back; Stuart, Anne Christine; Krogh, Marianne Thode; Haase, Tina Wahl; Vaever, Mette Skovgaard – European Journal of Developmental Psychology, 2023

Mind-mindedness (MM) refers to the parent's ability to treat the child as an individual with a mind of his or her own. Studies have found representational and interactional MM to predict child development, but more research is needed on the validity of representational MM in parents of infants. Therefore, we examine the reliability and validity of…

Descriptors: Individualism, Mothers, Infants, Foreign Countries

An Experimental Study of Standard Setting Methods for Diagnostic Profiles

Direct link

Feldberg, Zachary R. – ProQuest LLC, 2023

Cognitive diagnostic models (CDMs) provide pedagogically relevant information in the form of a student profile of multiple binary categorizations of students into mastery or nonmastery statuses on latent traits called attributes. Federal educational accountability requires accountability measures to designate students into one of at least three…

Descriptors: Accountability, Standards, Cutting Scores, Models

"Rater Training" Re-Imagined for Work-Based Assessment in Medical Education

Peer reviewed

Direct link

Tavares, Walter; Kinnear, Benjamin; Schumacher, Daniel J.; Forte, Milena – Advances in Health Sciences Education, 2023

In this perspective, the authors critically examine "rater training" as it has been conceptualized and used in medical education. By "rater training," they mean the educational events intended to "improve" rater performance and contributions during assessment events. Historically, rater training programs have focused…

Descriptors: Medical Education, Interrater Reliability, Evaluation Methods, Training

Visualizing Agreement: Bland-Altman Plots as a Supplement to Inter-Rater Reliability Indices

Peer reviewed

Direct link

Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024

Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…

Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques

Psychometric Synthesis of the Drug Abuse Screening Test (DAST) Versions

Peer reviewed

Direct link

Erin Johnson; Samantha Barstack; Yikai Xu; Hannah Wise; Bradley T. Erford; Catharina Chang; David Delmonico – Measurement and Evaluation in Counseling and Development, 2025

Problem Statement: Among individuals aged 12 years or older, 14.3% (40.0 million) reporting the use of an illicit drug in the previous year. Given the prevalence of drug abuse, it is increasingly important to determine effective screening practices, treatment procedures, and best practices among various subpopulations to identify drug use-related…

Descriptors: Drug Abuse, Screening Tests, Psychometrics, Synthesis

Superficially Plausible Outputs from a Black Box: Problematising GenAI Tools for Analysing Qualitative SoTL Data

Peer reviewed
PDF on ERIC

Download full text

Mirjam Sophia Glessmer; Rachel Forsyth – Teaching & Learning Inquiry, 2025

Generative AI tools (GenAI) are increasingly used for academic tasks, including qualitative data analysis for the Scholarship of Teaching and Learning (SoTL). In our practice as academic developers, we are frequently asked for advice on whether this use for GenAI is reliable, valid, and ethical. Since this is a new field, we have not been able to…

Descriptors: Artificial Intelligence, Research Methodology, Data Analysis, Scholarship

Examining the Psychometric Impact of Targeted and Random Double-Scoring in Mixed-Format Assessments

Peer reviewed

Direct link

Yangmeng Xu; Stefanie A. Wind – Educational Measurement: Issues and Practice, 2025

Double-scoring constructed-response items is a common but costly practice in mixed-format assessments. This study explored the impacts of Targeted Double-Scoring (TDS) and random double-scoring procedures on the quality of psychometric outcomes, including student achievement estimates, person fit, and student classifications under various…

Descriptors: Academic Achievement, Psychometrics, Scoring, Evaluation Methods

The Behavior Problem Inventory--Short Form: Psychometric Properties in a Spanish Sample of Intellectual Disabilities

Peer reviewed

Direct link

Juliana Reyes-Martin; David Simó-Pinatella; Ana Andrés – Journal of Applied Research in Intellectual Disabilities, 2025

Background: Behavioural problems in individuals with intellectual disabilities have a negative impact on them. Limited assessment measures exist in Spain. This study aimed to validate the Behavior Problems Inventory--Short Form (BPI-S) in the Spanish population by examining its psychometric properties and factorial structures. Method: This study…

Descriptors: Foreign Countries, Behavior Problems, Students with Disabilities, Intellectual Disability

GPT-4 in Education: Evaluating Aptness, Reliability, and Loss of Coherence in Solving Calculus Problems and Grading Submissions

Peer reviewed

Direct link

Alberto Gandolfi – International Journal of Artificial Intelligence in Education, 2025

In this paper, we initially investigate the capabilities of GPT-3 5 and GPT-4 in solving college-level calculus problems, an essential segment of mathematics that remains under-explored so far. Although improving upon earlier versions, GPT-4 attains approximately 65% accuracy for standard problems and decreases to 20% for competition-like…

Descriptors: Artificial Intelligence, Reliability, Problem Solving, Mathematics Skills

The Living Codebook: Documenting the Process of Qualitative Data Analysis

Peer reviewed

Direct link

Victoria Reyes; Elizabeth Bogumil; Levin Elias Welch – Sociological Methods & Research, 2024

Transparency is once again a central issue of debate across types of qualitative research. Work on how to conduct qualitative data analysis, on the other hand, walks us through the step-by-step process on how to code and understand the data we've collected. Although there are a few exceptions, less focus is on transparency regarding…

Descriptors: Qualitative Research, Data Analysis, Guides, Databases

Detecting Rater Bias in Mixed-Format Assessments

Peer reviewed

Direct link

Stefanie A. Wind; Yuan Ge – Measurement: Interdisciplinary Research and Perspectives, 2024

Mixed-format assessments made up of multiple-choice (MC) items and constructed response (CR) items that are scored using rater judgments include unique psychometric considerations. When these item types are combined to estimate examinee achievement, information about the psychometric quality of each component can depend on that of the other. For…

Descriptors: Interrater Reliability, Test Bias, Multiple Choice Tests, Responses

Reliability of Quadriceps Twitch Muscle Properties and Explosive Voluntary Contractions at Different Knee Joint Angles

Peer reviewed

Direct link

Haiko Bruno Zimmermann; Debora Knihs; Raphael Sakugawa; Chris Bishop; Juliano Dal Pupo – Measurement in Physical Education and Exercise Science, 2024

Background: Measures that assess muscle strength and its development, either voluntarily or involuntarily, are important in the clinical and research context. The main aim of this study was to verify the interday reliability and the minimum detectable change (MDC) of the knee extensors muscles torque using evoked contractions and explosive…

Descriptors: Human Body, Physiology, Motor Reactions, Muscular Strength

Using Regularization to Identify Measurement Bias across Multiple Background Characteristics: A Penalized Expectation-Maximization Algorithm

Peer reviewed

Direct link

William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024

Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity, etc.), and regularization has shown promising results for identifying DIF among…

Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement

Modeling the Intraindividual Relation of Ability and Speed within a Test

Peer reviewed

Direct link

Augustin Mutak; Robert Krause; Esther Ulitzsch; Sören Much; Jochen Ranger; Steffi Pohl – Journal of Educational Measurement, 2024

Understanding the intraindividual relation between an individual's speed and ability in testing scenarios is essential to assure a fair assessment. Different approaches exist for estimating this relationship, that either rely on specific study designs or on specific assumptions. This paper aims to add to the toolbox of approaches for estimating…

Descriptors: Testing, Academic Ability, Time on Task, Correlation

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 1763

Educational and Psychological…	810
ProQuest LLC	639
Journal of Psychoeducational…	378
Online Submission	324
Journal of Educational…	242
Measurement and Evaluation in…	230
Journal of Autism and…	224
Psychology in the Schools	210
Psychological Assessment	180
Grantee Submission	178
Journal of Speech, Language,…	170
Measurement in Physical…	161
Applied Psychological…	149
Assessment for Effective…	134
Journal of Consulting and…	131
Educational Research and…	130
Psychometrika	120
Research on Social Work…	120
Educational Sciences: Theory…	119
Assessment & Evaluation in…	118
Language Testing	115
International Journal of…	112
Applied Measurement in…	111
ETS Research Report Series	105
Assessment	100
More ▼

No Child Left Behind Act 2001	136
Individuals with Disabilities…	43
Race to the Top	27
Elementary and Secondary…	19
Every Student Succeeds Act…	19
Elementary and Secondary…	15
Individuals with Disabilities…	11
American Recovery and…	10
Rehabilitation Act 1973…	8
Americans with Disabilities…	5
Elementary and Secondary…	5
Education Consolidation…	4
Education for All Handicapped…	4
Head Start	4
Individuals with Disabilities…	4
Adoption and Safe Families…	2
Child Abuse Prevention and…	2
Comprehensive Employment and…	2
Education Amendments 1974	2
Education of the Handicapped…	2
Elementary and Secondary…	2
Individuals with Disabilities…	2
Individuals with Disabilities…	2
Kentucky Education Reform Act…	2
Title IX Education Amendments…	2
More ▼

General Aptitude Test Battery	463
Wechsler Intelligence Scale…	173
Peabody Picture Vocabulary…	87
SAT (College Admission Test)	85
Test of English as a Foreign…	78
Wechsler Adult Intelligence…	74
Strengths and Difficulties…	62
Program for International…	59
Child Behavior Checklist	57
National Assessment of…	56
Minnesota Multiphasic…	52
Stanford Achievement Tests	52
ACT Assessment	49
Beck Depression Inventory	48
Stanford Binet Intelligence…	45
Woodcock Johnson Tests of…	44
Autism Diagnostic Observation…	43
Motivated Strategies for…	43
Behavior Assessment System…	42
Raven Progressive Matrices	42
Graduate Record Examinations	41
Iowa Tests of Basic Skills	41
Marlowe Crowne Social…	41
Kaufman Assessment Battery…	38
Vineland Adaptive Behavior…	36
More ▼