ERIC - Search Results

Publication Date

In 2025	67
Since 2024	901
Since 2021 (last 5 years)	3415
Since 2016 (last 10 years)	7595
Since 2006 (last 20 years)	14761

Descriptor

Test Reliability	14547
Test Validity	9865
Reliability	9544
Foreign Countries	6751
Test Construction	4608
Validity	4120
Measures (Individuals)	3750
Factor Analysis	3720
Psychometrics	3393
Interrater Reliability	3054
Correlation	3009
Evaluation Methods	2674
Statistical Analysis	2527
Higher Education	2475
Questionnaires	2412
Scores	2324
College Students	2141
Student Attitudes	2060
Comparative Analysis	1930
Factor Structure	1755
Student Evaluation	1647
Rating Scales	1580
Measurement Techniques	1539
Elementary Secondary Education	1478
Test Items	1467
More ▼

Author

Thompson, Bruce	44
Tindal, Gerald	41
Raykov, Tenko	39
Erford, Bradley T.	37
Marsh, Herbert W.	36
Feldt, Leonard S.	33
Fraser, Barry J.	33
Brennan, Robert L.	32
Alonzo, Julie	31
Matson, Johnny L.	29
Zimmerman, Donald W.	29
Epstein, Michael H.	26
Briesch, Amy M.	24
Tsai, Chin-Chung	24
Lane, Kathleen Lynne	23
Petscher, Yaacov	23
Anderson, Daniel	22
Hambleton, Ronald K.	22
Michael, William B.	22
Reckase, Mark D.	22
Huynh, Huynh	21
Livingston, Samuel A.	21
Attali, Yigal	19
Elliott, Stephen N.	19
More ▼

Publication Type

Journal Articles	18597
Reports - Research	16784
Reports - Evaluative	3311
Speeches/Meeting Papers	1851
Reports - Descriptive	1526
Tests/Questionnaires	1520
Information Analyses	923
Opinion Papers	645
Dissertations/Theses -…	625
Guides - Non-Classroom	323
Numerical/Quantitative Data	249
Books	118
Guides - Classroom - Teacher	80
Reports - General	71
Guides - General	56
Reference Materials -…	53
Collected Works - General	39
Book/Product Reviews	38
Collected Works - Serials	35
Collected Works - Proceedings	32
ERIC Publications	31
Multilingual/Bilingual…	26
Dissertations/Theses	21
ERIC Digests in Full Text	20
Guides - Classroom - Learner	15
More ▼

Education Level

Higher Education	4474
Postsecondary Education	3487
Secondary Education	2126
Elementary Education	2086
High Schools	1018
Middle Schools	980
Elementary Secondary Education	851
Early Childhood Education	833
Junior High Schools	678
Primary Education	403
Intermediate Grades	374
Preschool Education	374
Grade 5	325
Grade 8	322
Grade 4	305
Grade 6	291
Grade 7	273
Grade 3	263
Kindergarten	257
Adult Education	205
Grade 1	197
Grade 2	165
Grade 9	152
Grade 10	137
Grade 11	101
More ▼

Audience

Researchers	703
Practitioners	447
Teachers	204
Administrators	121
Policymakers	62
Counselors	42
Students	37
Parents	11
Community	7
Media Staff	5
Support Staff	5
More ▼

Location

Turkey	1246
Australia	428
Canada	371
China	329
United States	264
United Kingdom	246
Taiwan	221
Netherlands	217
Indonesia	214
California	208
Spain	201
United Kingdom (England)	188
Germany	187
Malaysia	164
Florida	159
Hong Kong	159
Nigeria	146
Iran	145
Texas	130
South Korea	124
India	117
New York	117
Pennsylvania	109
South Africa	107
Greece	103
More ▼

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	8
Meets WWC Standards with or without Reservations	9
Does not meet standards	6

Showing 196 to 210 of 26,352 results Save | Export

Development and Preliminary Validation of the Evaluation Use Scale for Evaluation Systems (EUS-ES)

Peer reviewed

Direct link

Roberto Brazileio Paixão; Michael C. Rodriguez – Educational Research and Evaluation, 2023

The usefulness of evaluation is critical. Evaluation use occurs when, from its results or process, decisions are made about the program, it changes people's mindsets, or persuasive or legitimation actions happen (instrumental, conceptual, and symbolic uses respectively). Few quantitative evaluation use studies have been conducted in recent years.…

Descriptors: Measures (Individuals), College Faculty, Test Validity, Test Reliability

Improving Peer Assessment Validity and Reliability Through a Fuzzy Coherence Measure

Peer reviewed

Direct link

El Alaoui, Mohamed – IEEE Transactions on Learning Technologies, 2023

Classical evaluation methods, assessments, exams, and so forth accentuate the perception of one against all, professor versus learners. Including students in the assessment process, allows transforming the professor from an opponent to a critical friend, with the role of helping students to recognize both their strengths and weaknesses. However,…

Descriptors: Peer Evaluation, Educational Improvement, Test Validity, Test Reliability

Approaches to Estimating Longitudinal Diagnostic Classification Models

Peer reviewed
PDF on ERIC

Download full text

Matthew J. Madison; Seungwon Chung; Junok Kim; Laine P. Bradshaw – Grantee Submission, 2023

Recent developments have enabled the modeling of longitudinal assessment data in a diagnostic classification model (DCM) framework. These longitudinal DCMs were developed to provide measures of student growth on a discrete scale in the form of attribute mastery transitions, thereby supporting categorical and criterion-referenced interpretations of…

Descriptors: Models, Cognitive Measurement, Diagnostic Tests, Classification

Do Mathematicians and Undergraduates Agree about Explanation Quality?

Peer reviewed

Direct link

Evans, Tanya; Mejía-Ramos, Juan Pablo; Inglis, Matthew – Educational Studies in Mathematics, 2022

Offering explanations is a central part of teaching mathematics, and understanding those explanations is a vital activity for learners. Given this, it is natural to ask what makes a good mathematical explanation. This question has received surprisingly little attention in the mathematics education literature, perhaps because the field has no…

Descriptors: Mathematics, Professional Personnel, Undergraduate Students, Mathematics Activities

What Factors Influence Children's Creative Artistic Orientation? The Novel "Children's Creative Orientation Test: Artistic"

Peer reviewed

Direct link

Simner, Julia; Smees, Rebecca; Rinaldi, Louisa J.; Carmichael, Duncan A.; McDonald, Toby J. – Journal of Creative Behavior, 2022

Creative orientation is the extent to which different individuals are drawn toward creative activities (e.g., art, music). We know relatively little about child-level creative orientation given certain testing limitations. Adult tools often measure time spent engaged in creative pursuits, but this method is unsuitable for children because their…

Descriptors: Influences, Creativity, Creative Activities, Measures (Individuals)

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

Does Confidence in a Wrong Answer Imply a Misconception?

Peer reviewed

Direct link

Hull, Michael M.; Jansky, Alexandra; Hopf, Martin – Physical Review Physics Education Research, 2022

Our study investigates whether confidence correlates with consistency in reasoning, specifically about radioactive decay. In prior work, we developed and tested a survey designed to measure consistency of student reasoning about radioactive decay by comparing responses to three prompts that are isomorphic, meaning that, despite having different…

Descriptors: Students, Self Esteem, Responses, Accuracy

On a New Conception of Evaluation: Exploring the Potential for Human Performance Assessment Grounded in Peircean Semiotics

Direct link

Eric Jones – ProQuest LLC, 2022

The assessment of human performance is not a new phenomenon. We have evidence that people have been required to prove their worth dating back at least to the Epic of Gilgamesh. What has changed, at least on a large scale, is the importance given to quantitative evidence in the evaluation process. For example, many employers have begun subjecting…

Descriptors: Performance Based Assessment, Evaluation Methods, Semiotics, Theories

Beyond a Coefficient: An Interactive Process for Achieving Inter-Rater Consistency in Qualitative Coding

Peer reviewed
PDF on ERIC

Download full text

Direct link

Vonna L. Hemmler; Allison W. Kenney; Susan Dulong Langley; Carolyn M. Callahan; E. Jean Gubbins; Shannon Holder – Grantee Submission, 2022

Though qualitative research has become more prevalent in practice over the last 30 years, there is still considerable uncertainty among researchers regarding how to ensure inter-rater consistency when teams are tasked with coding qualitative data. In this article, we offer an explanation of a methodology our qualitative team used to achieve…

Descriptors: Interrater Reliability, Coding, Guides, Data Collection

Digital Module 12: Think-Aloud Interviews and Cognitive Labs https://ncme.elevate.commpartners.com

Peer reviewed

Direct link

Leighton, Jacqueline P.; Lehman, Blair – Educational Measurement: Issues and Practice, 2020

In this digital ITEMS module, Dr. Jacqueline Leighton and Dr. Blair Lehman review differences between think-aloud interviews to measure problem-solving processes and cognitive labs to measure comprehension processes. Learners are introduced to historical, theoretical, and procedural differences between these methods and how to use and analyze…

Descriptors: Protocol Analysis, Interviews, Problem Solving, Cognitive Processes

A Closed-Form Alternative for Estimating [omega] Reliability under Unidimensionality

Peer reviewed

Direct link

Hancock, Gregory R.; An, Ji – Measurement: Interdisciplinary Research and Perspectives, 2020

As an alternative to Cronbach's [alpha] for estimating scale reliability, McDonald's [omega] has attracted increased attention within the methodological community for its less stringent measurement assumptions. Notwithstanding, [omega] is still seldom used by practitioners, likely due to its unavailability in popular software packages (e.g., SPSS)…

Descriptors: Evaluation, Alternative Assessment, Reliability, Test Reliability

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Structural Validity, Internal Consistency, and Rater Reliability of the Modified Barium Swallow Impairment Profile: Breaking Ground on a 52,726-Patient, Clinical Data Set

Peer reviewed

Direct link

Clain, Alex E.; Alkhuwaiter, Munirah; Davidson, Kate; Martin-Harris, Bonnie – Journal of Speech, Language, and Hearing Research, 2022

Purpose: The purpose of this study was to extend the assessment of the psychometric properties of the Modified Barium Swallow Impairment Profile (MBSImP). Here, we re-examined structural validity and internal consistency using a large clinical-registry data set and formally examined rater reliability in a smaller data set. Method: This study…

Descriptors: Diagnostic Tests, Disability Identification, Physical Disabilities, Eating Disorders

The Impact of Inconsistent Responders to Mixed-Worded Scales on Inferences in International Large-Scale Assessments

Peer reviewed

Direct link

Steinmann, Isa; Sánchez, Daniel; van Laar, Saskia; Braeken, Johan – Assessment in Education: Principles, Policy & Practice, 2022

Questionnaire scales that are mixed-worded, i.e. include both positively and negatively worded items, often suffer from issues like low reliability and more complex latent structures than intended. Part of the problem might be that some responders fail to respond consistently to the mixed-worded items. We investigated the prevalence and impact of…

Descriptors: Response Style (Tests), Test Items, Achievement Tests, Foreign Countries

Measuring Returns to Experience Using Supervisor Ratings of Observed Performance: The Case of Classroom Teachers

Peer reviewed

Direct link

Courtney Bell; Jessalynn James; Eric S. Taylor; James Wyckoff – Journal of Policy Analysis and Management, 2025

We study the returns to experience in teaching, estimated using supervisor ratings from classroom observations. We describe the assumptions required to interpret changes in observation ratings over time as the causal effect of experience on performance. We compare two difference-in-differences strategies: the two-way fixed effects estimator common…

Descriptors: Lesson Observation Criteria, Teaching Experience, Teacher Evaluation, Supervisors

« Previous Page | Next Page »

Pages: 1 | ... | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | ... | 1757

Educational and Psychological…	810
ProQuest LLC	612
Journal of Psychoeducational…	378
Online Submission	324
Journal of Educational…	242
Measurement and Evaluation in…	230
Journal of Autism and…	219
Psychology in the Schools	210
Psychological Assessment	180
Grantee Submission	177
Journal of Speech, Language,…	170
Measurement in Physical…	161
Applied Psychological…	149
Assessment for Effective…	134
Journal of Consulting and…	131
Educational Research and…	130
Psychometrika	120
Research on Social Work…	120
Educational Sciences: Theory…	119
Assessment & Evaluation in…	118
Language Testing	115
International Journal of…	112
Applied Measurement in…	111
ETS Research Report Series	101
Assessment	100
More ▼

No Child Left Behind Act 2001	136
Individuals with Disabilities…	43
Race to the Top	27
Elementary and Secondary…	19
Every Student Succeeds Act…	19
Elementary and Secondary…	15
Individuals with Disabilities…	11
American Recovery and…	10
Rehabilitation Act 1973…	8
Americans with Disabilities…	5
Elementary and Secondary…	5
Education Consolidation…	4
Education for All Handicapped…	4
Head Start	4
Individuals with Disabilities…	4
Adoption and Safe Families…	2
Child Abuse Prevention and…	2
Comprehensive Employment and…	2
Education Amendments 1974	2
Education of the Handicapped…	2
Elementary and Secondary…	2
Individuals with Disabilities…	2
Individuals with Disabilities…	2
Kentucky Education Reform Act…	2
Title IX Education Amendments…	2
More ▼

General Aptitude Test Battery	463
Wechsler Intelligence Scale…	173
Peabody Picture Vocabulary…	87
SAT (College Admission Test)	85
Test of English as a Foreign…	78
Wechsler Adult Intelligence…	74
Strengths and Difficulties…	62
Program for International…	58
Child Behavior Checklist	57
National Assessment of…	55
Minnesota Multiphasic…	52
Stanford Achievement Tests	52
ACT Assessment	49
Beck Depression Inventory	48
Stanford Binet Intelligence…	45
Woodcock Johnson Tests of…	44
Autism Diagnostic Observation…	43
Motivated Strategies for…	43
Behavior Assessment System…	42
Raven Progressive Matrices	42
Graduate Record Examinations	41
Iowa Tests of Basic Skills	41
Marlowe Crowne Social…	41
Kaufman Assessment Battery…	38
Vineland Adaptive Behavior…	36
More ▼