ERIC - Search Results

Publication Date

In 2025	2
Since 2024	15
Since 2021 (last 5 years)	68
Since 2016 (last 10 years)	171
Since 2006 (last 20 years)	439

Descriptor

Generalizability Theory	728
Reliability	168
Scores	146
Error of Measurement	133
Test Reliability	125
Interrater Reliability	120
Foreign Countries	102
Statistical Analysis	85
Evaluation Methods	82
Psychometrics	75
Research Methodology	67
Validity	66
Test Validity	64
Models	62
Comparative Analysis	59
Correlation	59
Higher Education	59
Scoring	59
Item Response Theory	57
Performance Based Assessment	57
Research Design	57
Test Items	54
Test Construction	49
Elementary School Students	48
Test Theory	47
More ▼

Education Level

Higher Education	115
Postsecondary Education	68
Elementary Education	59
Secondary Education	42
Middle Schools	33
Elementary Secondary Education	29
Early Childhood Education	24
Junior High Schools	22
Grade 8	17
Grade 3	15
Preschool Education	15
Grade 4	14
Grade 5	13
Primary Education	13
Grade 7	12
High Schools	12
Intermediate Grades	11
Adult Education	10
Grade 6	7
Kindergarten	7
Grade 10	6
Grade 9	6
Grade 1	4
Grade 2	4
Two Year Colleges	3
More ▼

Audience

Researchers	28
Practitioners	2
Policymakers	1
Students	1

Location

Turkey	14
Canada	10
United States	10
California	9
Netherlands	9
Australia	6
Germany	6
South Korea	6
Iowa	5
Norway	5
Turkey (Ankara)	5
United Kingdom	5
Florida	4
South Africa	4
Tennessee	4
China	3
Hong Kong	3
Indiana	3
Japan	3
North Carolina	3
Texas	3
Alabama	2
China (Beijing)	2
Colorado	2
Cyprus	2
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 46 to 60 of 728 results Save | Export

Preliminary Examination of the Stability of Sequential Associations between the Talk of Educators and Autistic Preschoolers Using Generalizability Theory

Peer reviewed

Direct link

Andrea L. B. Ford; Marianne Elmquist; LeAnne D. Johnson; Jon Tapp – Journal of Speech, Language, and Hearing Research, 2025

Purpose: Estimating the sequential associations between educators' and children's talk during language learning interactions requires careful consideration of factors that may impact measurement stability and resultant inferences. This research note will describe a preliminary study that used generalizability theory to understand the contribution…

Descriptors: Preschool Children, Preschool Curriculum, Preschool Education, Preschool Teachers

How/Should We Generalize?

Peer reviewed

Direct link

Erickson, Ainsley T. – History of Education Quarterly, 2020

Carl Kaestle defines a generalization as "how we know when we know." Kaestle sketches a model of increasing certainty in historical claims as they are developed and refined at increasing scales of research, from local to international. A historical claim might originate in the study of a particular place or case, but to know that the…

Descriptors: Generalization, Generalizability Theory, Historical Interpretation, Archives

Comparison of G and Phi Coefficients Estimated in Generalizability Theory with Real Cases

Peer reviewed
PDF on ERIC

Download full text

Deniz, Kaan Zulfikar; Ilican, Emel – International Journal of Assessment Tools in Education, 2021

This study aims to compare the G and Phi coefficients as estimated by D studies for a measurement tool with the G and Phi coefficients obtained from real cases in which items of differing difficulty levels were added and also to determine the conditions under which the D studies estimated reliability coefficients closer to reality. The study group…

Descriptors: Generalizability Theory, Test Items, Difficulty Level, Test Reliability

Robustness, Generalization and Fairness in Learning: Analysis and Design

Direct link

Zhun Deng – ProQuest LLC, 2021

Machine learning has achieved state-of-the-art performance in many areas, including image recognition and natural language processing. However, there are still many challenges and mysteries attracting numerous researchers. This dissertation comprises a series of works concerning problems at the intersection of computer science theory, adversarial…

Descriptors: Learning Analytics, Instructional Design, Artificial Intelligence, Computer Science

Decolonizing and Diversifying Research in Cognitive Development

Peer reviewed

Direct link

Leher Singh – Journal of Cognition and Development, 2024

This article serves as an introduction to the Special Issue on "Decolonizing and Diversifying Research in Cognitive Development." The Special Issue comprises six articles: two articles are empirical articles that focus on executive function development in under-represented environments, two articles address barriers pathways toward…

Descriptors: Decolonization, Cognitive Development, Theory Practice Relationship, Research and Development

Learning Analytics Application to Examine Validity and Generalizability of Game-Based Assessment for Spatial Reasoning

Peer reviewed

Direct link

Kim, Yoon Jeon; Knowles, Mariah A.; Scianna, Jennifer; Lin, Grace; Ruipérez-Valiente, José A. – British Journal of Educational Technology, 2023

Game-based assessment (GBA), a specific application of games for learning, has been recognized as an alternative form of assessment. While there is a substantive body of literature that supports the educational benefits of GBA, limited work investigates the validity and generalizability of such systems. In this paper, we describe applications of…

Descriptors: Learning Analytics, Validity, Generalizability Theory, Game Based Learning

Conditional Standard Error of Measurement: Classical Test Theory, Generalizability Theory and Many-Facet Rasch Measurement with Applications to Writing Assessment

Peer reviewed
PDF on ERIC

Download full text

Huebner, Alan; Skar, Gustaf B. – Practical Assessment, Research & Evaluation, 2021

Writing assessments often consist of students responding to multiple prompts, which are judged by more than one rater. To establish the reliability of these assessments, there exist different methods to disentangle variation due to prompts and raters, including classical test theory, Many Facet Rasch Measurement (MFRM), and Generalizability Theory…

Descriptors: Error of Measurement, Test Theory, Generalizability Theory, Item Response Theory

Quantile Reliability: Beyond Global Estimates of Internal Consistency

Peer reviewed

Direct link

Jeffrey Shero; Jessica Logan – Society for Research on Educational Effectiveness, 2024

Background/Context: Previous research in educational assessment has consistently emphasized the importance of reliability as a cornerstone of test quality. Traditional measures of reliability, such as test-retest and split-half reliability, offer a broad view of how internally consistent a measure is but overlook the variability in this internal…

Descriptors: Educational Assessment, Special Education, Students with Disabilities, Learning Disabilities

The Power and Type I Error of Wilcoxon-Mann-Whitney, Welch's "t," and Student's "t" Tests for Likert-Type Data

Peer reviewed
PDF on ERIC

Download full text

Simsek, Ahmet Salih – International Journal of Assessment Tools in Education, 2023

Likert-type item is the most popular response format for collecting data in social, educational, and psychological studies through scales or questionnaires. However, there is no consensus on whether parametric or non-parametric tests should be preferred when analyzing Likert-type data. This study examined the statistical power of parametric and…

Descriptors: Error of Measurement, Likert Scales, Nonparametric Statistics, Statistical Analysis

Beyond Statistical Significance: A Holistic View of What Makes a Research Finding "Important"

Peer reviewed
PDF on ERIC

Download full text

Jane E. Miller – Numeracy, 2023

Students often believe that statistical significance is the only determinant of whether a quantitative result is "important." In this paper, I review traditional null hypothesis statistical testing to identify what questions inferential statistics can and cannot answer, including statistical significance, effect size and direction,…

Descriptors: Statistical Significance, Holistic Approach, Statistical Inference, Effect Size

Not Just Generalizability: A Case for Multifaceted Latent Trait Models in Teacher Observation Systems

Peer reviewed

Direct link

Wind, Stefanie A.; Jones, Eli – Educational Researcher, 2019

Teacher evaluation systems often include classroom observations in which raters use rating scales to evaluate teachers' effectiveness. Recently, researchers have promoted the use of multifaceted approaches to investigating reliability using Generalizability theory, instead of rater reliability statistics. Generalizability theory allows analysts to…

Descriptors: Teacher Evaluation, Observation, Generalizability Theory, Item Response Theory

Coping with Unbalanced Designs of Generalizability Theory: G String V

Peer reviewed
PDF on ERIC

Download full text

Teker, Gülsen Tasdelen – International Journal of Assessment Tools in Education, 2019

The aim of this paper is to introduce a software that is appropriate for the generalizability theory for not only balanced but also unbalanced data sets. Because it is possible to have unbalanced data sets while conducting a study, the researchers have devised an easy solution, other than deleting data, to balance the design to cope with this…

Descriptors: Generalizability Theory, Research Design, Computer Software, Data

Validity. Improving Literacy Brief: Understanding Screening

Direct link

Petscher, Y.; Pentimonti, J.; Stanley, C. – National Center on Improving Literacy, 2019

Validity is broadly defined as how well something measures what it's supposed to measure. The reliability and validity of scores from assessments are two concepts that are closely knit together and feed into each other.

Descriptors: Screening Tests, Scores, Test Validity, Test Reliability

The Use of Open-Ended Questions in Large-Scale Tests for Selection: Generalizability and Dependability

Peer reviewed
PDF on ERIC

Download full text

Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020

It has become a critical question what the reliability level would be when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…

Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability

How Well Is Each Learner Learning? Validity Investigation of a Learning Curve-Based Assessment Approach for ECG Interpretation

Peer reviewed

Direct link

Hatala, Rose; Gutman, Jacqueline; Lineberry, Matthew; Triola, Marc; Pusic, Martin – Advances in Health Sciences Education, 2019

Learning curves can support a competency-based approach to assessment for learning. When interpreting repeated assessment data displayed as learning curves, a key assessment question is: "How well is each learner learning?" We outline the validity argument and investigation relevant to this question, for a computer-based repeated…

Descriptors: Medicine, Metabolism, Physicians, Clinical Diagnosis

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 49

Educational and Psychological…	44
Advances in Health Sciences…	27
Journal of Educational…	24
Applied Measurement in…	23
ProQuest LLC	18
Language Testing	13
Grantee Submission	11
Society for Research on…	11
Psychometrika	10
School Psychology Review	10
Applied Psychological…	9
Educational Measurement:…	9
Online Submission	8
International Journal of…	7
Journal of Educational…	7
Journal of Educational…	7
Multivariate Behavioral…	7
Behavioral Research and…	6
Educational Researcher	6
Educational Sciences: Theory…	6
Journal of Psychoeducational…	6
Measurement and Evaluation in…	6
Review of Educational Research	6
School Psychology Quarterly	6
Assessment for Effective…	5
More ▼

Brennan, Robert L.	18
Lee, Guemin	13
Briesch, Amy M.	11
Clauser, Brian E.	9
Chafouleas, Sandra M.	8
Riley-Tillman, T. Chris	8
Solano-Flores, Guillermo	8
Volpe, Robert J.	8
Christ, Theodore J.	7
Lee, Yong-Won	7
Marcoulides, George A.	7
Shavelson, Richard J.	7
Tindal, Gerald	7
Alonzo, Julie	6
Anderson, Daniel	6
Hagtvet, Knut A.	5
Harik, Polina	5
Miller, M. David	5
Raymond, Mark R.	5
Atilgan, Hakan	4
Chang, Lei	4
Fitzpatrick, Anne R.	4
French, Brian F.	4
Guler, Nese	4
More ▼

Journal Articles	534
Reports - Research	426
Reports - Evaluative	180
Speeches/Meeting Papers	115
Reports - Descriptive	57
Opinion Papers	27
Information Analyses	25
Dissertations/Theses -…	19
Numerical/Quantitative Data	19
Tests/Questionnaires	11
Guides - Non-Classroom	6
Books	5
Collected Works - General	3
Book/Product Reviews	2
Reference Materials -…	2
Reports - General	2
Collected Works - Serials	1
Dissertations/Theses -…	1
Guides - Classroom - Learner	1
Guides - General	1
Non-Print Media	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	6
Program for International…	4
Teacher Performance…	4
Dynamic Indicators of Basic…	3
Trends in International…	3
ACT Assessment	2
Childrens Depression Inventory	2
National Assessment of…	2
National Survey of Student…	2
Progress in International…	2
SAT (College Admission Test)	2
Students Evaluation of…	2
Test of English for…	2
United States Medical…	2
Wechsler Adult Intelligence…	2
Advanced Placement…	1
Battelle Developmental…	1
Behavior Assessment System…	1
Big Five Inventory	1
Classroom Assessment Scoring…	1
Cognitive Abilities Test	1
Conners Teacher Rating Scale	1
Early Childhood Environment…	1
Eating Disorder Inventory	1
Eysenck Personality Inventory	1
More ▼