ERIC - Search Results

Showing 1 to 15 of 489 results

Using Regularization to Identify Measurement Bias across Multiple Background Characteristics: A Penalized Expectation-Maximization Algorithm

Peer reviewed

William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024

Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity, etc.), and regularization has shown promising results for identifying DIF among…

Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement

Designing Multisite Randomized Trials to Detect (Moderated) Mediation Effects

Peer reviewed

Fangxing Bai; Ben Kelcey; Amota Ataneka; Yanli Xie; Kyle Cox; Nianbo Dong – Society for Research on Educational Effectiveness, 2024

Purpose: Multisite mediation studies are a cornerstone in mapping out developmental processes because they probe the mechanisms of a treatment while creating key opportunities to learn from and about variation in those mechanisms across sites. Despite the prevalence of multisite studies, a significant gap in the literature is how to plan such…

Descriptors: Randomized Controlled Trials, Mediation Theory, Statistical Analysis, Robustness (Statistics)

Research-Informed Teaching: The Case of Musical Futures

Peer reviewed

Mariguddi, Anna; Cain, Tim – British Educational Research Journal, 2022

Drawing on descriptions of research-into-practice initiatives, this article presents a new framework to aid understanding of how research findings influence educational practice at scale. The framework focuses upon five areas: trustworthiness of the findings and generalisability; implications and instructions for practice; support for…

Descriptors: Educational Research, Reliability, Generalizability Theory, Fidelity

Integrating Bifactor Models into a Generalizability Theory Based Structural Equation Modeling Framework

Peer reviewed

Vispoel, Walter P.; Lee, Hyeryung; Xu, Guanlan; Hong, Hyeri – Journal of Experimental Education, 2023

Although generalizability theory (GT) designs have traditionally been analyzed within an ANOVA framework, identical results can be obtained with structural equation models (SEMs) but extended to represent multiple sources of both systematic and measurement error variance, include estimation methods less likely to produce negative variance…

Descriptors: Generalizability Theory, Structural Equation Models, Programming Languages, Scores

Methods of Research Design and Analysis for Identifying Knowledge Resources

Peer reviewed

Barth-Cohen, Lauren A.; Swanson, Hillary; Arnell, Jared – Physical Review Physics Education Research, 2023

Within physics education research (PER), resource theory has proven to be a useful framework for investigating knowledge and learning and informing instructional design. To analyze learning over longer timescales and across cases, PER scholars must first identify and describe the resources activated within and across physics contexts and domains.…

Descriptors: Physics, Science Instruction, Teaching Methods, Research Design

Indices of Subscore Utility for Individuals and Subgroups Based on Multivariate Generalizability Theory

Peer reviewed

Raymond, Mark R.; Jiang, Zhehan – Educational and Psychological Measurement, 2020

Conventional methods for evaluating the utility of subscores rely on traditional indices of reliability and on correlations among subscores. One limitation of correlational methods is that they do not explicitly consider variation in subtest means. An exception is an index of score profile reliability designated as [G], which quantifies the ratio…

Descriptors: Generalizability Theory, Multivariate Analysis, Scores, Reliability

Knowledge Tracing over Time: A Longitudinal Analysis

Peer reviewed

Lee, Morgan P.; Croteau, Ethan; Gurung, Ashish; Botelho, Anthony F.; Heffernan, Neil T. – International Educational Data Mining Society, 2023

The use of Bayesian Knowledge Tracing (BKT) models in predicting student learning and mastery, especially in mathematics, is a well-established and proven approach in learning analytics. In this work, we report on our analysis examining the generalizability of BKT models across academic years attributed to "detector rot." We compare the…

Descriptors: Bayesian Statistics, Models, Generalizability Theory, Longitudinal Studies

Triangulating Natural Language Processing (NLP)-Based Analysis of Rater Comments and Many-Facet Rasch Measurement (MFRM): An Innovative Approach to Investigating Raters' Application of Rating Scales in Writing Assessment

Peer reviewed

Huiying Cai; Xun Yan – Language Testing, 2024

Rater comments tend to be qualitatively analyzed to indicate raters' application of rating scales. This study applied natural language processing (NLP) techniques to quantify meaningful, behavioral information from a corpus of rater comments and triangulated that information with a many-facet Rasch measurement (MFRM) analysis of rater scores. The…

Descriptors: Natural Language Processing, Item Response Theory, Rating Scales, Writing Evaluation

Generalizability Theory and Its Application to Institutional Research. The AIR Professional File, Spring 2022. Article 156

Sturgis, Paul W.; Marchand, Leslie; Miller, M. David; Xu, Wei; Castiglioni, Analia – Association for Institutional Research, 2022

This article introduces generalizability theory (G-theory) to institutional research and assessment practitioners, and explains how it can be utilized to evaluate the reliability of assessment procedures in order to improve student learning outcomes. The fundamental concepts associated with G-theory are briefly discussed, followed by a discussion…

Descriptors: Generalizability Theory, Institutional Research, Reliability, Computer Software

Violation of Conditional Independence in the Many-Facets Rasch Model

Peer reviewed

DeMars, Christine E. – Applied Measurement in Education, 2021

Estimation of parameters for the many-facets Rasch model requires that conditional on the values of the facets, such as person ability, item difficulty, and rater severity, the observed responses within each facet are independent. This requirement has often been discussed for the Rasch models and 2PL and 3PL models, but it becomes more complex…

Descriptors: Item Response Theory, Test Items, Ability, Scores

Accounting for Standard Errors of Measurement When Modeling Change

Peer reviewed

Grimm, Kevin J.; Fine, Kimberly; Stegmann, Gabriela – International Journal of Behavioral Development, 2021

Modeling within-person change over time and between-person differences in change over time is a primary goal in prevention science. When modeling change in an observed score over time with multilevel or structural equation modeling approaches, each observed score counts toward the estimation of model parameters equally. However, observed scores…

Descriptors: Error of Measurement, Weighted Scores, Accuracy, Item Response Theory

Role of Advanced Theory of Mind in Teenagers' Evaluation of Source Information

Peer reviewed

Dyoniziak, Yann; Potocki, Anna; Rouet, Jean-François – Discourse Processes: A Multidisciplinary Journal, 2023

With the development of the Internet as a main source of information, teenagers are increasingly faced with multiple documents which may contain contradictory statements, and whose reliability must be assessed. One way to assess information reliability is to evaluate the source of the information (e.g., author expertise, intention). However,…

Descriptors: Theory of Mind, Information Literacy, Information Sources, Reliability

Item Response Theory Analysis and Measurement Invariance Testing of the Cultural Humility and Enactment Scale

Peer reviewed

Peitao Zhu; Ching-Chen Chen; Qiu Wang; Melissa M. Luke; Yanhong Liu – Measurement and Evaluation in Counseling and Development, 2025

Objective: This study aimed to validate the Cultural Humility and Enactment Scale (CHES) through (a) examining its factor structure with multiple samples; (b) employing item response theory (IRT) analysis to examine its item-level characteristics; (c) reducing potential redundancies among items; and (d) conducting measurement invariance (MI)…

Descriptors: Item Response Theory, Cultural Awareness, Measurement Techniques, Construct Validity

Considerations for Effective Use of Moral Exemplars in Education: Based on the Self-Determination Theory and Data Syntheses

Peer reviewed

Hyemin Han; Marja Graham – Theory and Research in Education, 2024

The present study aimed to examine how to improve the effectiveness of moral exemplar-applied interventions based on the pillars of the self-determination theory framework, autonomy, competence, and relatedness. Past research has mainly focused on the relatedness and attainability of moral exemplars for predicting motivation outcomes. The data for…

Descriptors: Moral Values, Self Determination, Intervention, Reliability

Leveraging the Power of Observations: Locating the Sources of Error in the Individualized Classroom Assessment Scoring System

Peer reviewed

Carbonneau, Kira J.; Van Orman, Dustin S. J.; Lemberger-Truelove, Matthew E.; Atencio, David J. – Early Education and Development, 2020

Research Findings: Given the variable nature of early childhood settings, practitioners and researchers need better guidance on what conditions influence observations conducted within early childhood settings (National Research Council, 2008). Using 230 observations from 23 three- and four-year-old children, we conducted a Generalizability study…

Descriptors: Classroom Environment, Observation, Preschool Children, Influences
