NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
No Child Left Behind Act 20011
What Works Clearinghouse Rating
Showing 1 to 15 of 489 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024
Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity, etc.), and regularization has shown promising results for identifying DIF among…
Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Fangxing Bai; Ben Kelcey; Amota Ataneka; Yanli Xie; Kyle Cox; Nianbo Dong – Society for Research on Educational Effectiveness, 2024
Purpose: Multisite mediation studies are a cornerstone in mapping out developmental processes because they probe the mechanisms of a treatment while creating key opportunities to learn from and about variation in those mechanisms across sites. Despite the prevalence of multisite studies, a significant gap in the literature is how to plan such…
Descriptors: Randomized Controlled Trials, Mediation Theory, Statistical Analysis, Robustness (Statistics)
Peer reviewed Peer reviewed
Direct linkDirect link
Mariguddi, Anna; Cain, Tim – British Educational Research Journal, 2022
Drawing on descriptions of research-into-practice initiatives, this article presents a new framework to aid understanding of how research findings influence educational practice at scale. The framework focuses upon five areas: trustworthiness of the findings and generalisability; implications and instructions for practice; support for…
Descriptors: Educational Research, Reliability, Generalizability Theory, Fidelity
Peer reviewed Peer reviewed
Direct linkDirect link
Vispoel, Walter P.; Lee, Hyeryung; Xu, Guanlan; Hong, Hyeri – Journal of Experimental Education, 2023
Although generalizability theory (GT) designs have traditionally been analyzed within an ANOVA framework, identical results can be obtained with structural equation models (SEMs) but extended to represent multiple sources of both systematic and measurement error variance, include estimation methods less likely to produce negative variance…
Descriptors: Generalizability Theory, Structural Equation Models, Programming Languages, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Barth-Cohen, Lauren A.; Swanson, Hillary; Arnell, Jared – Physical Review Physics Education Research, 2023
Within physics education research (PER), resource theory has proven to be a useful framework for investigating knowledge and learning and informing instructional design. To analyze learning over longer timescales and across cases, PER scholars must first identify and describe the resources activated within and across physics contexts and domains.…
Descriptors: Physics, Science Instruction, Teaching Methods, Research Design
Peer reviewed Peer reviewed
Direct linkDirect link
Raymond, Mark R.; Jiang, Zhehan – Educational and Psychological Measurement, 2020
Conventional methods for evaluating the utility of subscores rely on traditional indices of reliability and on correlations among subscores. One limitation of correlational methods is that they do not explicitly consider variation in subtest means. An exception is an index of score profile reliability designated as [G], which quantifies the ratio…
Descriptors: Generalizability Theory, Multivariate Analysis, Scores, Reliability
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Lee, Morgan P.; Croteau, Ethan; Gurung, Ashish; Botelho, Anthony F.; Heffernan, Neil T. – International Educational Data Mining Society, 2023
The use of Bayesian Knowledge Tracing (BKT) models in predicting student learning and mastery, especially in mathematics, is a well-established and proven approach in learning analytics. In this work, we report on our analysis examining the generalizability of BKT models across academic years attributed to "detector rot." We compare the…
Descriptors: Bayesian Statistics, Models, Generalizability Theory, Longitudinal Studies
Peer reviewed Peer reviewed
Direct linkDirect link
Huiying Cai; Xun Yan – Language Testing, 2024
Rater comments tend to be qualitatively analyzed to indicate raters' application of rating scales. This study applied natural language processing (NLP) techniques to quantify meaningful, behavioral information from a corpus of rater comments and triangulated that information with a many-facet Rasch measurement (MFRM) analysis of rater scores. The…
Descriptors: Natural Language Processing, Item Response Theory, Rating Scales, Writing Evaluation
Sturgis, Paul W.; Marchand, Leslie; Miller, M. David; Xu, Wei; Castiglioni, Analia – Association for Institutional Research, 2022
This article introduces generalizability theory (G-theory) to institutional research and assessment practitioners, and explains how it can be utilized to evaluate the reliability of assessment procedures in order to improve student learning outcomes. The fundamental concepts associated with G-theory are briefly discussed, followed by a discussion…
Descriptors: Generalizability Theory, Institutional Research, Reliability, Computer Software
Peer reviewed Peer reviewed
Direct linkDirect link
DeMars, Christine E. – Applied Measurement in Education, 2021
Estimation of parameters for the many-facets Rasch model requires that conditional on the values of the facets, such as person ability, item difficulty, and rater severity, the observed responses within each facet are independent. This requirement has often been discussed for the Rasch models and 2PL and 3PL models, but it becomes more complex…
Descriptors: Item Response Theory, Test Items, Ability, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Grimm, Kevin J.; Fine, Kimberly; Stegmann, Gabriela – International Journal of Behavioral Development, 2021
Modeling within-person change over time and between-person differences in change over time is a primary goal in prevention science. When modeling change in an observed score over time with multilevel or structural equation modeling approaches, each observed score counts toward the estimation of model parameters equally. However, observed scores…
Descriptors: Error of Measurement, Weighted Scores, Accuracy, Item Response Theory
Peer reviewed Peer reviewed
Direct linkDirect link
Dyoniziak, Yann; Potocki, Anna; Rouet, Jean-François – Discourse Processes: A Multidisciplinary Journal, 2023
With the development of the Internet as a main source of information, teenagers are increasingly faced with multiple documents which may contain contradictory statements, and whose reliability must be assessed. One way to assess information reliability is to evaluate the source of the information (e.g., author expertise, intention). However,…
Descriptors: Theory of Mind, Information Literacy, Information Sources, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Peitao Zhu; Ching-Chen Chen; Qiu Wang; Melissa M. Luke; Yanhong Liu – Measurement and Evaluation in Counseling and Development, 2025
Objective: This study aimed to validate the Cultural Humility and Enactment Scale (CHES) through (a) examining its factor structure with multiple samples; (b) employing item response theory (IRT) analysis to examine its item-level characteristics; (c) reducing potential redundancies among items; and (d) conducting measurement invariance (MI)…
Descriptors: Item Response Theory, Cultural Awareness, Measurement Techniques, Construct Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Hyemin Han; Marja Graham – Theory and Research in Education, 2024
The present study aimed to examine how to improve the effectiveness of moral exemplar-applied interventions based on the pillars of the self-determination theory framework, autonomy, competence, and relatedness. Past research has mainly focused on the relatedness and attainability of moral exemplars for predicting motivation outcomes. The data for…
Descriptors: Moral Values, Self Determination, Intervention, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Carbonneau, Kira J.; Van Orman, Dustin S. J.; Lemberger-Truelove, Matthew E.; Atencio, David J. – Early Education and Development, 2020
Research Findings: Given the variable nature of early childhood settings, practitioners and researchers need better guidance on what conditions influence observations conducted within early childhood settings (National Research Council, 2008). Using 230 observations from 23 three- and four-year-old children, we conducted a Generalizability study…
Descriptors: Classroom Environment, Observation, Preschool Children, Influences
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  33