Publication Date
In 2025 | 4 |
Since 2024 | 25 |
Since 2021 (last 5 years) | 79 |
Since 2016 (last 10 years) | 139 |
Since 2006 (last 20 years) | 151 |
Descriptor
Evaluation Methods | 151 |
Models | 23 |
Student Evaluation | 22 |
Elementary School Students | 21 |
Comparative Analysis | 18 |
Psychometrics | 18 |
Correlation | 17 |
Scores | 17 |
Test Validity | 16 |
Intervention | 14 |
Item Response Theory | 14 |
More ▼ |
Source
Grantee Submission | 151 |
Author
Chun Wang | 7 |
Gongjun Xu | 6 |
Andres De Los Reyes | 4 |
Danielle S. McNamara | 4 |
Avi Feller | 3 |
De Los Reyes, Andres | 3 |
Lloyd, Blair P. | 3 |
McKown, Clark | 3 |
O'Reilly, Tenaha | 3 |
Rahimi, Seyedahmad | 3 |
Shute, Valerie | 3 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 1 |
Location
District of Columbia | 3 |
Florida | 3 |
Maryland | 3 |
California (Los Angeles) | 2 |
Illinois (Chicago) | 2 |
Kansas | 2 |
Kentucky | 2 |
Massachusetts | 2 |
New York (New York) | 2 |
North Carolina | 2 |
Virginia | 2 |
More ▼ |
Laws, Policies, & Programs
Head Start | 2 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating

Jason Schoeneberger; Christopher Rhoads – Grantee Submission, 2024
Regression discontinuity (RD) designs are increasingly used for causal evaluations. For example, if a student's need for a literacy intervention is determined by a low score on a past performance indicator and that intervention is provided to all students who fall below a cutoff on that indicator, an RD study can determine the intervention's main…
Descriptors: Regression (Statistics), Causal Models, Evaluation Methods, Multivariate Analysis
Jon Wai; Joni M. Lakin – Grantee Submission, 2024
Students' talent and potential cannot be served until they are recognized by schools or caregivers. While the field of gifted education has had success in identifying talent among many students with talents in reading and mathematics, those with spatial talents are often overlooked. This article reviews how we might identify spatial talent using…
Descriptors: Spatial Ability, Identification, Talent, Student Evaluation
Liyang Sun; Eli Ben-Michael; Avi Feller – Grantee Submission, 2024
The synthetic control method (SCM) is a popular approach for estimating the impact of a treatment on a single unit with panel data. Two challenges arise with higher frequency data (e.g., monthly versus yearly): (1) achieving excellent pre-treatment fit is typically more challenging; and (2) overfitting to noise is more likely. Aggregating data…
Descriptors: Evaluation Methods, Comparative Analysis, Computation, Data Analysis
Sam Choo; Reagan Mergen; Jechun An; Haoran Li; Xuejing Liu; Martin Odima; Linda J. Gassaway – Grantee Submission, 2025
The importance of mathematical problem solving (MPS) has been widely recognized. While there has been significant progress in developing and studying interventions to support teaching and learning MPS for students with disabilities, the research on how to accurately and effectively assess the impact of those interventions has lagged, leaving a gap…
Descriptors: Mathematics Skills, Problem Solving, Student Evaluation, Evaluation Methods
Alexander D. Latham; David A. Klingbeil – Grantee Submission, 2024
The visual analysis of data presented in time-series graphs are common in single-case design (SCD) research and applied practice in school psychology. A growing body of research suggests that visual analysts' ratings are often influenced by construct-irrelevant features including Y-axis truncation and compression of the number of data points per…
Descriptors: Intervention, School Psychologists, Graphs, Evaluation Methods
Lingbo Tong; Wen Qu; Zhiyong Zhang – Grantee Submission, 2025
Factor analysis is widely utilized to identify latent factors underlying the observed variables. This paper presents a comprehensive comparative study of two widely used methods for determining the optimal number of factors in factor analysis, the K1 rule, and parallel analysis, along with a more recently developed method, the bass-ackward method.…
Descriptors: Factor Analysis, Monte Carlo Methods, Statistical Analysis, Sample Size
Jiaying Xiao; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Accurate item parameters and standard errors (SEs) are crucial for many multidimensional item response theory (MIRT) applications. A recent study proposed the Gaussian Variational Expectation Maximization (GVEM) algorithm to improve computational efficiency and estimation accuracy (Cho et al., 2021). However, the SE estimation procedure has yet to…
Descriptors: Error of Measurement, Models, Evaluation Methods, Item Analysis
Nazanin Nezami; Parian Haghighat; Denisa Gándara; Hadis Anahideh – Grantee Submission, 2024
The education sector has been quick to recognize the power of predictive analytics to enhance student success rates. However, there are challenges to widespread adoption, including the lack of accessibility and the potential perpetuation of inequalities. These challenges present in different stages of modeling, including data preparation, model…
Descriptors: Evaluation Methods, College Students, Success, Predictor Variables
Xiao Liu; Zhiyong Zhang; Lijuan Wang – Grantee Submission, 2024
In psychology, researchers are often interested in testing hypotheses about mediation, such as testing the presence of a mediation effect of a treatment (e.g., intervention assignment) on an outcome via a mediator. An increasingly popular approach to testing hypotheses is the Bayesian testing approach with Bayes factors (BFs). Despite the growing…
Descriptors: Sample Size, Bayesian Statistics, Programming Languages, Simulation
Edgar C. Merkle; Oludare Ariyo; Sonja D. Winter; Mauricio Garnier-Villarreal – Grantee Submission, 2023
We review common situations in Bayesian latent variable models where the prior distribution that a researcher specifies differs from the prior distribution used during estimation. These situations can arise from the positive definite requirement on correlation matrices, from sign indeterminacy of factor loadings, and from order constraints on…
Descriptors: Models, Bayesian Statistics, Correlation, Evaluation Methods
Mauricio Garnier-Villarreal; Terrence D. Jorgensen – Grantee Submission, 2024
Model evaluation is a crucial step in SEM, consisting of two broad areas: global and local fit, where local fit indices are use to modify the original model. In the modification process, the modification index (MI) and the standardized expected parameter change (SEPC) are used to select the parameters that can be added to improve the fit. The…
Descriptors: Bayesian Statistics, Structural Equation Models, Goodness of Fit, Indexes
Madeline A. Schellman; Matthew J. Madison – Grantee Submission, 2024
Diagnostic classification models (DCMs) have grown in popularity as stakeholders increasingly desire actionable information related to students' skill competencies. Longitudinal DCMs offer a psychometric framework for providing estimates of students' proficiency status transitions over time. For both cross-sectional and longitudinal DCMs, it is…
Descriptors: Diagnostic Tests, Classification, Models, Psychometrics
Daniel McNeish – Grantee Submission, 2023
Factor analysis is often used to model scales created to measure latent constructs, and internal structure validity evidence is commonly assessed with indices like SRMR, RMSEA, and CFI. These indices are essentially effect size measures and definitive benchmarks regarding which values connote reasonable fit have been elusive. Simulations from the…
Descriptors: Models, Testing, Indexes, Factor Analysis
Jianing Zhou; Ziheng Zeng; Hongyu Gong; Suma Bhat – Grantee Submission, 2022
Idiomatic expressions (IEs) play an essential role in natural language. In this paper, we study the task of idiomatic sentence paraphrasing (ISP), which aims to paraphrase a sentence with an IE by replacing the IE with its literal paraphrase. The lack of large scale corpora with idiomatic-literal parallel sentences is a primary challenge for this…
Descriptors: Language Patterns, Sentences, Language Processing, Phrase Structure
Bonifay, Wes – Grantee Submission, 2022
Traditional statistical model evaluation typically relies on goodness-of-fit testing and quantifying model complexity by counting parameters. Both of these practices may result in overfitting and have thereby contributed to the generalizability crisis. The information-theoretic principle of minimum description length addresses both of these…
Descriptors: Statistical Analysis, Models, Goodness of Fit, Evaluation Methods