Publication Date
In 2025 | 2 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 27 |
Since 2016 (last 10 years) | 77 |
Since 2006 (last 20 years) | 174 |
Descriptor
Statistical Analysis | 196 |
Computation | 86 |
Models | 56 |
Sample Size | 33 |
Error of Measurement | 29 |
Comparative Analysis | 28 |
Regression (Statistics) | 27 |
Simulation | 27 |
Test Items | 26 |
Correlation | 24 |
Effect Size | 24 |
Source
Journal of Educational and… | 196 |
Author
Schochet, Peter Z. | 8 |
Sinharay, Sandip | 7 |
Goldstein, Harvey | 4 |
Moerbeek, Mirjam | 4 |
Bonett, Douglas G. | 3 |
Dong, Nianbo | 3 |
Hedges, Larry V. | 3 |
Hong, Guanglei | 3 |
Kelcey, Benjamin | 3 |
Leckie, George | 3 |
Liu, Yang | 3 |
Publication Type
Journal Articles | 196 |
Reports - Research | 117 |
Reports - Descriptive | 42 |
Reports - Evaluative | 35 |
Opinion Papers | 2 |
Book/Product Reviews | 1 |
Location
Canada | 3 |
California | 2 |
Colombia | 2 |
Germany | 2 |
Netherlands | 2 |
United Kingdom (England) | 2 |
Arizona | 1 |
Austria (Vienna) | 1 |
Brazil | 1 |
California (Los Angeles) | 1 |
California (Riverside) | 1 |
Kaitlyn G. Fitzgerald; Elizabeth Tipton – Journal of Educational and Behavioral Statistics, 2025
This article presents methods for using extant data to improve the properties of estimators of the standardized mean difference (SMD) effect size. Because samples recruited into education research studies are often more homogeneous than the populations of policy interest, the variation in educational outcomes can be smaller in these samples than…
Descriptors: Data Use, Computation, Effect Size, Meta Analysis
San Martín, Ernesto; González, Jorge – Journal of Educational and Behavioral Statistics, 2022
The nonequivalent groups with anchor test (NEAT) design is widely used in test equating. Under this design, two groups of examinees are administered different test forms with each test form containing a subset of common items. Because test takers from different groups are assigned only one test form, missing score data emerge by design rendering…
Descriptors: Tests, Scores, Statistical Analysis, Models
Doran, Harold – Journal of Educational and Behavioral Statistics, 2023
This article is concerned with a subset of numerically stable and scalable algorithms useful to support computationally complex psychometric models in the era of machine learning and massive data. The subset selected here is a core set of numerical methods that should be familiar to computational psychometricians and considers whitening transforms…
Descriptors: Scaling, Algorithms, Psychometrics, Computation
Su, Kun; Henson, Robert A. – Journal of Educational and Behavioral Statistics, 2023
This article provides a process for carefully evaluating the suitability of a content domain for diagnostic classification models (DCMs), optimized steps for constructing a test blueprint for applying DCMs, and a real-life example illustrating this process. The content domains were carefully evaluated using a set of…
Descriptors: Classification, Models, Science Tests, Physics
Peter Z. Schochet – Journal of Educational and Behavioral Statistics, 2025
Random encouragement designs evaluate treatments that aim to increase participation in a program or activity. These randomized controlled trials (RCTs) can also assess the mediated effects of participation itself on longer term outcomes using a complier average causal effect (CACE) estimation framework. This article considers power analysis…
Descriptors: Statistical Analysis, Computation, Causal Models, Research Design
Joo, Seang-Hwane; Wang, Yan; Ferron, John; Beretvas, S. Natasha; Moeyaert, Mariola; Van Den Noortgate, Wim – Journal of Educational and Behavioral Statistics, 2022
Multiple baseline (MB) designs are becoming more prevalent in educational and behavioral research, and as they do, there is growing interest in combining effect size estimates across studies. To further refine the meta-analytic methods of estimating the effect, this study developed and compared eight alternative methods of estimating intervention…
Descriptors: Meta Analysis, Effect Size, Computation, Statistical Analysis
Huang, Francis L. – Journal of Educational and Behavioral Statistics, 2022
The presence of clustered data is common in the sociobehavioral sciences. One approach that specifically deals with clustered data but has seen little use in education is the generalized estimating equations (GEEs) approach. We provide a background on GEEs, discuss why it is appropriate for the analysis of clustered data, and provide worked…
Descriptors: Multivariate Analysis, Computation, Correlation, Error of Measurement
Liu, Jin; Perera, Robert A.; Kang, Le; Sabo, Roy T.; Kirkpatrick, Robert M. – Journal of Educational and Behavioral Statistics, 2022
This study proposes transformation functions and matrices between coefficients in the original and reparameterized parameter spaces for an existing linear-linear piecewise model to derive the interpretable coefficients directly related to the underlying change pattern. Additionally, the study extends the existing model to allow individual…
Descriptors: Longitudinal Studies, Statistical Analysis, Matrices, Mathematics
Molenaar, Dylan; Cúri, Mariana; Bazán, Jorge L. – Journal of Educational and Behavioral Statistics, 2022
Bounded continuous data are encountered in many applications of item response theory, including the measurement of mood, personality, and response times and in the analyses of summed item scores. Although different item response theory models exist to analyze such bounded continuous data, most models assume the data to be in an open interval and…
Descriptors: Item Response Theory, Data, Responses, Intervals
Yajuan Si; Roderick J. A. Little; Ya Mo; Nell Sedransk – Journal of Educational and Behavioral Statistics, 2023
Nonresponse bias is a widely prevalent problem for data on education. We develop a ten-step exemplar to guide nonresponse bias analysis (NRBA) in cross-sectional studies and apply these steps to the Early Childhood Longitudinal Study, Kindergarten Class of 2010-2011. A key step is the construction of indices of nonresponse bias based on proxy…
Descriptors: Educational Assessment, Response Rates (Questionnaires), Bias, Children
Sinharay, Sandip; Johnson, Matthew S. – Journal of Educational and Behavioral Statistics, 2021
Score differencing is one of the six categories of statistical methods used to detect test fraud (Wollack & Schoenig, 2018) and involves the testing of the null hypothesis that the performance of an examinee is similar over two item sets versus the alternative hypothesis that the performance is better on one of the item sets. We suggest, to…
Descriptors: Probability, Bayesian Statistics, Cheating, Statistical Analysis
Vembye, Mikkel Helding; Pustejovsky, James Eric; Pigott, Therese Deocampo – Journal of Educational and Behavioral Statistics, 2023
Meta-analytic models for dependent effect sizes have grown increasingly sophisticated over the last few decades, which has created challenges for a priori power calculations. We introduce power approximations for tests of average effect sizes based upon several common approaches for handling dependent effect sizes. In a Monte Carlo simulation, we…
Descriptors: Meta Analysis, Robustness (Statistics), Statistical Analysis, Models
Ranger, Jochen; Brauer, Kay – Journal of Educational and Behavioral Statistics, 2022
The generalized S-X²-test is a test of item fit for items with a polytomous response format. The test is based on a comparison of the observed and expected number of responses in strata defined by the test score. In this article, we make four contributions. We demonstrate that the performance of the generalized S-X²-test…
Descriptors: Goodness of Fit, Test Items, Statistical Analysis, Item Response Theory
Wang, Weimeng; Liu, Yang; Liu, Hongyun – Journal of Educational and Behavioral Statistics, 2022
Differential item functioning (DIF) occurs when the probability of endorsing an item differs across groups for individuals with the same latent trait level. The presence of DIF items may jeopardize the validity of an instrument; therefore, it is crucial to identify DIF items in routine operations of educational assessment. While DIF detection…
Descriptors: Test Bias, Test Items, Equated Scores, Regression (Statistics)
Wendy Chan; Larry Vernon Hedges – Journal of Educational and Behavioral Statistics, 2022
Multisite field experiments using the (generalized) randomized block design that assign treatments to individuals within sites are common in education and the social sciences. Under this design, there are two possible estimands of interest and they differ based on whether sites or blocks have fixed or random effects. When the average treatment…
Descriptors: Research Design, Educational Research, Statistical Analysis, Statistical Inference