Moses, Tim – ETS Research Report Series, 2008
Nine statistical strategies for selecting equating functions in an equivalent groups design were evaluated. The strategies of interest were likelihood ratio chi-square tests, regression tests, Kolmogorov-Smirnov tests, and significance tests for equated score differences. The most accurate strategies in the study were the likelihood ratio tests…
Descriptors: Equated Scores, Statistical Analysis, Statistical Significance, Regression (Statistics)
Moses, Tim; Klockars, Alan – Educational Testing Service, 2009
The robustness and power of 9 strategies for testing the differences in groups' regression slopes were assessed under nonnormality and residual variance heterogeneity. For the conditions considered, the most robust strategies were the trimmed and Winsorized slope estimates used with the James second-order test, the Theil-Sen slope estimates used…
Descriptors: Evaluation Methods, Maximum Likelihood Statistics, Regression (Statistics), Robustness (Statistics)
Moses, Tim – Journal of Educational and Behavioral Statistics, 2008
Equating functions are supposed to be population invariant, meaning that the choice of subpopulation used to compute the equating function should not matter. The extent to which equating functions are population invariant is typically assessed in terms of practical difference criteria that do not account for equating functions' sampling…
Descriptors: Equated Scores, Error of Measurement, Sampling, Evaluation Methods
Moses, Tim; Yang, Wen-Ling; Wilson, Christine – Journal of Educational Measurement, 2007
This study explored the use of kernel equating for integrating and extending two procedures proposed for assessing item order effects in test forms that have been administered to randomly equivalent groups. When these procedures are used together, they can provide complementary information about the extent to which item order effects impact test…
Descriptors: Advanced Placement, Equated Scores, Test Items, Item Analysis
Moses, Tim; Holland, Paul – ETS Research Report Series, 2007
The purpose of this study was to empirically evaluate the impact of loglinear presmoothing accuracy on equating bias and variability across chained and post-stratification equating methods, kernel and percentile-rank continuization methods, and sample sizes. The results of evaluating presmoothing on equating accuracy generally agreed with those of…
Descriptors: Equated Scores, Statistical Analysis, Accuracy, Sample Size
Yu, Lei; Moses, Tim; Puhan, Gautam; Dorans, Neil – ETS Research Report Series, 2008
All differential item functioning (DIF) methods require at least a moderate sample size for effective DIF detection. Samples that are less than 200 pose a challenge for DIF analysis. Smoothing can improve upon the estimation of the population distribution by preserving major features of an observed frequency distribution while eliminating the…
Descriptors: Test Bias, Item Response Theory, Sample Size, Evaluation Criteria
Moses, Tim; Kim, Sooyeon – ETS Research Report Series, 2007
This study evaluated the impact of unequal reliability on test equating methods in the nonequivalent groups with anchor test (NEAT) design. Classical true score-based models were compared in terms of their assumptions about how reliability impacts test scores. These models were related to treatment of population ability differences by different…
Descriptors: Reliability, Equated Scores, Test Items, Statistical Analysis
Moses, Tim – ETS Research Report Series, 2006
Population invariance is an important requirement of test equating. An equating function is said to be population invariant when the choice of (sub)population used to compute the equating function does not matter. In recent studies, the extent to which equating functions are population invariant is typically addressed in terms of practical…
Descriptors: Equated Scores, Computation, Error of Measurement, Statistical Analysis
Moses, Tim; von Davier, Alina A.; Casabianca, Jodi – ETS Research Report Series, 2004
The purpose of this report is to demonstrate loglinear smoothing using SAS PROC GENMOD. The results from four published examples, which include the smoothing of a) univariate distributions, b) bivariate distributions, c) distributions with teeth, and d) bivariate distributions with structural zeros, are reproduced to show the flexibility of the…
Descriptors: Statistical Analysis, Statistical Distributions, Comparative Analysis, Graphs