ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	6
Since 2006 (last 20 years)	11

Descriptor

Computation	11
Item Response Theory	10
Simulation	5
Comparative Analysis	4
Goodness of Fit	4
Error of Measurement	3
Methods	3
Statistical Analysis	3
Equations (Mathematics)	2
Error Patterns	2
Models	2
Nonparametric Statistics	2
Prediction	2
Regression (Statistics)	2
Test Format	2
Test Items	2
Testing Problems	2
Ability	1
Academic Achievement	1
Accuracy	1
Adaptive Testing	1
Bayesian Statistics	1
Change	1
Classification	1
Computer Assisted Testing	1
More ▼

Source

Journal of Educational and…	4
Applied Measurement in…	2
Educational Testing Service	2
Educational and Psychological…	1
Grantee Submission	1
Journal of Educational…	1

Author

Sinharay, Sandip	11
Guo, Hongwen	1
Haberman, Shelby	1
Johnson, Matthew S.	1
Larkin, Kevin	1
Puhan, Gautam	1
von Davier, Matthias	1

Publication Type

Journal Articles	8
Reports - Research	8
Reports - Evaluative	3
Speeches/Meeting Papers	1

Education Level

Grade 4	1
Grade 8	1

Audience

Location

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 11 results Save | Export

Reporting Proficiency Levels for Examinees with Incomplete Data

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2022

Takers of educational tests often receive proficiency levels instead of or in addition to scaled scores. For example, proficiency levels are reported for the Advanced Placement (AP®) and U.S. Medical Licensing examinations. Technical difficulties and other unforeseen events occasionally lead to missing item scores and hence to incomplete data on…

Descriptors: Computation, Data Analysis, Educational Testing, Accuracy

Three New Methods for Analysis of Answer Changes

Peer reviewed

Direct link

Sinharay, Sandip; Johnson, Matthew S. – Educational and Psychological Measurement, 2017

In a pioneering research article, Wollack and colleagues suggested the "erasure detection index" (EDI) to detect test tampering. The EDI can be used with or without a continuity correction and is assumed to follow the standard normal distribution under the null hypothesis of no test tampering. When used without a continuity correction,…

Descriptors: Deception, Identification, Testing Problems, Error of Measurement

Extension of Caution Indices to Mixed-Format Tests

Peer reviewed
PDF on ERIC

Download full text

Direct link

Sinharay, Sandip – Grantee Submission, 2018

Tatsuoka (1984) suggested several extended caution indices and their standardized versions that have been used as person-fit statistics by researchers such as Drasgow, Levine, and McLaughlin (1987), Glas and Meijer (2003), and Molenaar and Hoijtink (1990). However, these indices are only defined for tests with dichotomous items. This paper extends…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Error Patterns

The Asymptotic Distribution of Ability Estimates: Beyond Dichotomous Items and Unidimensional IRT Models

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

The maximum likelihood estimate (MLE) of the ability parameter of an item response theory model with known item parameters was proved to be asymptotically normally distributed under a set of regularity conditions for tests involving dichotomous items and a unidimensional ability parameter (Klauer, 1990; Lord, 1983). This article first considers…

Descriptors: Item Response Theory, Maximum Likelihood Statistics, Test Items, Ability

Assessment of Person Fit Using Resampling-Based Approaches

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational Measurement, 2016

De la Torre and Deng suggested a resampling-based approach for person-fit assessment (PFA). The approach involves the use of the [math equation unavailable] statistic, a corrected expected a posteriori estimate of the examinee ability, and the Monte Carlo (MC) resampling method. The Type I error rate of the approach was closer to the nominal level…

Descriptors: Sampling, Research Methodology, Error Patterns, Monte Carlo Methods

Are the Nonparametric Person-Fit Statistics More Powerful than Their Parametric Counterparts? Revisiting the Simulations in Karabatsos (2003)

Peer reviewed

Direct link

Sinharay, Sandip – Applied Measurement in Education, 2017

Karabatsos compared the power of 36 person-fit statistics using receiver operating characteristics curves and found the "H[superscript T]" statistic to be the most powerful in identifying aberrant examinees. He found three statistics, "C", "MCI", and "U3", to be the next most powerful. These four statistics,…

Descriptors: Nonparametric Statistics, Goodness of Fit, Simulation, Comparative Analysis

Person Fit Analysis in Computerized Adaptive Testing Using Tests for a Change Point

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2016

Meijer and van Krimpen-Stoop noted that the number of person-fit statistics (PFSs) that have been designed for computerized adaptive tests (CATs) is relatively modest. This article partially addresses that concern by suggesting three new PFSs for CATs. The statistics are based on tests for a change point and can be used to detect an abrupt change…

Descriptors: Computer Assisted Testing, Adaptive Testing, Item Response Theory, Goodness of Fit

Assessment of Person Fit for Mixed-Format Tests

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2015

Person-fit assessment may help the researcher to obtain additional information regarding the answering behavior of persons. Although several researchers examined person fit, there is a lack of research on person-fit assessment for mixed-format tests. In this article, the lz statistic and the ?2 statistic, both of which have been used for tests…

Descriptors: Test Format, Goodness of Fit, Item Response Theory, Bayesian Statistics

Measurement Error in Nonparametric Item Response Curve Estimation. Research Report. ETS RR-11-28

Download full text

Guo, Hongwen; Sinharay, Sandip – Educational Testing Service, 2011

Nonparametric, or kernel, estimation of item response curve (IRC) is a concern theoretically and operationally. Accuracy of this estimation, often used in item analysis in testing programs, is biased when the observed scores are used as the regressor because the observed scores are contaminated by measurement error. In this study, we investigate…

Descriptors: Error of Measurement, Nonparametric Statistics, Item Response Theory, Computation

The Utility of Augmented Subscores in a Licensure Exam: An Evaluation of Methods Using Empirical Data

Peer reviewed

Direct link

Puhan, Gautam; Sinharay, Sandip; Haberman, Shelby; Larkin, Kevin – Applied Measurement in Education, 2010

Will subscores provide additional information than what is provided by the total score? Is there a method that can estimate more trustworthy subscores than observed subscores? To answer the first question, this study evaluated whether the true subscore was more accurately predicted by the observed subscore or total score. To answer the second…

Descriptors: Licensing Examinations (Professions), Scores, Computation, Methods

Stochastic Approximation Methods for Latent Regression Item Response Models. Research Report. ETS RR-09-09

Download full text

von Davier, Matthias; Sinharay, Sandip – Educational Testing Service, 2009

This paper presents an application of a stochastic approximation EM-algorithm using a Metropolis-Hastings sampler to estimate the parameters of an item response latent regression model. Latent regression models are extensions of item response theory (IRT) to a 2-level latent variable model in which covariates serve as predictors of the…

Descriptors: Item Response Theory, Regression (Statistics), Models, Methods