ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	4
Since 2016 (last 10 years)	12
Since 2006 (last 20 years)	34

Source

ETS Research Report Series	9
Journal of Educational and…	7
Grantee Submission	6
Educational Testing Service	5
Journal of Educational…	4
Educational and Psychological…	2
Applied Psychological…	1
Educational Measurement:…	1
Language Testing	1
Psychometrika	1

Author

Sinharay, Sandip	37
Haberman, Shelby J.	5
Johnson, Matthew S.	5
Haberman, Shelby	3
Holland, Paul W.	3
Choi, Seung W.	2
Dorans, Neil J.	2
Feng, Ying	2
Kim, Dong-In	2
Lee, Yi-Hsuan	2
Powers, Donald E.	2
Puhan, Gautam	2
Saldivia, Luis	2
Simpson, Annabelle	2
Wan, Ping	2
Weng, Vincent	2
von Davier, Matthias	2
Almond, Russell	1
Blew, Edwin O.	1
Curley, Edward	1
Duong, Minh Q.	1
Feigenbaum, Miriam	1
Ginuta, Anthony	1
Giunta, Anthony	1
Grant, Mary C.	1
More ▼

Publication Type

Reports - Research	30
Journal Articles	26
Reports - Evaluative	5
Numerical/Quantitative Data	3
Opinion Papers	1
Reports - Descriptive	1
Tests/Questionnaires	1

Education Level

Elementary Education	4
Middle Schools	3
Secondary Education	2
Elementary Secondary Education	1
Grade 4	1
Grade 5	1
Grade 8	1
High Schools	1
Intermediate Grades	1
Junior High Schools	1

Audience

Location

Chile	2
Colombia	2
Ecuador	2

Laws, Policies, & Programs

Assessments and Surveys

Indiana Statewide Testing for…	2
National Assessment of…	2
Pre Professional Skills Tests	1
SAT (College Admission Test)	1
Test of English for…	1

What Works Clearinghouse Rating

Sinharay, Sandip X

Showing 31 to 37 of 37 results Save | Export

Using Past Data to Enhance Small-Sample DIF Estimation: A Bayesian Approach. Research Report. ETS RR-06-09

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip; Dorans, Neil J.; Grant, Mary C.; Blew, Edwin O.; Knorr, Colleen M. – ETS Research Report Series, 2006

The application of the Mantel-Haenszel test statistic (and other popular DIF-detection methods) to determine DIF requires large samples, but test administrators often need to detect DIF with small samples. There is no universally agreed upon statistical approach for performing DIF analysis with small samples; hence there is substantial scope of…

Descriptors: Test Bias, Computation, Sample Size, Bayesian Statistics

Experiences with Markov Chain Monte Carlo Convergence Assessment in Two Psychometric Examples

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2004

There is an increasing use of Markov chain Monte Carlo (MCMC) algorithms for fitting statistical models in psychometrics, especially in situations where the traditional estimation techniques are very difficult to apply. One of the disadvantages of using an MCMC algorithm is that it is not straightforward to determine the convergence of the…

Descriptors: Psychometrics, Mathematics, Inferences, Markov Processes

Testing the Untestable Assumptions of the Chain and Poststratification Equating Methods for the NEAT Design. Research Report. ETS RR-06-17

Peer reviewed
PDF on ERIC

Download full text

Holland, Paul W.; von Davier, Alina A.; Sinharay, Sandip; Han, Ning – ETS Research Report Series, 2006

This paper focuses on the Non-Equivalent Groups with Anchor Test (NEAT) design for test equating and on two classes of observed--score equating (OSE) methods--chain equating (CE) and poststratification equating (PSE). These two classes of methods reflect two distinctly different ways of using the information provided by the anchor test for…

Descriptors: Equated Scores, Test Items, Statistical Analysis, Comparative Analysis

Establishing the Validity of TOEIC Bridge™ Test Scores for Students in Colombia, Chile, and Ecuador. Research Report. ETS RR-08-58

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip; Feng, Ying; Saldivia, Luis; Powers, Donald E.; Ginuta, Anthony; Simpson, Annabelle; Weng, Vincent – ETS Research Report Series, 2008

The validity of TOEIC Bridge™ scores as a measure of English language skill was examined from the standpoint of a unified concept of test validity. In this study, more than 6,000 test takers in 3 Latin American countries (Chile, Colombia, and Ecuador) took 1 form of the TOEIC Bridge test, and their scores were compared to additional information…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Test Validity

Model Diagnostics for Bayesian Networks

Peer reviewed

Direct link

Sinharay, Sandip – Journal of Educational and Behavioral Statistics, 2006

Bayesian networks are frequently used in educational assessments primarily for learning about students' knowledge and skills. There is a lack of works on assessing fit of Bayesian networks. This article employs the posterior predictive model checking method, a popular Bayesian model checking tool, to assess fit of simple Bayesian networks. A…

Descriptors: Models, Educational Assessment, Diagnostic Tests, Evaluation Methods

Assessing Fit of Models with Discrete Proficiency Variable in Educational Assessment. Research Report. RR-04-07

Download full text

Sinharay, Sandip; Almond, Russell; Yan, Duanli – Educational Testing Service, 2004

Model checking is a crucial part of any statistical analysis. As educators tie models for testing to cognitive theory of the domains, there is a natural tendency to represent participant proficiencies with latent variables representing the presence or absence of the knowledge, skills, and proficiencies to be tested (Mislevy, Almond, Yan, &…

Descriptors: Statistical Analysis, Epistemology, Educational Assessment, Item Response Theory

Model Diagnostics for Bayesian Networks. Research Report. ETS RR-04-17

Peer reviewed
PDF on ERIC

Download full text

Sinharay, Sandip – ETS Research Report Series, 2004

Assessing fit of psychometric models has always been an issue of enormous interest, but there exists no unanimously agreed upon item fit diagnostic for the models. Bayesian networks, frequently used in educational assessments (see, for example, Mislevy, Almond, Yan, & Steinberg, 2001) primarily for learning about students' knowledge and…

Descriptors: Bayesian Statistics, Networks, Models, Goodness of Fit

« Previous Page | Next Page

Pages: 1 | 2 | 3

Statistical Analysis	37
Item Response Theory	16
Test Items	14
Scores	12
Goodness of Fit	11
Identification	9
Bayesian Statistics	8
Cheating	8
Models	7
Simulation	7
Comparative Analysis	6
Licensing Examinations…	6
Computation	5
Correlation	5
Regression (Statistics)	5
Testing Problems	5
Tests	5
Equated Scores	4
Error of Measurement	4
Hypothesis Testing	4
Mathematics Tests	4
Test Bias	4
Achievement Tests	3
Computer Assisted Testing	3
Difficulty Level	3
More ▼