ERIC - Search Results

Showing all 6 results

Prediction of Essay Scores from Writing Process and Product Features Using Data Mining Methods

Peer reviewed

Sinharay, Sandip; Zhang, Mo; Deane, Paul – Applied Measurement in Education, 2019

Analysis of keystroke logging data is of increasing interest, as evident from a substantial amount of recent research on the topic. Some of the research on keystroke logging data has focused on the prediction of essay scores from keystroke logging features, but linear regression is the only prediction method that has been used in this research.…

Descriptors: Scores, Prediction, Writing Processes, Data Analysis

Do the TOEFL iBT® Section Scores Provide Value-Added Information to Stakeholders?

Peer reviewed

Sawaki, Yasuyo; Sinharay, Sandip – Language Testing, 2018

The present study examined the reliability of the reading, listening, speaking, and writing section scores for the TOEFL iBT® test and their interrelationship in order to collect empirical evidence to support, respectively, the "generalization" inference and the "explanation" inference in the TOEFL iBT validity argument…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing

Automated Trait Scores for "TOEFL"® Writing Tasks. Research Report. ETS RR-15-14

Peer reviewed

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of "TOEFL iBT"® independent and integrated tasks. In this study we explored the psychometric added value of reporting four trait scores for each of these two tasks, beyond the total e-rater score. The four trait scores are word choice, grammatical…

Descriptors: Writing Tests, Scores, Language Tests, English (Second Language)

Automated Trait Scores for "GRE"® Writing Tasks. Research Report. ETS RR-15-15

Peer reviewed

Attali, Yigal; Sinharay, Sandip – ETS Research Report Series, 2015

The "e-rater"® automated essay scoring system is used operationally in the scoring of the argument and issue tasks that form the Analytical Writing measure of the "GRE"® General Test. For each of these 2 tasks, this study explored the value added of reporting 4 trait scores over the total e-rater score.…

Descriptors: Scores, Computer Assisted Testing, Computer Software, Grammar

Fit of Item Response Theory Models: A Survey of Data from Several Operational Tests. Research Report. ETS RR-11-29

Sinharay, Sandip; Haberman, Shelby J.; Jia, Helena – Educational Testing Service, 2011

Standard 3.9 of the "Standards for Educational and Psychological Testing" (American Educational Research Association, American Psychological Association, & National Council for Measurement in Education, 1999) demands evidence of model fit when an item response theory (IRT) model is used to make inferences from a data set. We applied two recently…

Descriptors: Item Response Theory, Goodness of Fit, Statistical Analysis, Language Tests

First Language of Examinees and Its Relationship to Differential Item Functioning. Research Report. ETS RR-09-11

Sinharay, Sandip; Dorans, Neil J.; Liang, Longjuan – Educational Testing Service, 2009

To ensure fairness, it is important to better understand the relationship of language proficiency to standard psychometric analysis procedures. This paper examines how results of differential item functioning (DIF) analysis are affected by an increase in the proportion of examinees who report that English is not their first language in the…

Descriptors: Test Bias, Language Proficiency, English (Second Language), Measurement