Showing all 15 results
Peer reviewed
Wheeler, Jordan M.; Engelhard, George; Wang, Jue – Measurement: Interdisciplinary Research and Perspectives, 2022
Objectively scoring constructed-response items on educational assessments has long been a challenge due to the use of human raters. Even well-trained raters using a rubric can inaccurately assess essays. Unfolding models measure raters' scoring accuracy by capturing the discrepancy between criterion and operational ratings, placing essays on an…
Descriptors: Accuracy, Scoring, Statistical Analysis, Models
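For context, a minimal sketch of one common single-peaked unfolding response function, the hyperbolic cosine model; the abstract does not give the authors' exact specification, so this is an illustrative assumption. With theta_i the location of essay i on the latent continuum, delta_j a criterion location, and gamma a unit parameter:

P(X_{ij} = 1 \mid \theta_i, \delta_j) = \frac{\exp(\gamma)}{\exp(\gamma) + 2\cosh(\theta_i - \delta_j)}

The probability of agreement peaks when theta_i and delta_j coincide and falls off symmetrically as the discrepancy grows.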
Michael Matta; Milena A. Keller-Margulis; Sterett H. Mercer – Grantee Submission, 2022
Although researchers have investigated technical adequacy and usability of written-expression curriculum-based measures (WE-CBM), the economic implications of different scoring approaches have largely been ignored. The absence of such knowledge can undermine the effective allocation of resources and lead to the adoption of suboptimal measures for…
Descriptors: Cost Effectiveness, Scoring, Automation, Writing Tests
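As a hedged illustration of how such cost comparisons are typically framed, a sketch in Python; every figure below is a hypothetical placeholder, not a value from the study.

def cost_per_sample(minutes_per_sample: float, hourly_rate: float,
                    fixed_cost: float, n_samples: int) -> float:
    """Average cost of scoring one writing sample under a given approach."""
    variable = (minutes_per_sample / 60) * hourly_rate * n_samples
    return (fixed_cost + variable) / n_samples

# Hypothetical comparison: hand scoring vs. an automated pipeline with a license fee.
print(cost_per_sample(minutes_per_sample=5, hourly_rate=25, fixed_cost=0, n_samples=1000))
print(cost_per_sample(minutes_per_sample=0.1, hourly_rate=25, fixed_cost=1500, n_samples=1000))

The point of such an exercise is that automated scoring trades a fixed up-front cost for a much lower marginal cost per sample.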
Peer reviewed
Keller-Margulis, Milena A.; Mercer, Sterett H.; Matta, Michael – Reading and Writing: An Interdisciplinary Journal, 2021
Existing approaches to measuring writing performance are insufficient in terms of both technical adequacy and feasibility for use as a screening measure. This study examined the validity and diagnostic accuracy of several approaches to automated text evaluation as well as written expression curriculum-based measurement (WE-CBM) to determine…
Descriptors: Writing Evaluation, Validity, Automation, Curriculum Based Assessment
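A minimal sketch of how the diagnostic accuracy of competing scoring approaches can be compared via ROC AUC; the data and variable names are illustrative assumptions, not the study's.

from sklearn.metrics import roc_auc_score

at_risk = [1, 0, 1, 0, 0, 1]                    # 1 = later failed the criterion test (hypothetical)
auto_scores = [2.1, 4.0, 3.6, 3.5, 4.2, 2.5]    # automated text-evaluation scores (hypothetical)
wecbm_scores = [18, 40, 15, 33, 45, 20]         # WE-CBM correct word sequences (hypothetical)

# Lower writing scores should indicate risk, so negate scores before computing AUC.
print(roc_auc_score(at_risk, [-s for s in auto_scores]))   # ~0.89 on this toy data
print(roc_auc_score(at_risk, [-s for s in wecbm_scores]))  # 1.0 on this toy data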
Mercer, Sterett H.; Cannon, Joanna E.; Squires, Bonita; Guo, Yue; Pinco, Ella – Canadian Journal of School Psychology, 2021
We examined the extent to which automated written expression curriculum-based measurement (aWE-CBM) can be used to accurately computer-score student writing samples for screening and progress monitoring. Students (n = 174) with learning difficulties in Grades 1 to 12 who received 1:1 academic tutoring through a community-based organization…
Descriptors: Curriculum Based Assessment, Automation, Scoring, Writing Tests
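For illustration, a simplified sketch of the metrics that automated WE-CBM scoring approximates; this is an assumption about the metric family, not the authors' scoring engine, and real Correct Word Sequences scoring also checks grammar and punctuation. The lexicon is a stand-in for a full dictionary.

import re

def wecbm_metrics(text: str, lexicon: set[str]) -> dict[str, int]:
    words = re.findall(r"[A-Za-z']+", text.lower())
    tww = len(words)                                   # Total Words Written
    correct = [w in lexicon for w in words]
    wsc = sum(correct)                                 # Words Spelled Correctly
    # Simplified Correct Word Sequences: adjacent pairs where both words are in the lexicon.
    cws = sum(1 for a, b in zip(correct, correct[1:]) if a and b)
    return {"TWW": tww, "WSC": wsc, "CWS": cws}

lexicon = {"the", "dog", "ran", "fast"}
print(wecbm_metrics("The dog ran fsat", lexicon))      # {'TWW': 4, 'WSC': 3, 'CWS': 2}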
Keller-Margulis, Milena A.; Mercer, Sterett H.; Matta, Michael – Grantee Submission, 2021
Existing approaches to measuring writing performance are insufficient in terms of both technical adequacy and feasibility for use as a screening measure. This study examined the validity and diagnostic accuracy of several approaches to automated text evaluation as well as written expression curriculum-based measurement (WE-CBM) to determine…
Descriptors: Writing Evaluation, Validity, Automation, Curriculum Based Assessment
Mercer, Sterett H.; Cannon, Joanna E.; Squires, Bonita; Guo, Yue; Pinco, Ella – Grantee Submission, 2021
We examined the extent to which automated written expression curriculum-based measurement (aWE-CBM) can be used to accurately computer-score student writing samples for screening and progress monitoring. Students (n = 174) with learning difficulties in Grades 1-12 who received 1:1 academic tutoring through a community-based organization completed…
Descriptors: Curriculum Based Assessment, Automation, Scoring, Writing Tests
Yi Gui – ProQuest LLC, 2024
This study explores using transfer learning in machine learning for natural language processing (NLP) to create generic automated essay scoring (AES) models, providing instant online scoring for statewide writing assessments in K-12 education. The goal is to develop an instant online scorer that is generalizable to any prompt, addressing the…
Descriptors: Writing Tests, Natural Language Processing, Writing Evaluation, Scoring
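A hedged sketch of the general transfer-learning recipe for AES (fine-tuning a pretrained transformer with a regression head); the model choice, data, and hyperparameters are illustrative assumptions, not the dissertation's setup.

import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tok = AutoTokenizer.from_pretrained("bert-base-uncased")
# num_labels=1 gives a single regression output; Hugging Face then uses MSE loss.
model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased", num_labels=1)

essays = ["First essay text...", "Second essay text..."]     # placeholder essays
scores = torch.tensor([[3.0], [5.0]])                        # placeholder holistic scores

batch = tok(essays, padding=True, truncation=True, return_tensors="pt")
out = model(**batch, labels=scores)
out.loss.backward()   # an optimizer step would follow in a full training loop

Because the pretrained encoder already carries general language knowledge, a model fine-tuned this way on essays from many prompts can, in principle, score essays on prompts it has never seen.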
Wilson, Joshua; Rodrigues, Jessica – Grantee Submission, 2020
The present study leveraged advances in automated essay scoring (AES) technology to explore a proof of concept for a writing screener using the "Project Essay Grade" (PEG) program. First, the study investigated the extent to which an AES-scored multi-prompt writing screener accurately classified students as at risk of failing a Common…
Descriptors: Writing Tests, Screening Tests, Classification, Accuracy
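A minimal sketch of the classification-accuracy logic behind such a screener, with an assumed cut score; "at risk" is flagged when the screener score falls below the cut, and all numbers are hypothetical.

def screen(scores, outcomes, cut):
    """scores: AES screener scores; outcomes: True if the student later failed the test."""
    tp = sum(s < cut and o for s, o in zip(scores, outcomes))       # flagged, failed
    fn = sum(s >= cut and o for s, o in zip(scores, outcomes))      # missed, failed
    tn = sum(s >= cut and not o for s, o in zip(scores, outcomes))  # not flagged, passed
    fp = sum(s < cut and not o for s, o in zip(scores, outcomes))   # flagged, passed
    return tp / (tp + fn), tn / (tn + fp)                           # sensitivity, specificity

print(screen([380, 395, 410, 450], [True, False, True, False], cut=400))  # (0.5, 0.5)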
Peer reviewed
Yao, Lili; Haberman, Shelby J.; Zhang, Mo – ETS Research Report Series, 2019
Many assessments of writing proficiency that aid in making high-stakes decisions consist of several essay tasks evaluated by a combination of human holistic scores and computer-generated scores for essay features such as the rate of grammatical errors per word. Under typical conditions, a summary writing score is provided by a linear combination…
Descriptors: Prediction, True Scores, Computer Assisted Testing, Scoring
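In the notation such composite scores suggest, with H_i the human holistic score for examinee i and F_{im} the machine-scored essay features, the reported summary score typically has the form (a sketch, not necessarily the authors' exact weighting):

S_i = w_0 + w_H H_i + \sum_m w_m F_{im}

where the weights can be chosen, for example, to best predict the examinee's true writing score, consistent with the Prediction and True Scores descriptors below.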
Ricker-Pedley, Kathryn L. – Educational Testing Service, 2011
A pseudo-experimental study was conducted to examine the link between rater accuracy calibration performances and subsequent accuracy during operational scoring. The study asked 45 raters to score a 75-response calibration set and then a 100-response (operational) set of responses from a retired Graduate Record Examinations® (GRE®) writing…
Descriptors: Scoring, Accuracy, College Entrance Examinations, Writing Tests
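One simple way to quantify the accuracy such a study tracks is exact and adjacent agreement between a rater's operational scores and the criterion scores; a sketch with illustrative data.

def agreement_rates(rater_scores, criterion_scores):
    n = len(criterion_scores)
    exact = sum(r == c for r, c in zip(rater_scores, criterion_scores)) / n
    adjacent = sum(abs(r - c) <= 1 for r, c in zip(rater_scores, criterion_scores)) / n
    return exact, adjacent

print(agreement_rates([4, 3, 5, 2], [4, 4, 5, 1]))  # -> (0.5, 1.0)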
Peer reviewed
Kim, Young-Suk; Al Otaiba, Stephanie; Wanzek, Jeanne; Gatlin, Brandy – Journal of Educational Psychology, 2015
We had 3 aims in the present study: (a) to examine the dimensionality of various evaluative approaches to scoring writing samples (e.g., quality, productivity, and curriculum-based measurement [CBM] writing scoring), (b) to investigate unique language and cognitive predictors of the identified dimensions, and (c) to examine the gender gap in the…
Descriptors: Writing (Composition), Gender Differences, Curriculum Based Assessment, Scoring
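A hedged sketch of the kind of dimensionality analysis the abstract describes, using simulated data with one common writing factor; the study's actual method (e.g., confirmatory factor analysis) may differ.

import numpy as np
from sklearn.decomposition import FactorAnalysis

rng = np.random.default_rng(0)
latent = rng.normal(size=(200, 1))             # one common writing dimension
true_loadings = np.array([[0.8, 0.7, 0.9]])    # quality, productivity, CBM scoring (assumed)
X = latent @ true_loadings + 0.4 * rng.normal(size=(200, 3))

fa = FactorAnalysis(n_components=1).fit(X)
print(fa.components_)   # estimated loadings, close to the generating values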
Park, Yoon Soo – ProQuest LLC, 2011
The use of constructed response (CR) items or performance tasks to assess test takers' ability has grown tremendously over the past decade. Examples of CR items in psychological and educational measurement range from essays and works of art to admissions interviews. However, unlike multiple-choice (MC) items that have predetermined options, CR…
Descriptors: Responses, Scoring, Item Response Theory, Testing
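For context, one standard polytomous IRT model often applied to CR ratings, the generalized partial credit model; the dissertation's actual model may differ. For score category k on item j:

P(X_{ij} = k \mid \theta_i) = \frac{\exp\left(\sum_{v=1}^{k} a_j(\theta_i - b_{jv})\right)}{\sum_{c=0}^{m_j} \exp\left(\sum_{v=1}^{c} a_j(\theta_i - b_{jv})\right)}

with the empty sum for c = 0 defined as 0, a_j the item discrimination, and b_{jv} the step difficulties.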
Kim, Young-Suk; Al Otaiba, Stephanie; Wanzek, Jeanne; Gatlin, Brandy – Grantee Submission, 2015
We had 3 aims in the present study: (a) to examine the dimensionality of various evaluative approaches to scoring writing samples (e.g., quality, productivity, and curriculum-based measurement [CBM] writing scoring), (b) to investigate unique language and cognitive predictors of the identified dimensions, and (c) to examine the gender gap in the…
Descriptors: Writing (Composition), Gender Differences, Curriculum Based Assessment, Scoring
Haberman, Shelby J. – Educational Testing Service, 2011
Alternative approaches are discussed for use of e-rater® to score the TOEFL iBT® Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…
Descriptors: Writing Tests, Scoring, Essays, Language Tests
Peer reviewed
DeCarlo, Lawrence T. – ETS Research Report Series, 2008
Rater behavior in essay grading can be viewed as a signal-detection task, in that raters attempt to discriminate between latent classes of essays, with the latent classes being defined by a scoring rubric. The present report examines basic aspects of an approach to constructed-response (CR) scoring via a latent-class signal-detection model. The…
Descriptors: Scoring, Responses, Test Format, Bias
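In outline, the latent-class signal-detection idea: the rater's perception of an essay from latent class t is a noisy draw centered at d_t, and the observed score k results from ordered criteria c_1 < ... < c_{K-1}. A sketch of the general form, not necessarily the report's exact parameterization:

P(Y_i = k \mid T_i = t) = \Phi(c_k - d_t) - \Phi(c_{k-1} - d_t), \quad c_0 = -\infty, \; c_K = +\infty

This separates a rater's discrimination among the latent essay classes (the spacing of the d_t) from where the rater places the scoring criteria, which is what lets the model characterize rater bias and accuracy separately.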