Publication Date
In 2025: 0
Since 2024: 2
Since 2021 (last 5 years): 9
Since 2016 (last 10 years): 31
Since 2006 (last 20 years): 65
Descriptor
Essays: 86
Interrater Reliability: 86
Writing Evaluation: 49
Scoring: 37
English (Second Language): 29
Foreign Countries: 28
Second Language Learning: 24
Scoring Rubrics: 21
Comparative Analysis: 19
Evaluators: 19
Correlation: 18
Author
Ben-Simon, Anat: 2
Coniam, David: 2
Crossley, Scott A.: 2
Hixson, Nate: 2
Johnson, Robert L.: 2
McNamara, Danielle S.: 2
Powers, Donald E.: 2
Rhudy, Vaughn: 2
Abbas, Mohsin: 1
Abedi, Jamal: 1
Alexander, R. Curby: 1
Education Level
Higher Education: 28
Postsecondary Education: 21
Secondary Education: 7
High Schools: 5
Elementary Secondary Education: 4
Elementary Education: 3
Grade 11: 2
Grade 8: 2
Middle Schools: 2
Adult Education: 1
Grade 10: 1
Audience
Practitioners: 4
Teachers: 4
Researchers: 1
Location
China: 6
Iran: 3
Germany: 2
Hong Kong: 2
Taiwan: 2
United Kingdom: 2
West Virginia: 2
Australia: 1
California: 1
Delaware: 1
Israel: 1
Abbas, Mohsin; van Rosmalen, Peter; Kalz, Marco – IEEE Transactions on Learning Technologies, 2023
For predicting and improving the quality of essays, text analytic metrics (surface, syntactic, morphological, and semantic features) can be used to provide formative feedback to the students in higher education. In this study, the goal was to identify a sufficient number of features that exhibit a fair proxy of the scores given by the human raters…
Descriptors: Feedback (Response), Automation, Essays, Scoring
Wendler, Cathy; Glazer, Nancy; Cline, Frederick – ETS Research Report Series, 2019
One of the challenges in scoring constructed-response (CR) items and tasks is ensuring that rater drift does not occur during or across scoring windows. Rater drift reflects changes in how raters interpret and use established scoring criteria to assign essay scores. Calibration is a process used to help control rater drift and, as such, serves as…
Descriptors: College Entrance Examinations, Graduate Study, Accuracy, Test Reliability
Doewes, Afrizal; Kurdhi, Nughthoh Arfawi; Saxena, Akrati – International Educational Data Mining Society, 2023
Automated Essay Scoring (AES) tools aim to improve the efficiency and consistency of essay scoring by using machine learning algorithms. In the existing research work on this topic, most researchers agree that human-automated score agreement remains the benchmark for assessing the accuracy of machine-generated scores. To measure the performance of…
Descriptors: Essays, Writing Evaluation, Evaluators, Accuracy
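Several entries above, including Doewes et al., treat human-automated score agreement as the benchmark for AES accuracy. The abstract does not name a specific metric, but quadratic weighted kappa (QWK) is the agreement index most commonly reported for ordinal essay scores, so a minimal pure-Python sketch of it is shown here for illustration only:

```python
# Illustrative sketch: quadratic weighted kappa (QWK), a standard
# human-machine agreement index in automated essay scoring.
from collections import Counter

def quadratic_weighted_kappa(a, b, min_rating, max_rating):
    """QWK between two integer rating lists on the same scale."""
    n = max_rating - min_rating + 1
    total = len(a)
    # Observed rating-pair matrix
    observed = [[0.0] * n for _ in range(n)]
    for x, y in zip(a, b):
        observed[x - min_rating][y - min_rating] += 1
    # Expected matrix from the two marginal histograms
    ha, hb = Counter(a), Counter(b)
    num = den = 0.0
    for i in range(n):
        for j in range(n):
            w = ((i - j) ** 2) / ((n - 1) ** 2)  # quadratic penalty
            e = ha.get(i + min_rating, 0) * hb.get(j + min_rating, 0) / total
            num += w * observed[i][j]
            den += w * e
    return 1.0 - num / den

human = [1, 2, 3, 4, 4, 2]    # hypothetical human scores (scale 1-4)
machine = [1, 2, 3, 4, 3, 2]  # hypothetical machine scores
print(round(quadratic_weighted_kappa(human, machine, 1, 4), 3))  # 0.923
```

QWK penalizes disagreements by the square of their distance on the score scale, so a one-point discrepancy costs far less than a three-point one, which suits ordinal essay rubrics.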
Kumar, Vivekanandan S.; Boulanger, David – International Journal of Artificial Intelligence in Education, 2021
This article investigates the feasibility of using automated scoring methods to evaluate the quality of student-written essays. In 2012, Kaggle hosted an Automated Student Assessment Prize contest to find effective solutions to automated testing and grading. This article: a) analyzes the datasets from the contest -- which contained hand-graded…
Descriptors: Automation, Scoring, Essays, Writing Evaluation
Ramon-Casas, Marta; Nuño, Neus; Pons, Ferran; Cunillera, Toni – Assessment & Evaluation in Higher Education, 2019
This article presents an empirical evaluation of the validity and reliability of a peer-assessment activity to improve academic writing competences. Specifically, we explored a large group of psychology undergraduate students with different initial writing skills. Participants (n = 365) produced two different essays, which were evaluated by their…
Descriptors: Peer Evaluation, Validity, Reliability, Writing Skills
Pruchnic, Jeff; Barton, Ellen; Primeau, Sarah; Trimble, Thomas; Varty, Nicole; Foster, Tanina – Composition Forum, 2021
Over the past two decades, reflective writing has occupied an increasingly prominent position in composition theory, pedagogy, and assessment as researchers have described the value of reflection and reflective writing in college students' development of higher-order writing skills, such as genre conventions (Yancey, "Reflection";…
Descriptors: Reflection, Correlation, Essays, Freshman Composition
Shabani, Enayat A.; Panahi, Jaleh – Language Testing in Asia, 2020
The literature on using scoring rubrics in writing assessment denotes the significance of rubrics as practical and useful means to assess the quality of writing tasks. This study tries to investigate the agreement among rubrics endorsed and used for assessing the essay writing tasks by the internationally recognized tests of English language…
Descriptors: Writing Evaluation, Scoring Rubrics, Scores, Interrater Reliability
Jiyeo Yun – English Teaching, 2023
Studies on automatic scoring systems in writing assessments have also evaluated the relationship between human and machine scores for the reliability of automated essay scoring systems. This study investigated the magnitudes of indices for inter-rater agreement and discrepancy, especially regarding human and machine scoring, in writing assessment.…
Descriptors: Meta Analysis, Interrater Reliability, Essays, Scoring
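The Yun meta-analysis above concerns indices of inter-rater agreement and discrepancy between human and machine scoring. The snippet does not list the specific indices analyzed; as an illustration, three indices commonly reported in writing assessment (exact agreement, adjacent agreement within one score point, and Pearson correlation) can be computed as follows, using hypothetical rater data:

```python
# Illustrative sketch: common inter-rater agreement indices reported
# in writing assessment studies (hypothetical data, not from the study).
import math

def agreement_indices(r1, r2):
    """Return exact agreement, adjacent agreement, and Pearson r."""
    n = len(r1)
    exact = sum(x == y for x, y in zip(r1, r2)) / n
    adjacent = sum(abs(x - y) <= 1 for x, y in zip(r1, r2)) / n
    m1, m2 = sum(r1) / n, sum(r2) / n
    cov = sum((x - m1) * (y - m2) for x, y in zip(r1, r2))
    sd1 = math.sqrt(sum((x - m1) ** 2 for x in r1))
    sd2 = math.sqrt(sum((y - m2) ** 2 for y in r2))
    return exact, adjacent, cov / (sd1 * sd2)

rater_a = [3, 4, 2, 5, 3, 4]  # hypothetical scores from rater A
rater_b = [3, 3, 2, 5, 4, 2]  # hypothetical scores from rater B
exact, adjacent, r = agreement_indices(rater_a, rater_b)
print(round(exact, 3), round(adjacent, 3), round(r, 3))  # 0.5 0.833 0.571
```

Exact and adjacent agreement are easy to interpret but inflate with short score scales, which is one reason meta-analyses also examine chance-corrected and correlational indices.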
Lian Li; Jiehui Hu; Yu Dai; Ping Zhou; Wanhong Zhang – Reading & Writing Quarterly, 2024
This paper proposes to use depth perception to represent raters' decision in holistic evaluation of ESL essays, as an alternative medium to conventional form of numerical scores. The researchers verified the new method's accuracy and inter/intra-rater reliability by inviting 24 ESL teachers to perform different representations when rating 60…
Descriptors: Essays, Holistic Approach, Writing Evaluation, Accuracy
Ballard, Laura – ProQuest LLC, 2017
Rater scoring has an impact on writing test reliability and validity. Thus, there has been a continued call for researchers to investigate issues related to rating (Crusan, 2015). Investigating the scoring process and understanding how raters arrive at particular scores are critical "because the score is ultimately what will be used in making…
Descriptors: Evaluators, Schemata (Cognition), Eye Movements, Scoring Rubrics
Chung-You Tsai; Yi-Ti Lin; Iain Kelsall Brown – Education and Information Technologies, 2024
To determine the impacts of using ChatGPT to assist English as a foreign language (EFL) English college majors in revising essays and the possibility of leading to higher scores and potentially causing unfairness. A prospective, double-blinded, paired-comparison study was conducted in Feb. 2023. A total of 44 students provided 44 original essays…
Descriptors: Artificial Intelligence, Computer Software, Technology Uses in Education, English (Second Language)
Humphry, Stephen Mark; Heldsinger, Sandy – Journal of Educational Measurement, 2019
To capitalize on professional expertise in educational assessment, it is desirable to develop and test methods of rater-mediated assessment that enable classroom teachers to make reliable and informative judgments. Accordingly, this article investigates the reliability of a two-stage method used by classroom teachers to assess primary school…
Descriptors: Essays, Elementary School Students, Writing (Composition), Writing Evaluation
Ke, Xiaohua; Zeng, Yongqiang; Luo, Haijiao – Journal of Educational Measurement, 2016
This article presents a novel method, the Complex Dynamics Essay Scorer (CDES), for automated essay scoring using complex network features. Texts produced by college students in China were represented as scale-free networks (e.g., a word adjacency model) from which typical network features, such as the in-/out-degrees, clustering coefficient (CC),…
Descriptors: Scoring, Automation, Essays, Networks
International Journal of Testing, 2019
These guidelines describe considerations relevant to the assessment of test takers in or across countries or regions that are linguistically or culturally diverse. The guidelines were developed by a committee of experts to help inform test developers, psychometricians, test users, and test administrators about fairness issues in support of the…
Descriptors: Test Bias, Student Diversity, Cultural Differences, Language Usage
He, Tung-hsien – SAGE Open, 2019
This study employed a mixed-design approach and the Many-Facet Rasch Measurement (MFRM) framework to investigate whether rater bias occurred between the onscreen scoring (OSS) mode and the paper-based scoring (PBS) mode. Nine human raters analytically marked scanned scripts and paper scripts using a six-category (i.e., six-criterion) rating…
Descriptors: Computer Assisted Testing, Scoring, Item Response Theory, Essays