Publication Date
In 2025: 1
Since 2024: 4
Since 2021 (last 5 years): 11
Since 2016 (last 10 years): 26
Since 2006 (last 20 years): 46
Descriptor
Essays: 64
Reliability: 64
Writing Evaluation: 27
Validity: 26
Foreign Countries: 23
Scoring: 23
Comparative Analysis: 15
Second Language Learning: 15
Scores: 14
English (Second Language): 13
Higher Education: 13
Publication Type
Journal Articles: 51
Reports - Research: 44
Reports - Descriptive: 9
Speeches/Meeting Papers: 5
Reports - Evaluative: 4
Tests/Questionnaires: 4
Opinion Papers: 3
Guides - General: 1
Non-Print Media: 1
Location
United Kingdom (England): 4
Australia: 3
Turkey: 3
China: 2
Connecticut: 2
Iran: 2
New Hampshire: 2
New York: 2
Rhode Island: 2
Vermont: 2
California: 1
Laws, Policies, & Programs
Every Student Succeeds Act…: 2
Dadi Ramesh; Suresh Kumar Sanampudi – European Journal of Education, 2024
Automatic essay scoring (AES) is an essential educational application of natural language processing. This automation alleviates the grading burden while increasing the reliability and consistency of assessment. With advances in text-embedding libraries and neural network models, AES systems have achieved good accuracy.…
Descriptors: Scoring, Essays, Writing Evaluation, Memory
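For context on the embedding-plus-neural-network pipeline this abstract describes, here is a minimal Python sketch, not the authors' system; the encoder name and toy data are illustrative assumptions:

from sentence_transformers import SentenceTransformer  # assumed installed
from sklearn.linear_model import Ridge

essays = ["First practice essay ...", "Second practice essay ...", "Third ..."]
scores = [3.0, 4.0, 2.0]  # human-assigned holistic scores (toy data)

encoder = SentenceTransformer("all-MiniLM-L6-v2")  # a publicly available encoder
X = encoder.encode(essays)           # one dense vector per essay

model = Ridge().fit(X, scores)       # simple stand-in for a neural scorer
print(model.predict(encoder.encode(["A new, unseen essay ..."])))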
Peer Overmarking and Insufficient Diagnosticity: The Impact of the Rating Method for Peer Assessment
Van Meenen, Florence; Coertjens, Liesje; Van Nes, Marie-Claire; Verschuren, Franck – Advances in Health Sciences Education, 2022
The present study explores two rating methods for peer assessment (analytical rating using criteria and comparative judgement) in light of concurrent validity, reliability and insufficient diagnosticity (i.e. the degree to which substandard work is recognised by the peer raters). During a second-year undergraduate course, students wrote a one-page…
Descriptors: Evaluation Methods, Peer Evaluation, Accuracy, Evaluation Criteria
Elif Sari – International Journal of Assessment Tools in Education, 2024
Employing G-theory and rater interviews, the study investigated how a high-stakes writing assessment procedure (i.e., a single-task, single-rater, and holistic scoring procedure) impacted the variability and reliability of its scores within the Turkish higher education context. Thirty-two essays written on two different writing tasks (i.e.,…
Descriptors: Foreign Countries, High Stakes Tests, Writing Evaluation, Scores
Doewes, Afrizal; Pechenizkiy, Mykola – International Educational Data Mining Society, 2021
Scoring essays is generally an exhausting and time-consuming task for teachers. Automated Essay Scoring (AES) makes the scoring process faster and more consistent. The most logical way to assess the performance of an automated scorer is to measure its score agreement with human raters. However, we provide empirical evidence that…
Descriptors: Man Machine Systems, Automation, Computer Assisted Testing, Scoring
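The "score agreement with human raters" mentioned here is conventionally reported as quadratically weighted kappa (QWK). A minimal sketch with invented ratings, using scikit-learn:

from sklearn.metrics import cohen_kappa_score

human_scores   = [2, 3, 4, 4, 1, 3, 5, 2]   # illustrative ratings only
machine_scores = [2, 3, 3, 4, 2, 3, 5, 3]

qwk = cohen_kappa_score(human_scores, machine_scores, weights="quadratic")
print(f"QWK agreement: {qwk:.3f}")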
Fatih Yavuz; Özgür Çelik; Gamze Yavas Çelik – British Journal of Educational Technology, 2025
This study investigates the validity and reliability of generative large language models (LLMs), specifically ChatGPT and Google's Bard, in grading student essays in higher education based on an analytical grading rubric. A total of 15 experienced English as a foreign language (EFL) instructors and two LLMs were asked to evaluate three student…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Computational Linguistics
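Rubric-based grading with an LLM, as studied here, can be reproduced in outline through a chat-completion API. A hedged sketch only: the study used ChatGPT and Bard directly, and the model name, rubric text, and API choice below are assumptions:

from openai import OpenAI

client = OpenAI()  # requires OPENAI_API_KEY in the environment

rubric = "Score the essay 1-5 on content, organization, and language use."
essay = "..."  # student essay text

resp = client.chat.completions.create(
    model="gpt-4o-mini",  # assumed model; any capable chat model would do
    messages=[
        {"role": "system", "content": f"You are an EFL essay rater. Rubric: {rubric}"},
        {"role": "user", "content": essay},
    ],
)
print(resp.choices[0].message.content)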
Ramon-Casas, Marta; Nuño, Neus; Pons, Ferran; Cunillera, Toni – Assessment & Evaluation in Higher Education, 2019
This article presents an empirical evaluation of the validity and reliability of a peer-assessment activity to improve academic writing competences. Specifically, we explored a large group of psychology undergraduate students with different initial writing skills. Participants (n = 365) produced two different essays, which were evaluated by their…
Descriptors: Peer Evaluation, Validity, Reliability, Writing Skills
Shin, Jinnie; Gierl, Mark J. – Language Testing, 2021
Automated essay scoring (AES) has emerged as a secondary or even sole marker for many high-stakes educational assessments, in both native and non-native testing, owing to remarkable advances in feature engineering using natural language processing, machine learning, and deep neural algorithms. The purpose of this study is to compare the effectiveness…
Descriptors: Scoring, Essays, Writing Evaluation, Computer Software
Walland, Emma – Research Matters, 2022
In this article, I report on examiners' views and experiences of using Pairwise Comparative Judgement (PCJ) and Rank Ordering (RO) as alternatives to traditional analytical marking for GCSE English Language essays. Fifteen GCSE English Language examiners took part in the study. After each had judged 100 pairs of essays using PCJ and eight packs of…
Descriptors: Essays, Grading, Writing Evaluation, Evaluators
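Pairwise Comparative Judgement data of this kind are usually scaled with a Bradley-Terry model: each script i gets a latent quality theta_i, and P(i beats j) = exp(theta_i) / (exp(theta_i) + exp(theta_j)). A minimal NumPy sketch with invented pair outcomes, not the study's data or software:

import numpy as np

pairs = [(0, 1), (0, 2), (1, 2), (2, 3), (0, 3), (1, 3)]  # (winner, loser)
n_items = 4
theta = np.zeros(n_items)

for _ in range(500):                      # plain gradient ascent on the log-likelihood
    grad = np.zeros(n_items)
    for w, l in pairs:
        p_w = np.exp(theta[w]) / (np.exp(theta[w]) + np.exp(theta[l]))
        grad[w] += 1 - p_w                # winner: push quality up
        grad[l] -= 1 - p_w                # loser: push quality down
    theta += 0.1 * grad
    theta -= theta.mean()                 # fix the scale's origin (identifiability)

print(np.argsort(-theta))                 # scripts ranked from best to worst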
Ullmann, Thomas Daniel – International Journal of Artificial Intelligence in Education, 2019
Reflective writing is an important educational practice for training reflective thinking. Currently, researchers must analyze these writings manually, which limits both practice and research because the analysis is time- and resource-consuming. This study evaluates whether machine learning can be used to automate this manual analysis. The study investigates…
Descriptors: Reflection, Writing (Composition), Writing Evaluation, Automation
Chen, Dandan; Hebert, Michael; Wilson, Joshua – American Educational Research Journal, 2022
We used multivariate generalizability theory to examine the reliability of hand-scoring and automated essay scoring (AES) and to identify how these scoring methods could be used in conjunction to optimize writing assessment. Students (n = 113) included subsamples of struggling writers and non-struggling writers in Grades 3-5 drawn from a larger…
Descriptors: Reliability, Scoring, Essays, Automation
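In the simplest crossed persons-by-raters (p x r) design from generalizability theory, the relative G coefficient is sigma^2_p / (sigma^2_p + sigma^2_pr,e / n_r). A small sketch with assumed variance components; the study's multivariate design is more elaborate:

def g_coefficient(var_person: float, var_residual: float, n_raters: int) -> float:
    """Relative G coefficient for a crossed p x r design."""
    return var_person / (var_person + var_residual / n_raters)

# Illustrative components: adding a second rater raises dependability.
print(g_coefficient(var_person=0.50, var_residual=0.30, n_raters=1))  # 0.625
print(g_coefficient(var_person=0.50, var_residual=0.30, n_raters=2))  # ~0.769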
Husain Abdulhay; Moussa Ahmadian – rEFLections, 2024
This study attempted to discern the factor structure of the achievement goal orientation and goal structure constructs across the domain-specific task of essay writing in an Iranian EFL context. A convenience sample of 116 public university learners participated in a single-session, in-class study of an essay writing sampling and an immediate…
Descriptors: Foreign Countries, Factor Structure, Goal Orientation, Factor Analysis
Zhang, Xiuyuan – AERA Online Paper Repository, 2019
The main purpose of the study is to evaluate the qualities of human essay ratings for a large-scale assessment using Rasch measurement theory. Specifically, Many-Facet Rasch Measurement (MFRM) was utilized to examine the rating scale category structure and provide important information about interpretations of ratings in the large-scale…
Descriptors: Essays, Evaluators, Writing Evaluation, Reliability
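The Many-Facet Rasch Measurement model used in analyses like this, in its usual rating-scale form, models the log-odds of adjacent rating categories as

\log \frac{P_{nijk}}{P_{nij(k-1)}} = \theta_n - \delta_i - \alpha_j - \tau_k

where \theta_n is examinee proficiency, \delta_i is task difficulty, \alpha_j is rater severity, and \tau_k is the step threshold for category k.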
Song, Yi; Deane, Paul; Beigman Klebanov, Beata – ETS Research Report Series, 2017
This project focuses on laying the foundations for automated analysis of argumentation schemes, supporting identification and classification of the arguments being made in a text, for the purpose of scoring the quality of written analyses of arguments. We developed annotation protocols for 20 argument prompts from a college-level test under the…
Descriptors: Scoring, Automation, Persuasive Discourse, Documentation
Ghanbari, Nasim; Barati, Hossein – Language Testing in Asia, 2020
The present study reports the process of development and validation of a rating scale in the Iranian EFL academic writing assessment context. To achieve this goal, the study was conducted in three distinct phases. Early in the study, the researcher interviewed a number of raters in different universities. Next, a questionnaire was developed based…
Descriptors: Rating Scales, Writing Evaluation, English for Academic Purposes, Second Language Learning
Sanders, Joe Sutliff – Children's Literature in Education, 2015
A recent surge of conversation about children's nonfiction reveals a conflict between two positions that do not at first appear to be opposed: modeling inquiry and presenting authoritative facts. Tanya Lee Stone, the author of the Sibert Award-winning "Almost Astronauts" (2009), has recently alluded to that tension and expressed a…
Descriptors: Childrens Literature, Nonfiction, Authors, Inquiry