ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	5

Descriptor

Essays	19
Scoring	11
Writing Evaluation	11
Test Reliability	9
Higher Education	8
Test Validity	6
Essay Tests	5
High Schools	5
Interrater Reliability	5
Reliability	5
Scores	5
Writing Skills	5
Evaluation Criteria	4
Evaluation Methods	4
Student Evaluation	4
Evaluators	3
High School Students	3
Holistic Approach	3
Holistic Evaluation	3
Rating Scales	3
Research Reports	3
Secondary School Teachers	3
Validity	3
Accuracy	2
Achievement Tests	2
More ▼

Source

Grantee Submission	2
International Educational…	2
AERA Online Paper Repository	1
Evaluation and Program…	1
Journal of Educational…	1

Publication Type

Speeches/Meeting Papers	19
Reports - Research	13
Opinion Papers	3
Journal Articles	2
Reports - Evaluative	2
Reports - General	1
Tests/Questionnaires	1

Education Level

High Schools	2
Secondary Education	2
Grade 10	1
Higher Education	1
Postsecondary Education	1

Audience

Practitioners	1
Researchers	1
Teachers	1

Location

California	1
Canada	1

Laws, Policies, & Programs

Assessments and Surveys

Test of Standard Written…

What Works Clearinghouse Rating

Showing 1 to 15 of 19 results Save | Export

Evaluating Quadratic Weighted Kappa as the Standard Performance Metric for Automated Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Doewes, Afrizal; Kurdhi, Nughthoh Arfawi; Saxena, Akrati – International Educational Data Mining Society, 2023

Automated Essay Scoring (AES) tools aim to improve the efficiency and consistency of essay scoring by using machine learning algorithms. In the existing research work on this topic, most researchers agree that human-automated score agreement remains the benchmark for assessing the accuracy of machine-generated scores. To measure the performance of…

Descriptors: Essays, Writing Evaluation, Evaluators, Accuracy

On the Limitations of Human-Computer Agreement in Automated Essay Scoring

Peer reviewed
PDF on ERIC

Download full text

Doewes, Afrizal; Pechenizkiy, Mykola – International Educational Data Mining Society, 2021

Scoring essays is generally an exhausting and time-consuming task for teachers. Automated Essay Scoring (AES) facilitates the scoring process to be faster and more consistent. The most logical way to assess the performance of an automated scorer is by measuring the score agreement with the human raters. However, we provide empirical evidence that…

Descriptors: Man Machine Systems, Automation, Computer Assisted Testing, Scoring

Investigating Human Essay Rating Quality in a Large-Scale Assessment Using Many-Facet Rasch Measurement

Peer reviewed

Direct link

Zhang, Xiuyuan – AERA Online Paper Repository, 2019

The main purpose of the study is to evaluate the qualities of human essay ratings for a large-scale assessment using Rasch measurement theory. Specifically, Many-Facet Rasch Measurement (MFRM) was utilized to examine the rating scale category structure and provide important information about interpretations of ratings in the large-scale…

Descriptors: Essays, Evaluators, Writing Evaluation, Reliability

Annotation and Classification of Argumentative Writing Revisions

Peer reviewed
PDF on ERIC

Download full text

Zhang, Fan; Litman, Diane – Grantee Submission, 2015

This paper explores the annotation and classification of students' revision behaviors in argumentative writing. A sentence-level revision schema is proposed to capture why and how students make revisions. Based on the proposed schema, a small corpus of student essays and revisions was annotated. Studies show that manual annotation is reliable with…

Descriptors: Notetaking, Classification, Persuasive Discourse, Revision (Written Composition)

Predicting Misalignment between Teachers' and Students' Essay Scores Using Natural Language Processing Tools

Peer reviewed
PDF on ERIC

Download full text

Allen, Laura K.; Crossley, Scott A.; McNamara, Danielle S. – Grantee Submission, 2015

We investigated linguistic factors that relate to misalignment between students' and teachers' ratings of essay quality. Students (n = 126) wrote essays and rated the quality of their work. Teachers then provided their own ratings of the essays. Results revealed that students who were less accurate in their self-assessments produced essays that…

Descriptors: Essays, Scores, Natural Language Processing, Interrater Reliability

The University Writing Requirement: A Study of the Reliability of Scores.

Download full text

Sultana, Qaisar – 2001

This study examined the reliability of scores assigned to the essays written by Kentucky students to meet the University Writing Requirement (UWR) at Eastern Kentucky University. Two sets of essays, 50 each, on the same prompt that had been read and scored in 1989 and 1997 by trained UWR scorers were read by 7 UWR scorers in 2000. A correlation…

Descriptors: College Students, Correlation, Essays, Higher Education

The Relationship between Scoring Procedures and Focus and the Reliability of Direct Writing Assessment Scores.

Download full text

Wolfe, Edward W.; Kao, Chi-Wen – 1996

This paper reports the results of an analysis of the relationship between scorer behaviors and score variability. Thirty-six essay scorers were interviewed and asked to perform a think-aloud task as they scored 24 essays. Each comment made by a scorer was coded according to its content focus (i.e. appearance, assignment, mechanics, communication,…

Descriptors: Content Analysis, Educational Assessment, Essays, Evaluation Methods

A Five-Step Evaluation of a Holistic Essay-Evaluation Process.

Carr, Marion – 1983

The faculty of an intensive program of English as a second language for college-bound students, questioning the objectivity of faculty evaluations of non-native college applicants' written essays, assessed the existing evaluation process, reformed it, tested it, and planned for ongoing development. In the first stage, readers read and graded…

Descriptors: Admission Criteria, College Applicants, English (Second Language), Essays

Negotiating Expectations: Writing and Reading Placement Tests.

Download full text

Sullivan, Francis J. – 1987

Contradictions are inherent in the evaluation of placement test writing, contradictions that at once value and devalue writers and writing, readers and reading. In testing, the evidence for the essay's effectiveness rests almost entirely on the writer's choice of linguistic forms. The characteristics that distinguish evaluation in competency…

Descriptors: Essays, Higher Education, Scoring, Student Evaluation

A Comparison of Direct and Indirect Assessments of Writing Skill.

Peer reviewed

Breland, Hunter M.; Gaynor, Judith L. – Journal of Educational Measurement, 1979

Over 2,000 writing samples were collected from four undergraduate institutions and compared, where possible, with scores on a multiple-choice test. High correlations between ratings of the writing samples and multiple-choice test scores were obtained. Samples contributed substantially to the prediction of both college grades and writing…

Descriptors: Achievement Tests, Comparative Testing, Correlation, Essay Tests

Essay Topic Difficulty in Relation to Scoring Models.

Dovell, Patricia; Buhr, Dianne C. – 1986

This study examined the difficulty level of essay topics used in the large-scale assessment of writing in relation to five different scoring models, and sought to determine what effects the scoring models would have on passing rates. In model one, examinee's score is the direct result of a score assigned by the reader or the sum of scores assigned…

Descriptors: College Students, Difficulty Level, Essay Tests, Essays

The Unreliability of Data in the California Community College System. AIR Forum 1979 Paper.

Turner, John D.; Booth, Mary W. – 1979

A chronicle of the problems faced in an attempt to collect data on sociology curriculum trends in California's community college system is presented. The project was initiated in an effort to determine if other colleges in the system were experiencing the same difficulties with curriculum and enrollment in sociology courses being encountered by…

Descriptors: Centralization, Community Colleges, Conference Reports, Curriculum Evaluation

Response Length and Quality in the Grading of Essay Tests.

Tollefson, Nona; Tracy, D. B. – 1979

The validity and reliability of essay scores were examined by comparing the mean scores assigned to good and poor quality essay responses of different lengths written by high school sophomores. In-service and pre-service social studies teachers graded essay responses to a test question requiring knowledge of the Constitutional provisions for…

Descriptors: Essay Tests, Essays, Evaluation Criteria, High Schools

Measures of High School Students' Expository Writing: Direct and Indirect Strategies.

Smith, Laura Spooner – 1979

The relationship among writing assessment strategies which may be applicable to competency based testing was examined in a study involving 128 high school students in six English classes in grades 11 and 12. Each student wrote two essays of at least 200 words on topics designed to test expository, or explanatory writing, and completed an objective…

Descriptors: Content Analysis, Essay Tests, Essays, Evaluation Criteria

Alternative Scoring Systems for Predicting Criterion Group Membership.

Winters, Lynn – 1979

Four systems for scoring student essays were used to classify eleventh grade and undergraduate students according to writing ability. The reliabilities of the raters and the validities of the systems in classifying students were emphasized. Two analytic scoring systems--which assume that quality writing is characterized by the inclusion of certain…

Descriptors: Academic Ability, Analytical Criticism, Essays, Evaluation Criteria

Previous Page | Next Page »

Pages: 1 | 2

Doewes, Afrizal	2
Allen, Laura K.	1
Bobie, Allen	1
Booth, Mary W.	1
Breland, Hunter M.	1
Buhr, Dianne C.	1
Carr, Marion	1
Crossley, Scott A.	1
Dovell, Patricia	1
Gaynor, Judith L.	1
Kao, Chi-Wen	1
Kurdhi, Nughthoh Arfawi	1
Litman, Diane	1
McNamara, Danielle S.	1
Pechenizkiy, Mykola	1
Powills, Judith A.	1
Russikoff, Karen A.	1
Saxena, Akrati	1
Smith, Laura Spooner	1
Sullivan, Francis J.	1
Sultana, Qaisar	1
Thompson, Ronald W.	1
Tollefson, Nona	1
Tracy, D. B.	1
More ▼