Showing all 8 results
Peer reviewed
Direct link
Wang, Jue; Engelhard, George, Jr.; Wolfe, Edward W. – Educational and Psychological Measurement, 2016
The number of performance assessments continues to increase around the world, and it is important to explore new methods for evaluating the quality of ratings obtained from raters. This study describes an unfolding model for examining rater accuracy. Accuracy is defined as the difference between observed and expert ratings. Dichotomous accuracy…
Descriptors: Evaluators, Accuracy, Performance Based Assessment, Models
Peer reviewed
Direct link
Lai, Emily R.; Wolfe, Edward W.; Vickers, Daisy – Educational and Psychological Measurement, 2015
This report summarizes an empirical study that addresses two related topics within the context of writing assessment--illusory halo and how much unique information is provided by multiple analytic scores. Specifically, we address the issue of whether unique information is provided by analytic scores assigned to student writing, beyond what is…
Descriptors: Writing Tests, Scores, Bias, Holistic Approach
Peer reviewed
PDF on ERIC (full text available)
Wolfe, Edward W.; Matthews, Staci; Vickers, Daisy – Journal of Technology, Learning, and Assessment, 2010
This study examined the influence of rater training and scoring context on training time, scoring time, qualifying rate, quality of ratings, and rater perceptions. One hundred twenty raters participated in the study and experienced one of three training contexts: (a) online training in a distributed scoring context, (b) online training in a…
Descriptors: Writing Evaluation, Writing Tests, Qualifications, Program Effectiveness
Chiu, Chris W. T.; Wolfe, Edward W. – 1997
Unstable, and potentially invalid, variance component estimates may result from using only a limited portion of available data from operational performance assessments. However, missing observations are common in these settings because of the nature of the assessment design. This paper describes a procedure for overcoming the computational and…
Descriptors: College Students, Data Analysis, Essay Tests, Generalizability Theory
Wolfe, Edward W.; Kao, Chi-Wen – 1996
The amount of variability contributed to large-scale performance assessment scores by raters is a constant concern for those who wish to use results from these assessments for educational decisions. This study approaches the problem by examining the behaviors of essay scorers who demonstrate different levels of proficiency with a holistic scoring…
Descriptors: Essay Tests, Experience, Holistic Approach, Judges
Wolfe, Edward W.; Kao, Chi-Wen – 1996
This paper reports the results of an analysis of the relationship between scorer behaviors and score variability. Thirty-six essay scorers were interviewed and asked to perform a think-aloud task as they scored 24 essays. Each comment made by a scorer was coded according to its content focus (i.e., appearance, assignment, mechanics, communication,…
Descriptors: Content Analysis, Educational Assessment, Essays, Evaluation Methods
Wolfe, Edward W.; Feltovich, Brian – 1994
This paper presents a model of scored cognition that incorporates two types of mental models: models of performance (i.e., the criteria for judging performance) and models of scoring (i.e., the procedural scripts for scoring an essay). In Study 1, six novice and five experienced scorers wrote definitions of three levels of a 6-point holistic…
Descriptors: Cognitive Processes, Criteria, Essays, Evaluation Methods
Peer reviewed
Direct link
Wolfe, Edward W.; Manalo, Jonathan R. – Language Learning & Technology, 2004
The Test of English as a Foreign Language (TOEFL) contains a direct writing assessment, and examinees are given the option of composing their responses at a computer terminal using a keyboard or composing their responses in handwriting. This study sought to determine whether performance on a direct writing assessment is comparable for examinees…
Descriptors: Writing Evaluation, Handwriting, Writing Tests, Computer Terminals