Showing all 11 results
Peer reviewed
Wang, Jue; Engelhard, George, Jr. – Educational and Psychological Measurement, 2019
The purpose of this study is to explore the use of unfolding models for evaluating the quality of ratings obtained in rater-mediated assessments. Two different judgmental processes can be used to conceptualize ratings: impersonal judgments and personal preferences. Impersonal judgments are typically expected in rater-mediated assessments, and…
Descriptors: Evaluative Thinking, Preferences, Evaluators, Models
Peer reviewed
Wind, Stefanie A.; Engelhard, George, Jr. – Educational and Psychological Measurement, 2016
Mokken scale analysis is a probabilistic nonparametric approach that offers statistical and graphical tools for evaluating the quality of social science measurement without placing potentially inappropriate restrictions on the structure of a data set. In particular, Mokken scaling provides a useful method for evaluating important measurement…
Descriptors: Nonparametric Statistics, Statistical Analysis, Measurement, Psychometrics
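For context on the entry above: Mokken scale analysis typically summarizes scale quality with Loevinger's scalability coefficient H. A minimal sketch of the standard definition (the notation is supplied here for illustration, not quoted from the article):

\[ H = 1 - \frac{\sum_{i<j} F_{ij}}{\sum_{i<j} E_{ij}} \]

where F_{ij} is the observed number of Guttman errors for item pair (i, j) and E_{ij} is the number expected under marginal independence. By the usual convention, H ≥ 0.3 indicates a weak scale, H ≥ 0.4 a medium scale, and H ≥ 0.5 a strong scale.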
Peer reviewed
Wang, Jue; Engelhard, George, Jr.; Wolfe, Edward W. – Educational and Psychological Measurement, 2016
The number of performance assessments continues to increase around the world, and it is important to explore new methods for evaluating the quality of ratings obtained from raters. This study describes an unfolding model for examining rater accuracy. Accuracy is defined as the difference between observed and expert ratings. Dichotomous accuracy…
Descriptors: Evaluators, Accuracy, Performance Based Assessment, Models
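To make the accuracy definition in the entry above concrete, a minimal sketch of a dichotomous accuracy score, with illustrative notation not drawn from the article:

\[ A_{ij} = \begin{cases} 1 & \text{if } X_{ij} = E_j \\ 0 & \text{otherwise} \end{cases} \]

where X_{ij} is rater i's observed rating of performance j and E_j is the corresponding expert rating; the unfolding model is then fit to the accuracy scores A_{ij}.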
Peer reviewed
Engelhard, George, Jr.; Kobrin, Jennifer L.; Wind, Stefanie A. – International Journal of Testing, 2014
The purpose of this study is to explore patterns in model-data fit related to subgroups of test takers from a large-scale writing assessment. Using data from the SAT, a calibration group was randomly selected to represent test takers who reported that English was their best language from the total population of test takers (N = 322,011). A…
Descriptors: College Entrance Examinations, Writing Tests, Goodness of Fit, English
Peer reviewed
Behizadeh, Nadia; Engelhard, George, Jr. – Assessing Writing, 2011
The purpose of this study is to examine the interactions among measurement theories, writing theories, and writing assessments in the United States from a historical perspective. The assessment of writing provides a useful framework for examining how theories influence, and in some cases fail to influence, actual practice. Two research traditions…
Descriptors: Writing (Composition), Intellectual Disciplines, Writing Evaluation, Writing Tests
Gyagenda, Ismail S.; Engelhard, George, Jr. – 1998
The purpose of this study was to examine rater, domain, and gender influences on the assessed quality of student writing using weighted and unweighted scores. Twenty raters were randomly selected from a group of 87 operational raters contracted to rate essays as part of the 1993 field test of the Georgia High School Writing Test. All of the raters…
Descriptors: Essay Tests, Evaluators, High School Students, High Schools
Gyagenda, Ismail S.; Engelhard, George, Jr. – 1998
The purpose of this study was to describe the Rasch model for measurement and apply the model to examine the relationship between raters, domains of written compositions, and student writing ability. Twenty raters were randomly selected from a group of 87 operational raters contracted to rate essays as part of the 1993 field test of the Georgia…
Descriptors: Difficulty Level, Essay Tests, Evaluators, High School Students
Engelhard, George, Jr. – 1991
A many-faceted Rasch model (FACETS) is presented for the measurement of writing ability. The FACETS model is a multivariate extension of Rasch measurement models that can be used to provide a framework for calibrating both raters and writing tasks within the context of writing assessment. A FACETS model is described based on the current procedures…
Descriptors: Grade 8, Holistic Evaluation, Interrater Reliability, Item Response Theory
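For readers unfamiliar with the FACETS model named in the entry above, a minimal sketch of the standard many-faceted Rasch formulation (notation assumed for illustration, not quoted from the paper):

\[ \log\frac{P_{nijk}}{P_{nij(k-1)}} = \theta_n - \lambda_i - \delta_j - \tau_k \]

where \theta_n is the writing ability of student n, \lambda_i is the severity of rater i, \delta_j is the difficulty of writing task j, and \tau_k is the threshold between rating categories k−1 and k.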
Peer reviewed
Engelhard, George, Jr. – Journal of Educational Measurement, 1994
Rater errors (rater severity, halo effect, central tendency, and restriction of range) are described, and criteria are presented for evaluating rating quality based on a many-faceted Rasch (FACETS) model. Ratings of 264 compositions from the Eighth Grade Writing Test in Georgia by 15 raters illustrate the discussion. (SLD)
Descriptors: Criteria, Educational Assessment, Elementary Education, Elementary School Students
Peer reviewed
Engelhard, George, Jr.; And Others – Journal of Educational Research, 1994
The influences of writing tasks and gender on the quality of student writing of black and white eighth graders were examined. Data from statewide writing assessments of 170,899 Georgia students indicated both writing tasks and student characteristics were significant predictors of writing quality. There were both racial and gender differences. (SM)
Descriptors: Blacks, Grade 8, Junior High School Students, Junior High Schools
Peer reviewed
Engelhard, George, Jr. – Applied Measurement in Education, 1992
A Many-Faceted Rasch Model (FACETS) for measurement of writing ability is described, and its use in solving measurement problems in large-scale assessment is illustrated with a random sample of 1,000 students from Georgia's Eighth Grade Writing Test. It is a promising approach to assessment through written compositions. (SLD)
Descriptors: Educational Assessment, Essays, Evaluation Problems, Grade 8