ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	4

Descriptor

Writing Tests	11
English (Second Language)	10
Second Language Learning	10
Scores	8
Computer Assisted Testing	6
Language Tests	6
Reliability	5
Cues	4
Generalizability Theory	4
Prompting	4
Scoring	4
Writing Evaluation	4
Correlation	3
Effect Size	3
Essays	3
Gender Differences	3
Comparative Analysis	2
Computer Software	2
Culture Fair Tests	2
Holistic Approach	2
Item Response Theory	2
Listening Skills	2
Regression (Statistics)	2
Speech	2
Test Construction	2

Source

ETS Research Report Series	4
Applied Linguistics	1
Applied Measurement in…	1
Educational Testing Service	1
International Journal of…	1

Author

Lee, Yong-Won	11
Kantor, Robert	6
Breland, Hunter	3
Gentile, Claudia	2
Mollaun, Pam	2
Muraki, Eiji	2
Broer, Markus	1
Najarian, Michelle	1
Powers, Don	1
Rizavi, Saba	1

Publication Type

Journal Articles	7
Reports - Research	7
Numerical/Quantitative Data	3
Reports - Evaluative	3
Speeches/Meeting Papers	3
Tests/Questionnaires	2
Reports - Descriptive	1

Education Level

Higher Education	1
Postsecondary Education	1

Location

Australia	1
Canada	1
Hong Kong	1
Mexico	1
Taiwan	1

Assessments and Surveys

Test of English as a Foreign…	8
Graduate Record Examinations	1

Showing all 11 results

Toward Automated Multi-Trait Scoring of Essays: Investigating Links among Holistic, Analytic, and Text Feature Scores

Peer reviewed

Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – Applied Linguistics, 2010

The main purpose of the study was to investigate the distinctness and reliability of analytic (or multi-trait) rating dimensions and their relationships to holistic scores and "e-rater"® essay feature variables in the context of the TOEFL® computer-based test (TOEFL CBT) writing assessment. Data analyzed in the study were holistic…

Descriptors: Writing Evaluation, Writing Tests, Scoring, Essays

Analytic Scoring of TOEFL® CBT Essays: Scores from Humans and "E-rater"®. TOEFL® Research Reports. RR-81. ETS RR-08-01

Peer reviewed

Lee, Yong-Won; Gentile, Claudia; Kantor, Robert – ETS Research Report Series, 2008

The main purpose of the study was to investigate the distinctness and reliability of analytic (or multitrait) rating dimensions and their relationships to holistic scores and "e-rater"® essay feature variables in the context of the TOEFL® computer-based test (CBT) writing assessment. Data analyzed in the study were analytic and holistic…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scoring

Investigating Uniform and Non-Uniform Gender DIF in Computer-Based ESL Writing Assessment

Peer reviewed

Breland, Hunter; Lee, Yong-Won – Applied Measurement in Education, 2007

The objective of the present investigation was to examine the comparability of writing prompts for different gender groups in the context of the computer-based Test of English as a Foreign Language™ (TOEFL®-CBT). A total of 87 prompts administered from July 1998 through March 2000 were analyzed. An extended version of logistic regression for…

Descriptors: Learning Theories, Writing Evaluation, Writing Tests, Second Language Learning

Evaluating Prototype Tasks and Alternative Rating Schemes for a New ESL Writing Test through G-Theory

Peer reviewed

Lee, Yong-Won; Kantor, Robert – International Journal of Testing, 2007

Possible integrated and independent tasks were pilot tested for the writing section of a new generation of the TOEFL® (Test of English as a Foreign Language™). This study examines the impact of various rating designs and of the number of tasks and raters on the reliability of writing scores based on integrated and independent tasks from the…

Descriptors: Generalizability Theory, Writing Tests, English (Second Language), Second Language Learning

Score Reliability as an Essential Prerequisite for Validating New Writing and Speaking Tasks for TOEFL.

Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002

This paper reports the results of generalizability theory (G) analyses done for new writing and speaking tasks for the Test of English as a Foreign Language (TOEFL). For writing, a special focus was placed on evaluating the impact on the reliability of the number of raters (or ratings) per essay (one or two) and the number of tasks (one, two, or…

Descriptors: English (Second Language), Generalizability Theory, Reliability, Scores

Dependability of New ESL Writing Test Scores: Evaluating Prototype Tasks and Alternative Rating Schemes. TOEFL® Monograph Series. MS-31. ETS RR-05-14

Peer reviewed

Lee, Yong-Won; Kantor, Robert – ETS Research Report Series, 2005

Possible integrated and independent tasks were pilot tested for the writing section of a new generation of TOEFL® (Test of English as a Foreign Language™) examination. This study examines the impact of various rating designs as well as the impact of the number of tasks and raters on the reliability of writing scores based on integrated and…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Writing Tests

The Essay Scoring and Scorer Reliability in TOEFL CBT.

Lee, Yong-Won – 2001

An essay test is now an integral part of the computer based Test of English as a Foreign Language (TOEFL-CBT). This paper provides a brief overview of the current TOEFL-CBT essay test, describes the operational procedures for essay scoring, including the Online Scoring Network (OSN) of the Educational Testing Service (ETS), and discusses major…

Descriptors: Computer Assisted Testing, English (Second Language), Essay Tests, Interrater Reliability

An Analysis of TOEFL CBT Writing Prompt Difficulty and Comparability for Different Gender Groups. Research Reports. Report 76. RR-04-05

Breland, Hunter; Lee, Yong-Won; Najarian, Michelle; Muraki, Eiji – Educational Testing Service, 2004

This investigation of the comparability of writing assessment prompts was conducted in two phases. In an exploratory Phase I, 47 writing prompts administered in the computer-based Test of English as a Foreign Language™ (TOEFL® CBT) from July through December 1998 were examined. Logistic regression procedures were used to estimate prompt…

Descriptors: Writing Evaluation, Quality Control, Gender Differences, Writing Tests

Score Dependability of the Writing and Speaking Sections of New TOEFL.

Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002

This study examines the score dependability of writing and speaking assessments from the Test of English as a Foreign Language (TOEFL) from the perspectives of univariate and multivariate generalizability theory (G-theory) and presents the findings of three separate G-theory studies. For writing, the focus was on evaluating the impact on…

Descriptors: Ability, English (Second Language), Generalizability Theory, Item Bias

Ensuring the Fairness of GRE Writing Prompts: Assessing Differential Difficulty. Research Report. ETS GRE Board Research Report No. 02-07R. ETS RR-05-11

Peer reviewed

Broer, Markus; Lee, Yong-Won; Rizavi, Saba; Powers, Don – ETS Research Report Series, 2005

Three polytomous DIF detection techniques--the Mantel test, logistic regression, and polySTAND--were used to identify GRE® Analytical Writing prompts ("Issue" and "Argument") that are differentially difficult for (a) female test takers; (b) African American, Asian, and Hispanic test takers; and (c) test takers whose strongest…

Descriptors: Culture Fair Tests, Item Response Theory, Test Items, Cues

Comparability of TOEFL CBT Writing Prompts for Different Native Language Groups. TOEFL® Research Reports. RR-77. ETS RR-04-24

Peer reviewed

Lee, Yong-Won; Breland, Hunter; Muraki, Eiji – ETS Research Report Series, 2004

This study has investigated the comparability of computer-based testing (CBT) writing prompts in the Test of English as a Foreign Language™ (TOEFL®) for examinees of different native language backgrounds. A total of 81 writing prompts introduced from July 1998 through August 2000 were examined using a three-step logistic regression procedure for…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing