ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	3

Descriptor

English (Second Language)	6
Generalizability Theory	6
Scores	6
Second Language Learning	5
Language Tests	4
Reliability	3
Test Construction	3
Writing Tests	3
Speech	2
Test Items	2
Ability	1
Communication (Thought…	1
Essay Tests	1
Evaluation Research	1
Foreign Countries	1
Hierarchical Linear Modeling	1
Item Bias	1
Item Response Theory	1
Japanese	1
Korean	1
Language Skills	1
Listening Comprehension Tests	1
Listening Skills	1
Multiple Regression Analysis	1
Multivariate Analysis	1
More ▼

Source

Language Testing	2
ETS Research Report Series	1
Language Assessment Quarterly	1

Author

Lee, Yong-Won	4
Kantor, Robert	3
Mollaun, Pam	2
Barkaoui, Khaled	1
Zhang, Su	1

Publication Type

Reports - Research	5
Journal Articles	4
Numerical/Quantitative Data	2
Speeches/Meeting Papers	2
Reports - Evaluative	1
Tests/Questionnaires	1

Education Level

Audience

Location

Australia	1
Canada	1
Hong Kong	1
Mexico	1
Taiwan	1

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	6
Test of English for…	1

What Works Clearinghouse Rating

Showing all 6 results Save | Export

Using Multilevel Modeling in Language Assessment Research: A Conceptual Introduction

Peer reviewed

Direct link

Barkaoui, Khaled – Language Assessment Quarterly, 2013

This article critiques traditional single-level statistical approaches (e.g., multiple regression analysis) to examining relationships between language test scores and variables in the assessment setting. It highlights the conceptual, methodological, and statistical problems associated with these techniques in dealing with multilevel or nested…

Descriptors: Hierarchical Linear Modeling, Statistical Analysis, Multiple Regression Analysis, Generalizability Theory

Dependability of Scores for a New ESL Speaking Assessment Consisting of Integrated and Independent Tasks

Peer reviewed

Direct link

Lee, Yong-Won – Language Testing, 2006

A multitask speaking measure consisting of both integrated and independent tasks is expected to be an important component of a new version of the TOEFL test. This study considered two critical issues concerning score dependability of the new speaking measure: How much would the score dependability be impacted by (1) combining scores on different…

Descriptors: Language Tests, Second Language Learning, English (Second Language), Generalizability Theory

Investigating the Relative Effects of Persons, Items, Sections, and Languages on TOEIC Score Dependability

Peer reviewed

Direct link

Zhang, Su – Language Testing, 2006

This study applied generalizability theory to investigate the contributions of persons, items, sections, and language backgrounds to the score dependability of the Test of English for International Communication (TOEIC). I replicated and extended Brown's (1999) study of the Test of English as a Foreign Language (TOEFL), using data from two…

Descriptors: Communication (Thought Transfer), Generalizability Theory, English (Second Language), Scores

Score Reliability as an Essential Prerequisite for Validating New Writing and Speaking Tasks for TOEFL.

Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002

This paper reports the results of generalizability theory (G) analyses done for new writing and speaking tasks for the Test of English as a Foreign Language (TOEFL). For writing, a special focus was placed on evaluating the impact on the reliability of the number of raters (or ratings) per essay (one or two) and the number of tasks (one, two, or…

Descriptors: English (Second Language), Generalizability Theory, Reliability, Scores

Dependability of New ESL Writing Test Scores: Evaluating Prototype Tasks and Alternative Rating Schemes. TOEFL® Monograph Series. MS-31. ETS RR-05-14

Peer reviewed
PDF on ERIC

Download full text

Lee, Yong-Won; Kantor, Robert – ETS Research Report Series, 2005

Possible integrated and independent tasks were pilot tested for the writing section of a new generation of TOEFL® (Test of English as a Foreign Language™) examination. This study examines the impact of various rating designs as well as the impact of the number of tasks and raters on the reliability of writing scores based on integrated and…

Descriptors: Language Tests, English (Second Language), Second Language Learning, Writing Tests

Score Dependability of the Writing and Speaking Sections of New TOEFL.

Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002

This study examines the score dependability of writing and speaking assessments from the Test of English as a Foreign Language (TOEFL) from the perspectives of univariate and multivariate generalizability theory (G-theory) and presents the findings of three separate G-theory studies. For writing, the focus was on evaluating the impact on…

Descriptors: Ability, English (Second Language), Generalizability Theory, Item Bias