ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	17
Since 2016 (last 10 years)	56
Since 2006 (last 20 years)	109

Descriptor

Language Tests	121
English (Second Language)	107
Second Language Learning	102
Scores	60
Language Proficiency	48
Foreign Countries	44
Computer Assisted Testing	41
Correlation	35
Statistical Analysis	30
Scoring	28
Test Validity	24
Test Items	20
Oral Language	19
Writing Tests	19
Comparative Analysis	17
Language Usage	16
Test Construction	16
Regression (Statistics)	15
Models	14
Reading Tests	14
Second Language Instruction	14
Task Analysis	14
Factor Analysis	13
Difficulty Level	12
Foreign Students	12
More ▼

Source

ETS Research Report Series

121

Publication Type

Journal Articles	121
Reports - Research	117
Tests/Questionnaires	32
Reports - Descriptive	4
Numerical/Quantitative Data	3
Information Analyses	2
Speeches/Meeting Papers	1

Education Level

Higher Education	31
Postsecondary Education	28
Secondary Education	18
Junior High Schools	8
Middle Schools	8
High Schools	7
Elementary Education	6
Elementary Secondary Education	6
Grade 7	2
Grade 8	2
Early Childhood Education	1
Grade 10	1
Grade 11	1
Grade 12	1
Grade 6	1
Grade 9	1
Intermediate Grades	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Location

Japan	9
China	8
South Korea	5
Taiwan	5
Asia	4
Canada	4
Colombia	4
Indiana	4
Iowa	4
Mexico	4
Australia	3
Georgia	3
Germany	3
Poland	3
United States	3
Arizona	2
Armenia	2
Bulgaria	2
California (Los Angeles)	2
Chile	2
Croatia	2
France	2
India	2
Italy	2
Latin America	2
More ▼

Laws, Policies, & Programs

Every Student Succeeds Act…

Assessments and Surveys

Test of English as a Foreign…	77
Test of English for…	18
Graduate Management Admission…	3
Graduate Record Examinations	3
International English…	2
Praxis Series	2
Michigan Test of English…	1
SAT (College Admission Test)	1

What Works Clearinghouse Rating

ETS Research Report Series X

Showing 1 to 15 of 121 results Save | Export

Interpretation and Use of a Workplace English Language Proficiency Test Score Report: Perspectives of TOEIC[R] Test Takers and Score Users in Taiwan. RR-23-10

Peer reviewed
PDF on ERIC

Download full text

Ching-Ni Hsieh – ETS Research Report Series, 2023

Research in validity suggests that stakeholders' interpretation and use of test results should be an aspect of validity. Claims about the meaningfulness of test score interpretations and consequences of test use should be backed by evidence that stakeholders understand the definition of the construct assessed and the score report information. The…

Descriptors: Foreign Countries, Language Proficiency, English (Second Language), Language Tests

Model Adequacy Checking for Applying Harmonic Regression to Assessment Quality Control. Research Report. ETS RR-21-13

Peer reviewed
PDF on ERIC

Download full text

Qian, Jiahe; Li, Shuhong – ETS Research Report Series, 2021

In recent years, harmonic regression models have been applied to implement quality control for educational assessment data consisting of multiple administrations and displaying seasonality. As with other types of regression models, it is imperative that model adequacy checking and model fit be appropriately conducted. However, there has been no…

Descriptors: Models, Regression (Statistics), Language Tests, Quality Control

Mapping "TOEFL® Essentials"™ Test Scores to the Canadian Language Benchmarks. "TOEFL"® Research Report. TOEFL-RR-100. ETS Research Report No. RR-22-16

Peer reviewed
PDF on ERIC

Download full text

Papageorgiou, Spiros; Davis, Larry; Ohta, Renka; Gomez, Pablo Garcia – ETS Research Report Series, 2022

In this research report, we describe a study to map the scores of the "TOEFL® Essentials"™ test to the Canadian Language Benchmarks (CLB). The TOEFL Essentials test is a four-skills assessment of foundational English language skills and communication abilities in academic and general (daily life) contexts. At the time of writing this…

Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning

Evaluating the Use and Interpretation of the TOEIC[R] Listening and Reading Test Score Report: Perspectives of Test Takers in Japan. RR-23-02

Peer reviewed
PDF on ERIC

Download full text

Ching-Ni Hsieh – ETS Research Report Series, 2023

Researchers suggest that claims about the meaningfulness of test score interpretations and consequences of test use should be backed by evidence that stakeholders understand the definition of the construct assessed (meaningfulness) and score reports (consequences). Evaluation of stakeholders' actual uses and interpretations of score reports in…

Descriptors: Reading Tests, Listening Comprehension, Foreign Countries, English (Second Language)

Simulating Real-World Context in an Email Writing Task: Implications for Task-Based Language Assessment. Research Report. ETS RR-23-05

Peer reviewed
PDF on ERIC

Download full text

John M. Norris; Shoko Sasayama; Michelle Kim – ETS Research Report Series, 2023

Accomplishing a communication task in the real world requires the ability not only to do the task per se but also to manage aspects of the context in which it occurs. For this reason, simulations of target language use contexts have been incorporated into the design of communicative language tests as a way of enhancing the authenticity of…

Descriptors: Electronic Mail, Writing (Composition), Task Analysis, Student Evaluation

Evaluating the New "TOEFL ITP"® Speaking Test: Insights from Field Test Takers. "TOEFL"® Research Report. TOEFL-RR-99. ETS Research Report No. RR-22-10

Peer reviewed
PDF on ERIC

Download full text

Lee, Shinhye – ETS Research Report Series, 2022

In response to the calls for making key stakeholders' perspectives relevant in the test validation process, the study discussed in this report sought test-taker feedback as part of collecting validity evidence and supporting the ongoing field testing efforts of the new "TOEFL ITP"® Speaking section. Specifically, I aimed to investigate…

Descriptors: English (Second Language), Second Language Learning, Language Tests, Test Validity

The Impact of Using Synthetically Generated Listening Stimuli on Test-Taker Performance: A Case Study with Multiple-Choice, Single-Selection Items. TOEFL® Research Reports. RR-98. ETS?RR-22-05

Peer reviewed
PDF on ERIC

Download full text

Choi, Ikkyu; Zu, Jiyun – ETS Research Report Series, 2022

Synthetically generated speech (SGS) has become an integral part of our oral communication in a wide variety of contexts. It can be generated instantly at a low cost and allows precise control over multiple aspects of output, all of which can be highly appealing to second language (L2) assessment developers who have traditionally relied upon human…

Descriptors: Test Wiseness, Multiple Choice Tests, Test Items, Difficulty Level

New Validity Evidence on the "TOEFL Junior"® Standard Test as a Measure of Progress. TOEFL® Research Report. RR-95. ETS RR-21-19

Peer reviewed
PDF on ERIC

Download full text

Madyarov, Irshat; Movsisyan, Vahe; Madoyan, Habet; Galikyan, Irena; Gasparyan, Rubina – ETS Research Report Series, 2021

The "TOEFL Junior"® Standard test is a tool for measuring the English language skills of students ages 11+ who learn English as an additional language. It is a paper-based multiple-choice test and measures proficiency in three sections: listening, form and meaning, and reading. To date, empirical evidence provides some support for the…

Descriptors: English (Second Language), Second Language Learning, Language Tests, Standardized Tests

Making the Case for the Quality and Use of a New Language Proficiency Assessment: Validity Argument for the Redesigned "TOEIC Bridge"® Tests. Research Report. ETS RR-21-20

Peer reviewed
PDF on ERIC

Download full text

Schmidgall, Jonathan; Cid, Jaime; Carter Grissom, Elizabeth; Li, Lucy – ETS Research Report Series, 2021

The redesigned "TOEIC Bridge"® tests were designed to evaluate test takers' English listening, reading, speaking, and writing skills in the context of everyday adult life. In this paper, we summarize the initial validity argument that supports the use of test scores for the purpose of selection, placement, and evaluation of a test…

Descriptors: Language Tests, Second Language Learning, English (Second Language), Language Proficiency

The Effects of Extended Planning Time on Candidates' Performance, Processes, and Strategy Use in the Lecture Listening-into-Speaking Tasks of the "TOEFL iBT"® Test. TOEFL® Research Report. RR-93. ETS RR-21-09

Peer reviewed
PDF on ERIC

Download full text

Inoue, Chihiro; Lam, Daniel M. K. – ETS Research Report Series, 2021

This study investigated the effects of two different planning time conditions (i.e., operational [20 s] and extended length [90 s]) for the lecture listening-into-speaking tasks of the "TOEFL iBT"® test for candidates at different proficiency levels. Seventy international students based in universities and language schools in the United…

Descriptors: Foreign Countries, English (Second Language), Language Tests, Second Language Learning

Building a Validity Argument While Developing and Using an Assessment: A Concurrent Approach for the "Winsight"® Summative Assessment. Research Report. ETS RR-19-26

Peer reviewed
PDF on ERIC

Download full text

Stone, Elizabeth; Wylie, E. Caroline – ETS Research Report Series, 2019

We describe the summative assessment component within a K-12 assessment program and our development of a validity argument to support its claims with respect to intended uses and interpretations. First, we describe the "Winsight"® assessment program theory of action, a logic model elucidating mechanisms for how use of the assessment…

Descriptors: Summative Evaluation, Educational Assessment, Test Validity, Test Use

Grouping Effects on Jackknifed Variance Estimation for Item Response Theory Scaling and Equating with Cluster-Based Assessment Data. Research Report. ETS RR-18-16

Peer reviewed
PDF on ERIC

Download full text

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2018

Educational assessment data are often collected from a set of test centers across various geographic regions, and therefore the data samples contain clusters. Such cluster-based data may result in clustering effects in variance estimation. However, in many grouped jackknife variance estimation applications, jackknife groups are often formed by a…

Descriptors: Item Response Theory, Scaling, Equated Scores, Cluster Grouping

Exploring "GRE"® and "TOEFL"® Score Profiles of International Students Intending to Pursue a Graduate Degree in the United States. Research Report. ETS RR-22-02

Peer reviewed
PDF on ERIC

Download full text

Roohr, Katrina; Olivera-Aguilar, Margarita; Bochenek, Jennifer; Belur, Vinetha – ETS Research Report Series, 2022

The United States continues to be a top destination for international students pursuing an advanced degree. Some information about the characteristics of international students applying to graduate programs in the United States is available, but little is known about how these characteristics are related to test taker performance on graduate…

Descriptors: College Entrance Examinations, Graduate Study, Language Tests, English (Second Language)

The Redesigned "TOEIC Bridge"® Tests: Relations to Test-Taker Perceptions of Proficiency in English. Research Report. ETS RR-20-07

Peer reviewed
PDF on ERIC

Download full text

Schmidgall, Jonathan – ETS Research Report Series, 2020

The redesigned four-skills "TOEIC Bridge"® tests were designed to measure the listening, reading, speaking, and writing proficiency of beginning to low--intermediate English learners in the context of everyday life. In this paper, I describe two studies that were conducted to investigate claims about the meaningfulness of redesigned…

Descriptors: Language Tests, Second Language Learning, English (Second Language), Foreign Countries

An Evaluation of the Single-Group Growth Model as an Alternative to Common-Item Equating. Research Report. ETS RR-16-01

Peer reviewed
PDF on ERIC

Download full text

Wei, Youhua; Morgan, Rick – ETS Research Report Series, 2016

As an alternative to common-item equating when common items do not function as expected, the single-group growth model (SGGM) scaling uses common examinees or repeaters to link test scores on different forms. The SGGM scaling assumes that, for repeaters taking adjacent administrations, the conditional distribution of scale scores in later…

Descriptors: Equated Scores, Growth Models, Scaling, Computation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9

Guzman-Orth, Danielle	5
Lee, Yong-Won	5
Papageorgiou, Spiros	5
Deane, Paul	4
Evanini, Keelan	4
Haberman, Shelby J.	4
Hauck, Maurice Cogan	4
Powers, Donald E.	4
Schmidgall, Jonathan	4
Tannenbaum, Richard J.	4
Attali, Yigal	3
Bejar, Isaac I.	3
Davis, Larry	3
Gomez, Pablo Garcia	3
Gu, Lin	3
Horák, Tania	3
Kantor, Robert	3
Ling, Guangming	3
Lopez, Alexis A.	3
Qian, Jiahe	3
Sawaki, Yasuyo	3
Sinharay, Sandip	3
Tolentino, Florencia	3
Wall, Dianne	3
Wang, Xinhao	3
More ▼