Showing 1 to 15 of 121 results
Peer reviewed
Ching-Ni Hsieh – ETS Research Report Series, 2023
Research in validity suggests that stakeholders' interpretation and use of test results should be an aspect of validity. Claims about the meaningfulness of test score interpretations and consequences of test use should be backed by evidence that stakeholders understand the definition of the construct assessed and the score report information. The…
Descriptors: Foreign Countries, Language Proficiency, English (Second Language), Language Tests
Peer reviewed
Qian, Jiahe; Li, Shuhong – ETS Research Report Series, 2021
In recent years, harmonic regression models have been applied to implement quality control for educational assessment data consisting of multiple administrations and displaying seasonality. As with other types of regression models, it is imperative that model adequacy checking and model fit be appropriately conducted. However, there has been no…
Descriptors: Models, Regression (Statistics), Language Tests, Quality Control
Peer reviewed
Papageorgiou, Spiros; Davis, Larry; Ohta, Renka; Gomez, Pablo Garcia – ETS Research Report Series, 2022
In this research report, we describe a study to map the scores of the "TOEFL® Essentials"™ test to the Canadian Language Benchmarks (CLB). The TOEFL Essentials test is a four-skills assessment of foundational English language skills and communication abilities in academic and general (daily life) contexts. At the time of writing this…
Descriptors: Foreign Countries, Language Tests, English (Second Language), Second Language Learning
Peer reviewed
Ching-Ni Hsieh – ETS Research Report Series, 2023
Researchers suggest that claims about the meaningfulness of test score interpretations and consequences of test use should be backed by evidence that stakeholders understand the definition of the construct assessed (meaningfulness) and score reports (consequences). Evaluation of stakeholders' actual uses and interpretations of score reports in…
Descriptors: Reading Tests, Listening Comprehension, Foreign Countries, English (Second Language)
Peer reviewed
John M. Norris; Shoko Sasayama; Michelle Kim – ETS Research Report Series, 2023
Accomplishing a communication task in the real world requires the ability not only to do the task per se but also to manage aspects of the context in which it occurs. For this reason, simulations of target language use contexts have been incorporated into the design of communicative language tests as a way of enhancing the authenticity of…
Descriptors: Electronic Mail, Writing (Composition), Task Analysis, Student Evaluation
Peer reviewed
Lee, Shinhye – ETS Research Report Series, 2022
In response to the calls for making key stakeholders' perspectives relevant in the test validation process, the study discussed in this report sought test-taker feedback as part of collecting validity evidence and supporting the ongoing field testing efforts of the new "TOEFL ITP"® Speaking section. Specifically, I aimed to investigate…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Test Validity
Peer reviewed
Choi, Ikkyu; Zu, Jiyun – ETS Research Report Series, 2022
Synthetically generated speech (SGS) has become an integral part of our oral communication in a wide variety of contexts. It can be generated instantly at a low cost and allows precise control over multiple aspects of output, all of which can be highly appealing to second language (L2) assessment developers who have traditionally relied upon human…
Descriptors: Test Wiseness, Multiple Choice Tests, Test Items, Difficulty Level
Peer reviewed
Madyarov, Irshat; Movsisyan, Vahe; Madoyan, Habet; Galikyan, Irena; Gasparyan, Rubina – ETS Research Report Series, 2021
The "TOEFL Junior"® Standard test is a tool for measuring the English language skills of students ages 11+ who learn English as an additional language. It is a paper-based multiple-choice test and measures proficiency in three sections: listening, form and meaning, and reading. To date, empirical evidence provides some support for the…
Descriptors: English (Second Language), Second Language Learning, Language Tests, Standardized Tests
Peer reviewed
Schmidgall, Jonathan; Cid, Jaime; Carter Grissom, Elizabeth; Li, Lucy – ETS Research Report Series, 2021
The redesigned "TOEIC Bridge"® tests were designed to evaluate test takers' English listening, reading, speaking, and writing skills in the context of everyday adult life. In this paper, we summarize the initial validity argument that supports the use of test scores for the purpose of selection, placement, and evaluation of a test…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Language Proficiency
Peer reviewed
Inoue, Chihiro; Lam, Daniel M. K. – ETS Research Report Series, 2021
This study investigated the effects of two different planning time conditions (i.e., operational [20 s] and extended length [90 s]) for the lecture listening-into-speaking tasks of the "TOEFL iBT"® test for candidates at different proficiency levels. Seventy international students based in universities and language schools in the United…
Descriptors: Foreign Countries, English (Second Language), Language Tests, Second Language Learning
Peer reviewed
Stone, Elizabeth; Wylie, E. Caroline – ETS Research Report Series, 2019
We describe the summative assessment component within a K-12 assessment program and our development of a validity argument to support its claims with respect to intended uses and interpretations. First, we describe the "Winsight"® assessment program theory of action, a logic model elucidating mechanisms for how use of the assessment…
Descriptors: Summative Evaluation, Educational Assessment, Test Validity, Test Use
Peer reviewed
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2018
Educational assessment data are often collected from a set of test centers across various geographic regions, and therefore the data samples contain clusters. Such cluster-based data may result in clustering effects in variance estimation. However, in many grouped jackknife variance estimation applications, jackknife groups are often formed by a…
Descriptors: Item Response Theory, Scaling, Equated Scores, Cluster Grouping
Peer reviewed
Roohr, Katrina; Olivera-Aguilar, Margarita; Bochenek, Jennifer; Belur, Vinetha – ETS Research Report Series, 2022
The United States continues to be a top destination for international students pursuing an advanced degree. Some information about the characteristics of international students applying to graduate programs in the United States is available, but little is known about how these characteristics are related to test taker performance on graduate…
Descriptors: College Entrance Examinations, Graduate Study, Language Tests, English (Second Language)
Peer reviewed
Schmidgall, Jonathan – ETS Research Report Series, 2020
The redesigned four-skills "TOEIC Bridge"® tests were designed to measure the listening, reading, speaking, and writing proficiency of beginning to low-intermediate English learners in the context of everyday life. In this paper, I describe two studies that were conducted to investigate claims about the meaningfulness of redesigned…
Descriptors: Language Tests, Second Language Learning, English (Second Language), Foreign Countries
Peer reviewed
Wei, Youhua; Morgan, Rick – ETS Research Report Series, 2016
As an alternative to common-item equating when common items do not function as expected, the single-group growth model (SGGM) scaling uses common examinees or repeaters to link test scores on different forms. The SGGM scaling assumes that, for repeaters taking adjacent administrations, the conditional distribution of scale scores in later…
Descriptors: Equated Scores, Growth Models, Scaling, Computation