ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	23
Since 2006 (last 20 years)	48

Descriptor

English (Second Language)	74
Language Tests	70
Second Language Learning	58
Test Reliability	41
Scores	32
Interrater Reliability	28
Test Validity	27
Scoring	25
Foreign Countries	24
Computer Assisted Testing	22
Language Proficiency	22
Correlation	18
Evaluators	16
Reliability	15
Test Construction	15
Oral Language	14
Statistical Analysis	14
Writing Tests	14
Test Items	12
Second Language Instruction	11
Comparative Analysis	10
Essay Tests	10
Reading Tests	9
Factor Analysis	8
Native Speakers	8
More ▼

Publication Type

Reports - Research	59
Journal Articles	56
Tests/Questionnaires	11
Reports - Evaluative	9
Speeches/Meeting Papers	9
Numerical/Quantitative Data	4
Reports - Descriptive	4
Information Analyses	2
Dissertations/Theses -…	1
Guides - Classroom - Teacher	1

Education Level

Higher Education	20
Postsecondary Education	17
Secondary Education	5
High Schools	3
Elementary Education	2
Grade 12	2
Grade 10	1
Grade 11	1
Grade 6	1
Grade 7	1
Grade 8	1
Grade 9	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
More ▼

Audience

Researchers

Location

Iran	9
Germany	3
Japan	3
Canada	2
China	2
India	2
Mexico	2
United States	2
Australia	1
Colombia	1
Dominican Republic	1
France	1
Hong Kong	1
Italy	1
Japan (Tokyo)	1
Jordan	1
Kenya	1
Michigan	1
North America	1
Pennsylvania (Philadelphia)	1
Saudi Arabia	1
South Korea	1
Switzerland	1
Taiwan	1
Turkey	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	78
International English…	6
Graduate Record Examinations	4
Test of English for…	4
Graduate Management Admission…	2
SAT (College Admission Test)	2
ACTFL Oral Proficiency…	1
Computer Attitude Scale	1
English Proficiency Test	1
Law School Admission Test	1
Medical College Admission Test	1
Program for International…	1
Strategy Inventory for…	1
Test of Written English	1
More ▼

What Works Clearinghouse Rating

Test of English as a Foreign Language X

Showing 1 to 15 of 78 results Save | Export

Complementary Strengths? Evaluation of a Hybrid Human-Machine Scoring Approach for a Test of Oral Academic English

Peer reviewed

Direct link

Davis, Larry; Papageorgiou, Spiros – Assessment in Education: Principles, Policy & Practice, 2021

Human raters and machine scoring systems potentially have complementary strengths in evaluating language ability; specifically, it has been suggested that automated systems might be used to make consistent measurements of specific linguistic phenomena, whilst humans evaluate more global aspects of performance. We report on an empirical study that…

Descriptors: Scoring, English for Academic Purposes, Oral English, Speech Tests

Of Standardized Student Measurements and Tests in the Dominican Republic

Download full text

Tavarez Da Costa, Pedro; Reyes Arias, Fransheska – Online Submission, 2021

The present work seeks to establish a comparison between two different and distant evaluation tools applied to the Dominican student population in order to measure the efficiency of our educational system in the recent years, one of them measured the quality of Dominican education in three areas (the PISA Test), whereas the other tested the…

Descriptors: Foreign Countries, Standardized Tests, Student Evaluation, International Assessment

Using Statistical Transformation Methods to Explore Speech Perception Scale Lengths

Peer reviewed
PDF on ERIC

Download full text

Kermad, Alyssa; Bogorevich, Valeria – Language Teaching Research Quarterly, 2022

The practice of second language (L2) speech perception has traditionally relied on equal-interval perceptual scales and novice listeners' (NLs) impressionistic judgments of constructs such as accentedness and comprehensibility (Munro & Derwing, 2011). However, issues have surfaced with respect to how well NLs can use these scales, whether they…

Descriptors: Speech Communication, Second Language Learning, Intelligibility, Rating Scales

Analysis of IELTS and TOEFL Reading and Listening Tests in Terms of Revised Bloom's Taxonomy

Peer reviewed

Direct link

Baghaei, Samira; Bagheri, Mohammad Sadegh; Yamini, Mortaza – Cogent Education, 2020

The main purpose of this quantitative-qualitative content analysis study was to compare IELTS and TOEFL listening and reading tests based on the representation of the learning objectives of Revised Bloom's taxonomy. To this end, 12 Academic IELTS listening and reading tests and 12 TOEFL iBT listening and reading tests were analyzed qualitatively…

Descriptors: Second Language Learning, English (Second Language), Language Tests, Reading Tests

Adaptation and Assessment of a Public Speaking Rating Scale

Peer reviewed

Direct link

Iberri-Shea, Gina – Cogent Education, 2017

Prominent spoken language assessments such as the Oral Proficiency Interview and the Test of Spoken English have been primarily concerned with speaking ability as it relates to conversation. This paper looks at an additional aspect of spoken language ability, namely public speaking. This study used an adapted form of a public speaking rating scale…

Descriptors: Public Speaking, Rating Scales, Adoption (Ideas), English Instruction

Rater Dominance in Discussion as a Resolution Method

Peer reviewed
PDF on ERIC

Download full text

Ahmadi, Alireza – Taiwan Journal of TESOL, 2020

Rater subjectivity has long been an intriguing topic. The use of discussion as a resolution method is a practical way to reduce this subjectivity. However, the efficacy of discussion depends on whether different raters get equally engaged in it or one rater tends to dominate others. This study investigated whether and how rater dominance occurs in…

Descriptors: Evaluators, Interrater Reliability, Discussion, Discourse Analysis

Developing an Innovative Elicited Imitation Task for Efficient English Proficiency Assessment. TOEFL® Research Report. RR-96. ETS RR-21-24

Peer reviewed
PDF on ERIC

Download full text

Davis, Larry; Norris, John – ETS Research Report Series, 2021

The elicited imitation task (EIT), in which language learners listen to a series of spoken sentences and repeat each one verbatim, is a commonly used measure of language proficiency in second language acquisition research. The "TOEFL® Essentials"™ test includes an EIT as a holistic measure of speaking proficiency, referred to as the…

Descriptors: Task Analysis, Language Proficiency, Speech Communication, Language Tests

Computerized Testing in Reading Comprehension Skill: Investigating Score Interchangeability, Item Review, Age and Gender Stereotypes, ICT Literacy and Computer Attitudes

Peer reviewed

Direct link

Toroujeni, Seyyed Morteza Hashemi – Education and Information Technologies, 2022

Score interchangeability of Computerized Fixed-Length Linear Testing (henceforth CFLT) and Paper-and-Pencil-Based Testing (henceforth PPBT) has become a controversial issue over the last decade when technology has meaningfully restructured methods of the educational assessment. Given this controversy, various testing guidelines published on…

Descriptors: Computer Assisted Testing, Reading Tests, Reading Comprehension, Scoring

Automated Essay Scoring at Scale: A Case Study in Switzerland and Germany. TOEFL® Research Report. RR-86. ETS RR-19-12

Peer reviewed
PDF on ERIC

Download full text

Rupp, André A.; Casabianca, Jodi M.; Krüger, Maleika; Keller, Stefan; Köller, Olaf – ETS Research Report Series, 2019

In this research report, we describe the design and empirical findings for a large-scale study of essay writing ability with approximately 2,500 high school students in Germany and Switzerland on the basis of 2 tasks with 2 associated prompts, each from a standardized writing assessment whose scoring involved both human and automated components.…

Descriptors: Automation, Foreign Countries, English (Second Language), Language Tests

For a Greater Good: Bias Analysis in Writing Assessment

Peer reviewed

Direct link

Ahmadi Shirazi, Masoumeh – SAGE Open, 2019

Threats to construct validity should be reduced to a minimum. If true, sources of bias, namely raters, items, tests as well as gender, age, race, language background, culture, and socio-economic status need to be spotted and removed. This study investigates raters' experience, language background, and the choice of essay prompt as potential…

Descriptors: Foreign Countries, Language Tests, Test Bias, Essay Tests

Adding Value to Second-Language Listening and Reading Subscores: Using a Score Augmentation Approach

Peer reviewed

Direct link

Papageorgiou, Spiros; Choi, Ikkyu – International Journal of Testing, 2018

This study examined whether reporting subscores for groups of items within a test section assessing a second-language modality (specifically reading or listening comprehension) added value from a measurement perspective to the information already provided by the section scores. We analyzed the responses of 116,489 test takers to reading and…

Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Language Tests

Developing and Validating Band Levels and Descriptors for Reporting Overall Examinee Performance

Peer reviewed

Direct link

Papageorgiou, Spiros; Xi, Xiaoming; Morgan, Rick; So, Youngsoon – Language Assessment Quarterly, 2015

This study presents the development and empirical validation of score levels and descriptors specifically designed for reporting purposes to provide test takers with more than just a number on a score scale. In the context of a test primarily intended for 11- to 15-year-old students learning English as a second/foreign language, the study examined…

Descriptors: Scores, Validity, Scaling, Classification

Use of Automated Scoring in Spoken Language Assessments for Test Takers with Speech Impairments. Research Report. ETS RR-17-42

Peer reviewed
PDF on ERIC

Download full text

Loukina, Anastassia; Buzick, Heather – ETS Research Report Series, 2017

This study is an evaluation of the performance of automated speech scoring for speakers with documented or suspected speech impairments. Given that the use of automated scoring of open-ended spoken responses is relatively nascent and there is little research to date that includes test takers with disabilities, this small exploratory study focuses…

Descriptors: Automation, Scoring, Language Tests, Speech Tests

Evaluating Subscore Uses across Multiple Levels: A Case of Reading and Listening Subscores for Young EFL Learners

Peer reviewed

Direct link

Choi, Ikkyu; Papageorgiou, Spiros – Language Testing, 2020

Stakeholders of language tests are often interested in subscores. However, reporting a subscore is not always justified; a subscore should provide reliable and distinct information to be worth reporting. When a subscore is used for decisions across multiple levels (e.g., individual test takers and schools), it needs to be justified for its…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Scores

Do the TOEFL iBT® Section Scores Provide Value-Added Information to Stakeholders

Peer reviewed

Direct link

Sawaki, Yasuyo; Sinharay, Sandip – Language Testing, 2018

The present study examined the reliability of the reading, listening, speaking, and writing section scores for the TOEFL iBT® test and their interrelationship in order to collect empirical evidence to support, respectively, the "generalization" inference and the "explanation" inference in the TOEFL iBT validity argument…

Descriptors: English (Second Language), Language Tests, Second Language Learning, Computer Assisted Testing

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6

ETS Research Report Series	15
Language Testing	9
Language Assessment Quarterly	3
Online Submission	3
TESOL Quarterly	3
Cogent Education	2
College Entrance Examination…	2
Educational Testing Service	2
English Language Teaching	2
International Journal of…	2
JALT CALL Journal	2
Applied Linguistics	1
Assessment in Education:…	1
ESL Magazine	1
Education and Information…	1
Educational and Psychological…	1
International Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Pan-Pacific…	1
Language Learning	1
Language Teaching Research…	1
ProQuest LLC	1
Psicologica: International…	1
Reading Matrix: An…	1
More ▼

Lee, Yong-Won	7
Kantor, Robert	5
Papageorgiou, Spiros	5
Mollaun, Pam	4
Davis, Larry	3
Henning, Grant	3
Xi, Xiaoming	3
Attali, Yigal	2
Bridgeman, Brent	2
Burstein, Jill	2
Carlson, Sybil B.	2
Choi, Ikkyu	2
Gentile, Claudia	2
Kermad, Alyssa	2
Manalo, Jonathan R.	2
Morgan, Rick	2
Sawaki, Yasuyo	2
Sinharay, Sandip	2
Wolfe, Edward W.	2
Yamini, Mortaza	2
Ahmadi Shirazi, Masoumeh	1
Ahmadi, Alireza	1
Ahour, Touran	1
Alderson, J. Charles	1
More ▼