ERIC - Search Results

Showing 1 to 15 of 62 results

Triangulating Natural Language Processing (NLP)-Based Analysis of Rater Comments and Many-Facet Rasch Measurement (MFRM): An Innovative Approach to Investigating Raters' Application of Rating Scales in Writing Assessment

Peer reviewed

Huiying Cai; Xun Yan – Language Testing, 2024

Rater comments tend to be qualitatively analyzed to indicate raters' application of rating scales. This study applied natural language processing (NLP) techniques to quantify meaningful, behavioral information from a corpus of rater comments and triangulated that information with a many-facet Rasch measurement (MFRM) analysis of rater scores. The…

Descriptors: Natural Language Processing, Item Response Theory, Rating Scales, Writing Evaluation

Do Source Use Features Impact Raters' Judgment of Argumentation? An Experimental Study

Peer reviewed

Ping-Lin Chuang – Language Testing, 2025

This experimental study explores how source use features impact raters' judgment of argumentation in a second language (L2) integrated writing test. One hundred four experienced and novice raters were recruited to complete a rating task that simulated the scoring assignment of a local English Placement Test (EPT). Sixty written responses were…

Descriptors: Interrater Reliability, Evaluators, Information Sources, Primary Sources

Assessing Speaking through Multimodal Oral Presentations: The Case of Construct Underrepresentation in EAP Contexts

Peer reviewed

Louise Palmour – Language Testing, 2024

This article explores the nature of the construct underlying classroom-based English for academic purposes (EAP) oral presentation assessments, which are used, in part, to determine admission to programmes of study at UK universities. Through analysis of qualitative data (from questionnaires, interviews, rating discussions, and fieldnotes), the…

Descriptors: English for Academic Purposes, Public Speaking, College Students, Foreign Countries

Interpreting Testing and Assessment: A State-of-the-Art Review

Peer reviewed

Han, Chao – Language Testing, 2022

Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…

Descriptors: Translation, Language Tests, Testing, Evaluation Methods

But Who Trains the Language Teacher Educator Who Trains the Language Teacher? An Empirical Investigation of Chilean EFL Teacher Educators' Language Assessment Literacy

Peer reviewed

Villa Larenas, Salomé; Brunfaut, Tineke – Language Testing, 2023

Research has shown that language teachers typically feel underprepared for assessment aspects of their job. One reason may relate to how teacher education programmes prepare future teachers in this area. Research insights into how and to what extent teacher educators train future language teachers in language assessment matters are scarce,…

Descriptors: Foreign Countries, Second Language Instruction, Language Teachers, Teacher Educators

Evaluating Methodological Enhancements to the Yes/No Angoff Standard-Setting Method in Language Proficiency Assessment

Peer reviewed

Tia M. Fechter; Heeyeon Yoon – Language Testing, 2024

This study evaluated the efficacy of two proposed methods in an operational standard-setting study conducted for a high-stakes language proficiency test of the U.S. government. The goal was to seek low-cost modifications to the existing Yes/No Angoff method to increase the validity and reliability of the recommended cut scores using a convergent…

Descriptors: Standard Setting, Language Proficiency, Language Tests, Evaluation Methods

Test Score Comparison Tables: How Well are They Serving Test Users?

Peer reviewed

Ute Knoch; Jason Fan – Language Testing, 2024

While several test concordance tables have been published, the research underpinning such tables has rarely been examined in detail. This study aimed to survey the publicly available studies or documentation underpinning the test concordance tables of the providers of four major international language tests, all accepted by the Australian…

Descriptors: Language Tests, English, Test Validity, Item Analysis

Towards More Valid Scoring Criteria for Integrated Reading-Writing and Listening-Writing Summary Tasks

Peer reviewed

Chan, Sathena; May, Lyn – Language Testing, 2023

Despite the increased use of integrated tasks in high-stakes academic writing assessment, research on rating criteria which reflect the unique construct of integrated summary writing skills is comparatively rare. Using a mixed-method approach of expert judgement, text analysis, and statistical analysis, this study examines writing features that…

Descriptors: Scoring, Writing Evaluation, Reading Tests, Listening Skills

Administration, Labor, and Love

Peer reviewed

Ginther, April – Language Testing, 2023

Great opportunities for language testing practitioners are enabled through language program administration. Local language tests lend themselves to multiple purposes--for placement and diagnosis, as a means of tracking progress, and as a contribution to program evaluation and revision. Administrative choices, especially those involving a test, are…

Descriptors: Language Tests, Testing, Examiners, Placement Tests

A Look into the Practices and Challenges of Assessing Young EFL Learners' Writing in Croatia

Peer reviewed

Patekar, Jakob – Language Testing, 2021

Writing in a foreign language is a particularly difficult skill to develop, especially for young learners, because they are learning to write in their L1 at the same time and do not have strong oral foundations in their L2. The issue becomes even more complex when the ways to assess young learners' writing are considered, given that…

Descriptors: Language Tests, Test Construction, Foreign Countries, Oral Language

Assessment of Fluency in the Test of English for Educational Purposes

Peer reviewed

Tavakoli, Parvaneh; Kendon, Gill; Mazhurnaya, Svetlana; Ziomek, Anna – Language Testing, 2023

The main aim of this study was to investigate how oral fluency is assessed across different levels of proficiency in the Test of English for Educational Purposes (TEEP). Working with data from 56 test-takers performing a monologic task at a range of proficiency levels (equivalent to approximately levels 5.0, 5.5, 6.5, and 7.5 in the IELTS scoring…

Descriptors: Language Fluency, Language Tests, English (Second Language), Second Language Learning

A Comparative Judgment Approach to Assessing Chinese Sign Language Interpreting

Peer reviewed

Han, Chao; Xiao, Xiaoyan – Language Testing, 2022

The quality of sign language interpreting (SLI) is a gripping construct among practitioners, educators and researchers, calling for reliable and valid assessment. There has been a diverse array of methods in the extant literature to measure SLI quality, ranging from traditional error analysis to recent rubric scoring. In this study, we want to…

Descriptors: Comparative Analysis, Sign Language, Deaf Interpreting, Evaluators

Diagnosing Chinese EFL Learners' Writing Ability Using Polytomous Cognitive Diagnostic Models

Peer reviewed

Xiaoting Shi; Xiaomei Ma; Wenbo Du; Xuliang Gao – Language Testing, 2024

Cognitive diagnostic assessment (CDA) intends to identify learners' strengths and weaknesses in latent cognitive attributes to provide personalized remedial instructions. Previous CDA studies on English as a Foreign Language (EFL)/English as a Second Language (ESL) writing have adopted dichotomous cognitive diagnostic models (CDMs) to analyze data…

Descriptors: Writing Evaluation, Writing Tests, Diagnostic Tests, English (Second Language)

Assessing the Content Quality of Essays in Content and Language Integrated Learning: Exploring the Construct from Subject Specialists' Perspectives

Peer reviewed

Takanori Sato – Language Testing, 2024

Assessing the content of learners' compositions is a common practice in second language (L2) writing assessment. However, the construct definition of content in L2 writing assessment potentially underrepresents the target competence in content and language integrated learning (CLIL), which aims to foster not only L2 proficiency but also critical…

Descriptors: Language Tests, Content and Language Integrated Learning, Writing Evaluation, Writing Tests

A Systematic Review of Methods for Evaluating Rating Quality in Language Assessment

Peer reviewed

Wind, Stefanie A.; Peterson, Meghan E. – Language Testing, 2018

The use of assessments that require rater judgment (i.e., rater-mediated assessments) has become increasingly popular in high-stakes language assessments worldwide. Using a systematic literature review, the purpose of this study is to identify and explore the dominant methods for evaluating rating quality within the context of research on…

Descriptors: Language Tests, Evaluators, Evaluation Methods, Interrater Reliability
