ERIC - Search Results

Publication Date

In 2025	3
Since 2024	6
Since 2021 (last 5 years)	10
Since 2016 (last 10 years)	21
Since 2006 (last 20 years)	40

Descriptor

Foreign Countries	45
Language Tests	32
Test Reliability	27
Second Language Learning	24
English (Second Language)	22
Test Validity	18
Language Proficiency	15
Interrater Reliability	14
Item Response Theory	12
Scores	12
Secondary School Students	10
Correlation	9
Comparative Analysis	8
Reading Comprehension	8
Scoring	8
High Stakes Tests	7
Test Construction	7
College Students	5
Listening Comprehension Tests	5
Models	5
Native Speakers	5
Rating Scales	5
Reliability	5
Second Language Instruction	5
Test Bias	5
More ▼

Source

Language Testing

Publication Type

Journal Articles	45
Reports - Research	31
Reports - Evaluative	10
Reports - Descriptive	4
Tests/Questionnaires	1

Education Level

Higher Education	13
Secondary Education	11
Postsecondary Education	8
Elementary Education	4
Elementary Secondary Education	3
Junior High Schools	2
Middle Schools	2
Adult Education	1
Early Childhood Education	1
Grade 12	1
Grade 6	1
Grade 7	1
High Schools	1
Intermediate Grades	1
Kindergarten	1
Primary Education	1
More ▼

Audience

Location

Netherlands	7
China	6
Finland	4
Germany	4
Australia	3
Japan	3
France	2
Hong Kong	2
South Korea	2
Taiwan	2
United Kingdom	2
Austria	1
Bulgaria	1
Canada	1
China (Guangzhou)	1
Colombia	1
Denmark	1
Europe	1
Iran	1
Italy	1
Kenya	1
Pennsylvania (Philadelphia)	1
Poland	1
Russia	1
Sweden	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	4
English Proficiency Test	1
International English…	1
Peabody Picture Vocabulary…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 45 results Save | Export

Communal Factors in Rater Severity and Consistency over Time in High-Stakes Oral Assessment

Peer reviewed

Direct link

Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024

This article focuses on rater severity and consistency and their relation to major changes in the rating system in a high-stakes testing context. The study is based on longitudinal data collected from 2009 to 2019 from the second language (L2) Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. We investigated…

Descriptors: Foreign Countries, Interrater Reliability, Evaluators, Item Response Theory

All Types of Experience Are Equal, but Some Are More Equal: The Effect of Different Types of Experience on Rater Severity and Rater Consistency

Peer reviewed

Direct link

Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024

This article focuses on rater severity and consistency and their relation to different types of rater experience over a long period of time. The article is based on longitudinal data collected from 2009 to 2019 from the second language Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. The study investigated…

Descriptors: Foreign Countries, Interrater Reliability, Error of Measurement, Experience

Test Review: Computer-Based English Listening and Speaking Test (CELST) of National Matriculation English Test (NMET) Guangdong Version in China

Peer reviewed

Direct link

Ying Xu; Xiaodong Li; Jin Chen – Language Testing, 2025

This article provides a detailed review of the Computer-based English Listening Speaking Test (CELST) used in Guangdong, China, as part of the National Matriculation English Test (NMET) to assess students' English proficiency. The CELST measures listening and speaking skills as outlined in the "English Curriculum for Senior Middle…

Descriptors: Computer Assisted Testing, English (Second Language), Language Tests, Listening Comprehension Tests

Developing Internet-Based "Tests of Aptitude for Language Learning (TALL)": An Open Research Endeavour

Peer reviewed

Direct link

Junlan Pan; Emma Marsden – Language Testing, 2024

"Tests of Aptitude for Language Learning" (TALL) is an openly accessible internet-based battery to measure the multifaceted construct of foreign language aptitude, using language domain-specific instruments and L1-sensitive instructions and stimuli. This brief report introduces the components of this theory-informed battery and…

Descriptors: Language Tests, Aptitude Tests, Second Language Learning, Test Construction

Comparative Judgement for Evaluating Young Learners' EFL Writing Performances: Reliability and Teacher Perceptions of Holistic and Dimension-Based Judgements

Peer reviewed

Direct link

Rebecca Sickinger; Tineke Brunfaut; John Pill – Language Testing, 2025

Comparative Judgement (CJ) is an evaluation method, typically conducted online, whereby a rank order is constructed, and scores calculated, from judges' pairwise comparisons of performances. CJ has been researched in various educational contexts, though only rarely in English as a Foreign Language (EFL) writing settings, and is generally agreed to…

Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction

A New Scoring Method for Item Response Theory Analysis of C-Tests

Peer reviewed

Direct link

Farshad Effatpanah; Purya Baghaei; Mona Tabatabaee-Yazdi; Esmat Babaii – Language Testing, 2025

This study aimed to propose a new method for scoring C-Tests as measures of general language proficiency. In this approach, the unit of analysis is sentences rather than gaps or passages. That is, the gaps correctly reformulated in each sentence were aggregated as sentence score, and then each sentence was entered into the analysis as a polytomous…

Descriptors: Item Response Theory, Language Tests, Test Items, Test Construction

Adaptation of the British Sign Language Receptive Skills Test into Polish Sign Language

Peer reviewed

Direct link

Kotowicz, Justyna; Woll, Bencie; Herman, Rosalind – Language Testing, 2021

The evaluation of sign language proficiency needs to be based on measures with well-established psychometric proprieties. To date, no valid and reliable test is available to assess Polish Sign Language ("Polski Jezyk Migowy," PJM) skills in deaf children. Hence, our aim with this study was to adapt the British Sign Language Receptive…

Descriptors: Language Tests, Receptive Language, Sign Language, Language Proficiency

Measuring L2 Speakers' Interactional Ability Using Interactive Speech Tasks

Peer reviewed

Direct link

van Batenburg, Eline S. L.; Oostdam, Ron J.; van Gelderen, Amos J. S.; de Jong, Nivja H. – Language Testing, 2018

This article explores ways to assess interactional performance, and reports on the use of a test format that standardizes the interlocutor's linguistic and interactional contributions to the exchange. It describes the construction and administration of six scripted speech tasks (instruction, advice, and sales tasks) with pre-vocational learners (n…

Descriptors: Second Language Learning, Speech Tests, Interaction, Test Reliability

The Longitudinal Stability of Rating Characteristics in an EFL Examination: Methodological and Substantive Considerations

Peer reviewed

Direct link

Lamprianou, Iasonas; Tsagari, Dina; Kyriakou, Nansia – Language Testing, 2021

This longitudinal study (2002-2014) investigates the stability of rating characteristics of a large group of raters over time in the context of the writing paper of a national high-stakes examination. The study uses one measure of rater severity and two measures of rater consistency. The results suggest that the rating characteristics of…

Descriptors: Longitudinal Studies, Evaluators, High Stakes Tests, Writing Evaluation

Examining the L2 Reading Comprehension Ability of Adult ELLs: Developing a Diagnostic Test within the Cognitive Diagnostic Assessment Framework

Peer reviewed

Direct link

Toprak, Tugba Elif; Cakir, Abdulvahit – Language Testing, 2021

Cognitive diagnostic assessment (CDA) has been applied to language assessment in a number of studies in which a diagnostic classification model (DCM) was retrofitted to the results of a non-diagnostic assessment. However, the need to apply CDA through utilization of an inductive rather than a retrofitted approach has been a recurrent theme in…

Descriptors: English (Second Language), Second Language Learning, Undergraduate Students, Young Adults

Validity Evidence for a Sentence Repetition Test of Swiss German Sign Language

Peer reviewed

Direct link

Haug, Tobias; Batty, Aaron Olaf; Venetz, Martin; Notter, Christa; Girard-Groeber, Simone; Knoch, Ute; Audeoud, Mireille – Language Testing, 2020

In this study we seek evidence of validity according to the socio-cognitive framework (Weir, 2005) for a new sentence repetition test (SRT) for young Deaf L1 Swiss German Sign Language (DSGS) users. SRTs have been developed for various purposes for both spoken and sign languages to assess language development in children. In order to address the…

Descriptors: Foreign Countries, Language Tests, Sentences, Repetition

Cloze Testing for Comprehension Assessment: The HyTeC-cloze

Peer reviewed

Direct link

Kleijn, Suzanne; Pander Maat, Henk; Sanders, Ted – Language Testing, 2019

Although there are many methods available for assessing text comprehension, the cloze test is not widely acknowledged as one of them. Critiques on cloze testing center on its supposedly limited ability to measure comprehension beyond the sentence. However, these critiques do not hold for all types of cloze tests; the particular configuration of a…

Descriptors: Cloze Procedure, Language Tests, Semantics, Scoring

Measuring the Development of General Language Skills in English as a Foreign Language--Longitudinal Invariance of the C-Test

Peer reviewed

Direct link

Schnoor, Birger; Hartig, Johannes; Klinger, Thorsten; Naumann, Alexander; Usanova, Irina – Language Testing, 2023

Research on assessing English as a foreign language (EFL) development has been growing recently. However, empirical evidence from longitudinal analyses based on substantial samples is still needed. In such settings, tests for measuring language development must meet high standards of test quality such as validity, reliability, and objectivity, as…

Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Longitudinal Studies

Professional and Non-Professional Raters' Responsiveness to Fluency and Accuracy in L2 Speech: An Experimental Approach

Peer reviewed

Direct link

Duijm, Klaartje; Schoonen, Rob; Hulstijn, Jan H. – Language Testing, 2018

It is general practice to use rater judgments in speaking proficiency testing. However, it has been shown that raters' knowledge and experience may influence their ratings, both in terms of leniency and varied focus on different aspects of speech. The purpose of this study is to identify raters' relative responsiveness to fluency and linguistic…

Descriptors: Language Fluency, Accuracy, Second Languages, Language Tests

Mapping the Fluctuating Effect of Strategy Use Ability on English Reading Performance for Nursing Students: A Multi-Layered Moderation Analysis Approach

Peer reviewed

Direct link

Cai, Yuyang; Kunnan, Antony John – Language Testing, 2020

An essential hypothesis of modern language assessment theory pertains to the interaction between strategy use ability (strategic competence) and second language knowledge. However, how they interact with each other is rarely explored. Drawing on relevant research in the literature, in this paper we proposed three interaction patterns (i.e.,…

Descriptors: English (Second Language), Second Language Learning, Nursing Education, Reading Tests

Previous Page | Next Page »

Pages: 1 | 2 | 3

Haug, Tobias	2
Iasonas Lamprianou	2
Reeta Neittaanmäki	2
Schoonen, Rob	2
de Jong, Nivja H.	2
Alanen, Riikka	1
Alderson, J. Charles	1
Audeoud, Mireille	1
Batty, Aaron Olaf	1
Bosker, Hans Rutger	1
Bridgeman, Brent	1
Brown, James Dean	1
Cai, Yuyang	1
Cakir, Abdulvahit	1
Chan, Stephanie W. Y.	1
Cheung, Wai Ming	1
Cho, Yeonsuk	1
Choi, Ikkyu	1
Coniam, David	1
Coombe, Christine	1
Culligan, Brent	1
Davidson, Peter	1
Deygers, Bart	1
DiPietro, Stephen	1
More ▼