Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 8 |
Descriptor
Native Speakers | 11 |
Scores | 11 |
English (Second Language) | 9 |
Language Tests | 8 |
Second Language Learning | 8 |
Foreign Countries | 4 |
Comparative Analysis | 3 |
English | 3 |
Interviews | 3 |
Statistical Analysis | 3 |
Testing | 3 |
More ▼ |
Source
Language Testing | 11 |
Author
Winke, Paula | 2 |
Chalhoub-Deville, Micheline | 1 |
Cho, Yeonsuk | 1 |
Davidson, Fred | 1 |
Garras, John | 1 |
Hanlon, Sean | 1 |
Hopp, Holger | 1 |
Jarvis, Scott | 1 |
Lee, HyeSun | 1 |
Lee, Shinhye | 1 |
McCarthy, Philip M. | 1 |
More ▼ |
Publication Type
Journal Articles | 11 |
Reports - Research | 7 |
Reports - Evaluative | 4 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 4 |
Elementary Education | 1 |
High Schools | 1 |
Postsecondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 3 |
Graduate Record Examinations | 1 |
What Works Clearinghouse Rating
Van Moere, Alistair; Hanlon, Sean – Language Testing, 2020
In language assessment and in educational measurement more broadly, there is a tendency to interpret scores from single-administration tests as accurate indicators of a latent trait (e.g., reading ability). Even in contexts where learners receive multiple formative assessments throughout the year, estimates of student ability are determined based…
Descriptors: Bayesian Statistics, Measurement, Accuracy, English (Second Language)
Lee, Shinhye; Winke, Paula – Language Testing, 2018
We investigated how young language learners process their responses on and perceive a computer-mediated, timed speaking test. Twenty 8-, 9-, and 10-year-old non-native English-speaking children (NNSs) and eight same-aged, native English-speaking children (NSs) completed seven computerized sample TOEFL® Primary™ speaking test tasks. We investigated…
Descriptors: Elementary School Students, Second Language Learning, Responses, Computer Assisted Testing
Schmid, Monika S.; Hopp, Holger – Language Testing, 2014
This study examines the methodology of global foreign accent ratings in studies on L2 speech production. In three experiments, we test how variation in raters, range within speech samples, as well as instructions and procedures affects ratings of accent in predominantly monolingual speakers of German, non-native speakers of German, as well as…
Descriptors: Comparative Analysis, Second Language Learning, Pronunciation, Native Speakers
Cho, Yeonsuk; Rijmen, Frank; Novák, Jakub – Language Testing, 2013
This study examined the influence of prompt characteristics on the averages of all scores given to test taker responses on the TOEFL iBT[TM] integrated Read-Listen-Write (RLW) writing tasks for multiple administrations from 2005 to 2009. In the context of TOEFL iBT RLW tasks, the prompt consists of a reading passage and a lecture. To understand…
Descriptors: English (Second Language), Language Tests, Writing Tests, Cues
Lee, HyeSun; Winke, Paula – Language Testing, 2013
We adapted three practice College Scholastic Ability Tests (CSAT) of English listening, each with five-option items, to create four- and three-option versions by asking 73 Korean speakers or learners of English to eliminate the least plausible options in two rounds. Two hundred and sixty-four Korean high school English-language learners formed…
Descriptors: Academic Ability, Stakeholders, Reliability, Listening Comprehension Tests
Sasaki, Miyuki – Language Testing, 2012
The Modern Language Aptitude Test (Paper-and-Pencil Version, henceforth, the MLAT) measures "an individual's ability to learn a foreign language." It targets English-speaking adults (over Grade 9) who are literate. The test has only one form, which has not changed since it was first published by the Psychological Corporation in 1959. The test can…
Descriptors: Aptitude Tests, Test Reviews, Rewards, Acoustics
Schmitt, Norbert; Ng, Janice Wun Ching; Garras, John – Language Testing, 2011
Although the Word Associates Format (WAF) is becoming more frequently used as a depth-of-knowledge measure, relatively little validation has been carried out on it. This report of two validation studies tackles various important WAF issues yet to be satisfactorily resolved. Study 1 conducted introspective interviews regarding students' WAF…
Descriptors: Scoring, Vocabulary Development, Associative Learning, Validity
McCarthy, Philip M.; Jarvis, Scott – Language Testing, 2007
A reliable index of lexical diversity (LD) has remained stubbornly elusive for over 60 years. Meanwhile, researchers in fields as varied as "stylistics," "neuropathology," "language acquisition," and even "forensics" continue to use flawed LD indices--often ignorant that their results are questionable and in…
Descriptors: Second Language Learning, English (Second Language), Foreign Countries, Adolescents
Stricker, L. J. – Language Testing, 2004
The purpose of this study was to replicate previous research on the construct validity of the paper-based version of the TOEFL and extend it to the computer-based TOEFL. Two samples of Graduate Record Examination (GRE) General Test-takers were used: native speakers of English specially recruited to take the computer-based TOEFL, and ESL…
Descriptors: Native Speakers, Construct Validity, English (Second Language), Computer Assisted Instruction

Chalhoub-Deville, Micheline – Language Testing, 1995
The purpose of this study was to derive the criteria/dimensions underlying learners' second-language oral ability scores across three tests: an oral interview, a narration, and a read-aloud. A stimulus tape of 18 speech samples was presented to 3 native speaker rater groups for evaluation. Results indicate that researchers might need to reconsider…
Descriptors: Arabic, Evaluators, Interviews, Language Tests

Davidson, Fred – Language Testing, 1994
Examines appropriacy of a nationally standardized test normed on English speakers but used with non-English speaking students. Data from the school year are analyzed via reliability comparison, exploratory factor analysis, and comparison of variances. The use of the test was statistically defensible. This finding does not address the need for…
Descriptors: Achievement Tests, Analysis of Variance, Elementary Secondary Education, English (Second Language)