NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 11 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Van Moere, Alistair; Hanlon, Sean – Language Testing, 2020
In language assessment and in educational measurement more broadly, there is a tendency to interpret scores from single-administration tests as accurate indicators of a latent trait (e.g., reading ability). Even in contexts where learners receive multiple formative assessments throughout the year, estimates of student ability are determined based…
Descriptors: Bayesian Statistics, Measurement, Accuracy, English (Second Language)
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, Shinhye; Winke, Paula – Language Testing, 2018
We investigated how young language learners process their responses on and perceive a computer-mediated, timed speaking test. Twenty 8-, 9-, and 10-year-old non-native English-speaking children (NNSs) and eight same-aged, native English-speaking children (NSs) completed seven computerized sample TOEFL® Primary™ speaking test tasks. We investigated…
Descriptors: Elementary School Students, Second Language Learning, Responses, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Schmid, Monika S.; Hopp, Holger – Language Testing, 2014
This study examines the methodology of global foreign accent ratings in studies on L2 speech production. In three experiments, we test how variation in raters, range within speech samples, as well as instructions and procedures affects ratings of accent in predominantly monolingual speakers of German, non-native speakers of German, as well as…
Descriptors: Comparative Analysis, Second Language Learning, Pronunciation, Native Speakers
Peer reviewed Peer reviewed
Direct linkDirect link
Cho, Yeonsuk; Rijmen, Frank; Novák, Jakub – Language Testing, 2013
This study examined the influence of prompt characteristics on the averages of all scores given to test taker responses on the TOEFL iBT[TM] integrated Read-Listen-Write (RLW) writing tasks for multiple administrations from 2005 to 2009. In the context of TOEFL iBT RLW tasks, the prompt consists of a reading passage and a lecture. To understand…
Descriptors: English (Second Language), Language Tests, Writing Tests, Cues
Peer reviewed Peer reviewed
Direct linkDirect link
Lee, HyeSun; Winke, Paula – Language Testing, 2013
We adapted three practice College Scholastic Ability Tests (CSAT) of English listening, each with five-option items, to create four- and three-option versions by asking 73 Korean speakers or learners of English to eliminate the least plausible options in two rounds. Two hundred and sixty-four Korean high school English-language learners formed…
Descriptors: Academic Ability, Stakeholders, Reliability, Listening Comprehension Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Sasaki, Miyuki – Language Testing, 2012
The Modern Language Aptitude Test (Paper-and-Pencil Version, henceforth, the MLAT) measures "an individual's ability to learn a foreign language." It targets English-speaking adults (over Grade 9) who are literate. The test has only one form, which has not changed since it was first published by the Psychological Corporation in 1959. The test can…
Descriptors: Aptitude Tests, Test Reviews, Rewards, Acoustics
Peer reviewed Peer reviewed
Direct linkDirect link
Schmitt, Norbert; Ng, Janice Wun Ching; Garras, John – Language Testing, 2011
Although the Word Associates Format (WAF) is becoming more frequently used as a depth-of-knowledge measure, relatively little validation has been carried out on it. This report of two validation studies tackles various important WAF issues yet to be satisfactorily resolved. Study 1 conducted introspective interviews regarding students' WAF…
Descriptors: Scoring, Vocabulary Development, Associative Learning, Validity
Peer reviewed Peer reviewed
Direct linkDirect link
McCarthy, Philip M.; Jarvis, Scott – Language Testing, 2007
A reliable index of lexical diversity (LD) has remained stubbornly elusive for over 60 years. Meanwhile, researchers in fields as varied as "stylistics," "neuropathology," "language acquisition," and even "forensics" continue to use flawed LD indices--often ignorant that their results are questionable and in…
Descriptors: Second Language Learning, English (Second Language), Foreign Countries, Adolescents
Peer reviewed Peer reviewed
Direct linkDirect link
Stricker, L. J. – Language Testing, 2004
The purpose of this study was to replicate previous research on the construct validity of the paper-based version of the TOEFL and extend it to the computer-based TOEFL. Two samples of Graduate Record Examination (GRE) General Test-takers were used: native speakers of English specially recruited to take the computer-based TOEFL, and ESL…
Descriptors: Native Speakers, Construct Validity, English (Second Language), Computer Assisted Instruction
Peer reviewed Peer reviewed
Chalhoub-Deville, Micheline – Language Testing, 1995
The purpose of this study was to derive the criteria/dimensions underlying learners' second-language oral ability scores across three tests: an oral interview, a narration, and a read-aloud. A stimulus tape of 18 speech samples was presented to 3 native speaker rater groups for evaluation. Results indicate that researchers might need to reconsider…
Descriptors: Arabic, Evaluators, Interviews, Language Tests
Peer reviewed Peer reviewed
Davidson, Fred – Language Testing, 1994
Examines appropriacy of a nationally standardized test normed on English speakers but used with non-English speaking students. Data from the school year are analyzed via reliability comparison, exploratory factor analysis, and comparison of variances. The use of the test was statistically defensible. This finding does not address the need for…
Descriptors: Achievement Tests, Analysis of Variance, Elementary Secondary Education, English (Second Language)