Publication Date
In 2025 | 2 |
Since 2024 | 32 |
Since 2021 (last 5 years) | 107 |
Since 2016 (last 10 years) | 205 |
Since 2006 (last 20 years) | 403 |
Descriptor
Source
Language Testing | 596 |
Author
Davies, Alan | 11 |
Bachman, Lyle F. | 10 |
Alderson, J. Charles | 8 |
Elder, Catherine | 8 |
Knoch, Ute | 8 |
McNamara, Tim | 8 |
Yan, Xun | 7 |
Brunfaut, Tineke | 6 |
Chapelle, Carol A. | 6 |
Cho, Yeonsuk | 6 |
Ginther, April | 6 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 1 |
Teachers | 1 |
Location
Japan | 31 |
China | 27 |
Australia | 24 |
United Kingdom | 14 |
South Korea | 12 |
Canada | 11 |
Hong Kong | 9 |
Netherlands | 9 |
Germany | 8 |
Europe | 7 |
Taiwan | 6 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Elementary and Secondary… | 1 |
Lau v Nichols | 1 |
Race to the Top | 1 |
Assessments and Surveys
What Works Clearinghouse Rating

Kunnan, Antony John – Language Testing, 1992
Three analysis procedures were used to study the dependability and validity of ESLPE, a criterion-referenced English-as-a-Second-Language placement test developed at the University of California at Los Angeles in 1989. Findings led to the suggestion that some students might have been differently placed if subtest scores were used for placement.(38…
Descriptors: Cluster Analysis, Comparative Analysis, Criterion Referenced Tests, English (Second Language)

Rea-Dickins, Pauline; Gardner, Sheena – Language Testing, 2000
Explores the nature of formative assessment in a primary language learning context. The research is from nine inner city schools where an early years intervention project is being carried out to address problems of low levels of achievement in English, with specific reference to the language support of learners for whom English is an additional…
Descriptors: Elementary Education, English (Second Language), Foreign Countries, Formative Evaluation
Lee, Y-W. – Language Testing, 2004
The purpose of the study reported in this article is to empirically examine passage-related local item dependence (LID) by using an IRT (item response theory) based LID index called Q3 in an EFL reading comprehension test, with a special focus on item types as a potentially competing source of LID with passages. In this article, definitions and…
Descriptors: Psychometrics, Item Response Theory, Content Analysis, Reading Comprehension
Stricker, L. J. – Language Testing, 2004
The purpose of this study was to replicate previous research on the construct validity of the paper-based version of the TOEFL and extend it to the computer-based TOEFL. Two samples of Graduate Record Examination (GRE) General Test-takers were used: native speakers of English specially recruited to take the computer-based TOEFL, and ESL…
Descriptors: Native Speakers, Construct Validity, English (Second Language), Computer Assisted Instruction
Weir, Cyril J.; Wu, Jessica R. W. – Language Testing, 2006
Examination boards are often criticized for their failure to provide evidence of comparability across forms, and few such studies are publicly available. This study aims to investigate the extent to which three forms of the General English Proficiency Test Intermediate Speaking Test (GEPTS-I) are parallel in terms of two types of validity…
Descriptors: Foreign Countries, Test Format, Speech Communication, Check Lists
Roever, Carsten – Language Testing, 2006
Despite increasing interest in interlanguage pragmatics research, research on assessment of this crucial area of second language competence still lags behind assessment of other aspects of learners' developing second language (L2) competence. This study describes the development and validation of a 36-item web-based test of ESL pragmalinguistics,…
Descriptors: Familiarity, Test Validity, Speech Acts, Interlanguage
Huhta, Ari; Kalaja, Paula; Pitkanen-Huhta, Anne – Language Testing, 2006
As part of a larger project, we studied how a foreign language test got discursively constructed in the talk of upper-secondary-school leavers. A group of students were asked to keep an oral diary to record their ideas, feelings and experiences of preparing for and taking the test over the last spring term of school, as part of a high-stakes…
Descriptors: Test Results, Psychologists, Language Tests, Discourse Analysis

Rea-Dickens, Pauline – Language Testing, 1997
Examines the contributions made by stakeholders such has learners, teachers, and parents to the language assessment process. Examines the relationship between experts and government in the United Kingdom. It is argued that participation by stakeholders is not limited to providing a forum but includes equipping teachers, parents, and others with…
Descriptors: Change Agents, Child Language, Foreign Countries, Government School Relationship

Wall, Dianne – Language Testing, 1996
Suggests that any model of washback must include insights from the theory of educational innovation to help explain why tests do not always have the desired or feared effect. Key concepts in educational innovation are reviewed, showing how these concepts are manifested in a case study in washback and outlining how they are being applied in recent…
Descriptors: Case Studies, Change Strategies, Cognitive Development, Educational Innovation

Bonk, William J.; Ockey, Gary J. – Language Testing, 2003
FACETS many-facet Rasch analysis software was utilized to look at two consecutive administrations of a large-scale second language oral assessment in the form of a peer group discussion task with Japanese English-major university students. Results are discussed. (Author/VWL)
Descriptors: College Students, Computer Software, Discussion (Teaching Technique), Higher Education

Mislevy, Robert J. – Language Testing, 1995
A conceptualization of test theory is discussed that addresses issues of weight and coverage of evidence for statements framed in recent educational/psychological paradigms. Implications for language assessment built around the American Council on the Teaching of Foreign Languages' guidelines are considered. (26 references) (Author/CK)
Descriptors: Audiotape Recordings, Charts, Cognitive Measurement, Educational Psychology

McNamara, T. F. – Language Testing, 1990
Discusses the role of the Rasch model IRT in evaluating two subtests of the Occupational English test and argues for its use in exploring test constructs and in considering the implications of the empirical analysis presented for the validity of communicative language tests involving speaking and writing skills. (39 references) (Author/JL)
Descriptors: Construct Validity, English for Special Purposes, Evaluation, Health Occupations

Cumming, Alister – Language Testing, 2001
Interviewed teachers from around the world to examine a specific purpose (SP) versus general purpose (GP) distinction in their orientations to the work they do. The difference in orientation was signaled in the criteria the teachers use to assess students' writing. (Author)
Descriptors: Evaluation Criteria, Interviews, Language Teachers, Language Tests
Cumming, A.; Grant, L.; Mulcahy-Ernt, P.; Powers, D.E. – Language Testing, 2004
This study was undertaken, in conjunction with other studies field-testing prototype tasks for a new TOEFL, to evaluate the content validity, perceived authenticity and educational appropriateness of these prototype tasks. We interviewed seven highly experienced instructors of English as a Second Language (ESL) at three universities, asking them…
Descriptors: Oral Language, Writing Skills, Language Tests, Content Validity

Klein-Braley, Christine – Language Testing, 1985
Presents the theory of general language proficiency and looks at the construct validation of cloze tests and C-tests. Describes the defects of classical cloze procedures. Gives an example of the C-Test and discusses its empirical validity. Concludes that C-tests are authentic tests of the construct of general language proficiency.
Descriptors: Cloze Procedure, Comparative Analysis, Language Proficiency, Language Research