Publication Date
In 2025 | 0 |
Since 2024 | 8 |
Since 2021 (last 5 years) | 79 |
Since 2016 (last 10 years) | 161 |
Descriptor
Source
Author
Babaii, Esmat | 3 |
Cheng, Liying | 3 |
Davis, Larry | 3 |
Anaya, Jissel B. | 2 |
Anna-Maria Fall | 2 |
Arefsadr, Sajjad | 2 |
Bedore, Lisa M. | 2 |
Beula M. Magimairaj | 2 |
Brown, Alan V. | 2 |
Evanini, Keelan | 2 |
Galaczi, Evelina | 2 |
More ▼ |
Publication Type
Education Level
Audience
Teachers | 2 |
Administrators | 1 |
Researchers | 1 |
Location
China | 15 |
Iran | 9 |
Japan | 6 |
Turkey | 6 |
Europe | 5 |
New York | 4 |
Switzerland | 4 |
Germany | 3 |
India | 3 |
Nebraska | 3 |
United Kingdom | 3 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
DeCarlo, Lawrence T.; Zhou, Xiaoliang – Journal of Educational Measurement, 2021
In signal detection rater models for constructed response (CR) scoring, it is assumed that raters discriminate equally well between different latent classes defined by the scoring rubric. An extended model that relaxes this assumption is introduced; the model recognizes that a rater may not discriminate equally well between some of the scoring…
Descriptors: Scoring, Models, Bias, Perception
Exploring Language Assessment Literacy: A Case of Perceived Needs of Two Stakeholder Groups in Egypt
Amira Desouky Ali – International Journal of Training and Development, 2024
Stakeholders in exam-driven countries are responsible for developing test-related tasks to assess the quality of English as a Foreign Language (EFL) teaching and learning. Hence, the language assessment literacy (LAL) of different stakeholders has to be investigated. This mixed-methods study explored the required LAL competencies among two groups…
Descriptors: Language Tests, Assessment Literacy, Second Language Learning, English (Second Language)
Peter Howell; Clarissa Sorger; Roa'a Alsulaiman; Kaho Yoshikawa; John Harris; Kevin Tang – International Journal of Language & Communication Disorders, 2024
Background: Non-word repetition (NWR) tests are an important way speech and language therapists (SaLTs) assess language development. NWR tests are often scored whilst participants make their responses (i.e., in real time) in clinical and research reports (documented here via a secondary analysis of a published systematic review). Aims: The main…
Descriptors: Language Tests, Scoring, Accuracy, Children
Zhao, Ruibin; Zhuang, Yipeng; Zou, Di; Xie, Qin; Yu, Philip L. H. – Education and Information Technologies, 2023
Grading assignments is inherently subjective and time-consuming; automatic scoring tools can greatly reduce teacher workload and shorten the time needed for providing feedback to learners. The purpose of this paper is to propose a novel method for automatically scoring student responses to picture-cued writing tasks. As a popular paradigm for…
Descriptors: Artificial Intelligence, Automation, Scoring, Visual Aids
Han, Chao – Language Testing, 2022
Over the past decade, testing and assessing spoken-language interpreting has garnered an increasing amount of attention from stakeholders in interpreter education, professional certification, and interpreting research. This is because in these fields assessment results provide a critical evidential basis for high-stakes decisions, such as the…
Descriptors: Translation, Language Tests, Testing, Evaluation Methods
Barry O'Sullivan – Language Assessment Quarterly, 2023
This paper highlights as issues of concern the rapid changes in technology and the tendency to report on partial validation efforts where the work is not identified as forming part of a larger validation project. With close human supervision emerging technologies can have a significant and positive impact on language testing. While technology…
Descriptors: Technology Uses in Education, Computer Assisted Testing, Language Tests, Supervision
Santi Farmasari; Lalu Ali Wardana; Baharudin; Desi Herayana; Hartati Suryaningsih – International Journal of Language Education, 2023
Pre-service teachers' ability to construct and conduct assessment has been a point of emphasis for decades, and rightfully so. It is crucial that they acquire the necessary knowledge and abilities in their language assessment course during their pre-service teacher education to effectively assess students in their future professional routines. The…
Descriptors: Preservice Teachers, English (Second Language), Language Teachers, Assessment Literacy
Nakamura, Keita – Language Testing in Asia, 2022
Background: This study investigated the scoring and criterion-related validity of the TEAP, a newly developed Test of English for Academic Purposes. In this study, scoring validity was examined by investigating the factor structure, while criterion-related validity was examined by first investigating the longitudinal change of test takers'…
Descriptors: Test Validity, English for Academic Purposes, Language Tests, Scoring
Jin, Kuan-Yu; Eckes, Thomas – Measurement: Interdisciplinary Research and Perspectives, 2022
Recent research on rater effects in performance assessments has increasingly focused on rater centrality, the tendency to assign scores clustering around the rating scale's middle categories. In the present paper, we adopted Jin and Wang's (2018) extended facets modeling approach and constructed a centrality continuum, ranging from raters…
Descriptors: Performance Based Assessment, Evaluators, Scoring, Sample Size
Seedhouse, Paul; Satar, Müge – Classroom Discourse, 2023
The same L2 speaking performance may be analysed and evaluated in very different ways by different teachers or raters. We present a new, technology-assisted research design which opens up to investigation the trajectories of convergence and divergence between raters. We tracked and recorded what different raters noticed when, whilst grading a…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Oral Language
Kornwipa Poonpon; Paiboon Manorom; Wirapong Chansanam – Contemporary Educational Technology, 2023
Automated essay scoring (AES) has become a valuable tool in educational settings, providing efficient and objective evaluations of student essays. However, the majority of AES systems have primarily focused on native English speakers, leaving a critical gap in the evaluation of non-native speakers' writing skills. This research addresses this gap…
Descriptors: Automation, Essays, Scoring, English (Second Language)
Huang, Jing; Chen, Gaowei – AERA Online Paper Repository, 2019
This research investigates the effects of rater experience on performance ratings in language testing using a systematic review of studies published from 1985 to 2017. Based on a comprehensive literature search of 14 databases, we identified sixteen relevant papers. With these we conducted a narrative review to conceptualize a theoretical…
Descriptors: Language Tests, Experience, Evaluators, Performance Based Assessment
Mehdi Mehranirad; Nahid Basafa; Reza Zabihi – Early Child Development and Care, 2024
The present study aimed to examine the effect of activity engagement, age, language proficiency, and time elapse on children's response accuracy to adult's questions. A total of 70, 3- to 6-year-old children participated in the study, engaging in a story-telling activity, a proficiency test, and two interviews. Additionally, 57 of these children…
Descriptors: Accuracy, Language Proficiency, Age Differences, Reaction Time
Alpizar, David; Li, Tongyun; Norris, John M.; Gu, Lixiong – Language Testing, 2023
The C-test is a type of gap-filling test designed to efficiently measure second language proficiency. The typical C-test consists of several short paragraphs with the second half of every second word deleted. The words with deleted parts are considered as items nested within the corresponding paragraph. Given this testlet structure, it is commonly…
Descriptors: Psychometrics, Language Tests, Second Language Learning, Test Items
Lestari, Santi B.; Brunfaut, Tineke – Language Testing, 2023
Assessing integrated reading-into-writing task performances is known to be challenging, and analytic rating scales have been found to better facilitate the scoring of these performances than other common types of rating scales. However, little is known about how specific operationalizations of the reading-into-writing construct in analytic rating…
Descriptors: Reading Writing Relationship, Writing Tests, Rating Scales, Writing Processes