Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 5 |
Descriptor
Language Tests | 5 |
Oral Language | 5 |
Second Language Learning | 5 |
Computer Assisted Testing | 4 |
Scoring | 4 |
Correlation | 3 |
English (Second Language) | 3 |
Classification | 2 |
Evaluators | 2 |
Foreign Students | 2 |
Language Fluency | 2 |
More ▼ |
Author
Xi, Xiaoming | 5 |
Higgins, Derrick | 1 |
Ling, Guangming | 1 |
Mollaun, Pam | 1 |
Mollaun, Pamela | 1 |
Williamson, David | 1 |
Zechner, Klaus | 1 |
Publication Type
Journal Articles | 5 |
Reports - Research | 4 |
Tests/Questionnaires | 2 |
Reports - Evaluative | 1 |
Education Level
Higher Education | 2 |
Postsecondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
Test of English as a Foreign… | 4 |
What Works Clearinghouse Rating
Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming – Language Testing, 2014
The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…
Descriptors: Evaluators, Fatigue (Biology), Scoring, Performance
Xi, Xiaoming; Higgins, Derrick; Zechner, Klaus; Williamson, David – Language Testing, 2012
This paper compares two alternative scoring methods--multiple regression and classification trees--for an automated speech scoring system used in a practice environment. The two methods were evaluated on two criteria: construct representation and empirical performance in predicting human scores. The empirical performance of the two scoring models…
Descriptors: Scoring, Classification, Weighted Scores, Comparative Analysis
Xi, Xiaoming – Language Testing, 2010
Motivated by cognitive theories of graph comprehension, this study systematically manipulated characteristics of a line graph description task in a speaking test in ways to mitigate the influence of graph familiarity, a potential source of construct-irrelevant variance. It extends Xi (2005), which found that the differences in holistic scores on…
Descriptors: Familiarity, Graphs, Scoring, Task Analysis
Xi, Xiaoming – ETS Research Report Series, 2008
Although the primary use of the speaking section of the Test of English as a Foreign Language™ Internet-based test (TOEFL® iBT Speaking test) is to inform admissions decisions at English medium universities, it may also be useful as an initial screening measure for international teaching assistants (ITAs). This study provides criterion-related…
Descriptors: Test Validity, Criterion Referenced Tests, English (Second Language), Language Tests
Xi, Xiaoming; Mollaun, Pam – ETS Research Report Series, 2006
This study explores the utility of analytic scoring for the TOEFL® Academic Speaking Test (TAST) in providing useful and reliable diagnostic information in three aspects of candidates' performance: delivery, language use, and topic development. G studies were used to investigate the dependability of the analytic scores, the distinctness of the…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Oral Language