Showing all 12 results
Peer reviewed
PDF full text available on ERIC
Xu, Jinfen; Li, Changying – Studies in Second Language Learning and Teaching, 2022
This study investigates how the timing of form-focused instruction (FFI) affects English as a foreign language (EFL) learners' grammar development. A total of 169 Chinese middle school learners were randomly assigned to four conditions: control, before-isolated FFI, integrated FFI, and after-isolated FFI. The three experimental groups received…
Descriptors: Intervention, Grammar, English (Second Language), Second Language Learning
Peer reviewed
PDF full text available on ERIC
Sasayama, Shoko; Garcia Gomez, Pablo; Norris, John M. – ETS Research Report Series, 2021
This report describes the development of efficient second language (L2) writing assessment tasks designed specifically for low-proficiency learners of English, to be included in the TOEFL® Essentials™ test. Based on the can-do descriptors of the Common European Framework of Reference for Languages for the A1 through B1 levels of…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Writing Tests
Peer reviewed
Direct link
Terao, Takahiro; Ishii, Hidetoki – SAGE Open, 2020
This study compared distractor (incorrect option) selection patterns across levels of test-taker proficiency on a task requiring Japanese students to summarize an English paragraph. Participants were 414 undergraduate students, and the test comprised three summarization process types--deletion, generalization, and integration.…
Descriptors: Comparative Analysis, English (Second Language), Second Language Instruction, Second Language Learning
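The comparison described in the entry above amounts to a cross-tabulation of option choice by proficiency band. The Python sketch below is a generic illustration with invented data and column names; it is not the authors' data or analysis code.

```python
import numpy as np
import pandas as pd

# Invented item-level data: each row is one examinee's total score and the
# option (A-D) chosen on one summarization item. Not the study's data.
rng = np.random.default_rng(0)
total_score = rng.integers(10, 51, size=300)
choice = rng.choice(list("ABCD"), size=300, p=[0.4, 0.25, 0.2, 0.15])
df = pd.DataFrame({"total_score": total_score, "item_choice": choice})

# Split examinees into proficiency bands at the total-score terciles.
df["band"] = pd.qcut(df["total_score"], q=3, labels=["low", "mid", "high"])

# Proportion of each band choosing each option (rows sum to 1).
selection_rates = pd.crosstab(df["band"], df["item_choice"], normalize="index").round(3)
print(selection_rates)
```

A distractor that attracts mid-proficiency examinees more than low-proficiency ones, for instance, shows up as a non-monotonic column in this table.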
Peer reviewed
PDF full text available on ERIC
Jahangard, Ali – MEXTESOL Journal, 2022
One of the most interesting studies on the role of L1 and contrastive analysis in vocabulary teaching is by Laufer and Girsai (2008). However, due to some methodological issues, their research findings are open to criticism and controversy. The current study aimed to replicate the research with a more rigorous design to re-investigate the…
Descriptors: Grammar, Vocabulary Development, Second Language Learning, Second Language Instruction
Peer reviewed
Direct link
Sato, Takanori; Ikeda, Naoki – Language Testing in Asia, 2015
Background: High-stakes tests exert a strong washback effect on the content of student learning. However, if students fail to recognize the abilities that the test developers intend to measure, they are less likely to learn what the developers wish them to learn. This study aims to investigate test-taker…
Descriptors: High Stakes Tests, Testing Problems, Test Items, College Students
Peer reviewed
Direct link
Engelhard, George, Jr.; Kobrin, Jennifer L.; Wind, Stefanie A. – International Journal of Testing, 2014
The purpose of this study is to explore patterns in model-data fit related to subgroups of test takers from a large-scale writing assessment. Using data from the SAT, a calibration group was randomly selected to represent test takers who reported that English was their best language, from the total population of test takers (N = 322,011). A…
Descriptors: College Entrance Examinations, Writing Tests, Goodness of Fit, English
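Model-data fit analyses of the kind described in the entry above are often carried out with Rasch-family models; that framing is an assumption here, not something stated in the truncated abstract. The sketch below is only a generic illustration of person-level fit statistics for dichotomous responses under a Rasch model with assumed parameters, not the authors' specification or code.

```python
import numpy as np

def rasch_fit_statistics(responses, theta, beta):
    """Outfit and infit mean squares for a persons x items 0/1 matrix,
    given person abilities (theta) and item difficulties (beta)."""
    # Model-implied probability of a correct response.
    p = 1.0 / (1.0 + np.exp(-(theta[:, None] - beta[None, :])))
    resid = responses - p                  # raw residuals
    var = p * (1.0 - p)                    # binomial variance per response
    z2 = resid**2 / var                    # squared standardized residuals
    outfit = z2.mean(axis=1)               # unweighted mean square per person
    infit = (resid**2).sum(axis=1) / var.sum(axis=1)  # information-weighted
    return outfit, infit

# Toy example: 5 persons, 4 items (parameters invented, not from the study).
rng = np.random.default_rng(0)
theta = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])
beta = np.array([-0.8, -0.2, 0.3, 0.9])
p = 1.0 / (1.0 + np.exp(-(theta[:, None] - beta[None, :])))
x = (rng.random(p.shape) < p).astype(float)
print(rasch_fit_statistics(x, theta, beta))
```

Values near 1.0 indicate responses consistent with the model; comparing the distribution of such statistics across subgroups is one common way to look for differential model-data fit.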
Peer reviewed
PDF full text available on ERIC
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing e-rater® automated essay scoring in a high-stakes, large-scale English language testing program. We examined the effectiveness of a generic scoring approach and two variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
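Agreement between automated and human scores, the dimension named just before the abstract is cut off, is commonly summarized with exact and adjacent agreement rates, correlation, quadratically weighted kappa, and a standardized mean score difference. The sketch below computes these generic statistics for two invented score vectors; it does not reproduce the specific evaluation criteria or thresholds used in the report.

```python
import numpy as np
from sklearn.metrics import cohen_kappa_score

def agreement_summary(human, machine):
    human, machine = np.asarray(human), np.asarray(machine)
    diff = np.abs(human - machine)
    return {
        "exact_agreement": float((diff == 0).mean()),
        "adjacent_agreement": float((diff <= 1).mean()),
        "pearson_r": float(np.corrcoef(human, machine)[0, 1]),
        "quadratic_weighted_kappa": float(
            cohen_kappa_score(human, machine, weights="quadratic")
        ),
        # One common operationalization: machine-minus-human mean difference
        # scaled by the human-score standard deviation.
        "std_mean_difference": float(
            (machine.mean() - human.mean()) / human.std(ddof=1)
        ),
    }

# Invented scores on a 1-6 rubric, purely for demonstration.
human = [4, 3, 5, 2, 4, 3, 5, 4]
machine = [4, 3, 4, 2, 5, 3, 5, 3]
print(agreement_summary(human, machine))
```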
Peer reviewed
Direct link
Bunch, Michael B. – Language Testing, 2011
Title III of Public Law 107-110 (No Child Left Behind; NCLB) provided for the creation of assessments of English language learners (ELLs) and established, through the Enhanced Assessment Grant program, a platform from which four consortia of states developed ELL tests aligned to rigorous statewide content standards. Those four tests (ACCESS for ELLs,…
Descriptors: Test Items, Student Evaluation, Federal Legislation, Formative Evaluation
Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002
This paper reports the results of generalizability theory (G-theory) analyses conducted for new writing and speaking tasks for the Test of English as a Foreign Language (TOEFL). For the writing tasks, a special focus was placed on evaluating how reliability is affected by the number of raters (or ratings) per essay (one or two) and by the number of tasks (one, two, or…
Descriptors: English (Second Language), Generalizability Theory, Reliability, Scores
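In a G-theory decision (D) study of the kind described above, the projected dependability of scores for different numbers of raters and tasks can be computed from estimated variance components. The sketch below illustrates the standard relative G-coefficient formula for a person x task x rater design using invented variance components; it does not use the TOEFL estimates from this study.

```python
# Projected relative generalizability coefficient for a p x t x r design,
# varying the number of tasks and raters. Variance components are invented
# for illustration; a real D study would estimate them from the data.
def g_coefficient(var_p, var_pt, var_pr, var_ptr, n_tasks, n_raters):
    relative_error = (
        var_pt / n_tasks
        + var_pr / n_raters
        + var_ptr / (n_tasks * n_raters)
    )
    return var_p / (var_p + relative_error)

components = dict(var_p=0.40, var_pt=0.20, var_pr=0.05, var_ptr=0.30)

for n_tasks in (1, 2, 3):
    for n_raters in (1, 2):
        g = g_coefficient(**components, n_tasks=n_tasks, n_raters=n_raters)
        print(f"tasks={n_tasks} raters={n_raters}  G={g:.2f}")
```

With components like these, adding a second task typically buys more dependability than adding a second rater, which is the kind of trade-off such analyses are designed to quantify.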
Hendrickson, Amy; Patterson, Brian; Melican, Gerald – College Board, 2008
Presented at the annual meeting of the National Council on Measurement in Education (NCME) in New York in March 2008. This presentation explores how different item weightings can affect the effective weights, validity coefficients, and test reliability of composite scores among test takers.
Descriptors: Multiple Choice Tests, Test Format, Test Validity, Test Reliability
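The contrast between nominal and effective weights explored in the presentation above can be made concrete with a small computation: a component's effective weight depends on its weighted covariance with the composite, not only on the weight assigned to it, and composite reliability follows from the weighted error variances. The sketch below uses an invented covariance matrix, weights, and section reliabilities, not the College Board data.

```python
import numpy as np

# Invented covariance matrix for three test sections and a set of nominal weights.
cov = np.array([
    [1.00, 0.60, 0.50],
    [0.60, 1.20, 0.55],
    [0.50, 0.55, 0.90],
])
w = np.array([0.5, 0.3, 0.2])        # nominal weights
rel = np.array([0.85, 0.80, 0.75])   # assumed section reliabilities

composite_var = w @ cov @ w

# Effective weight of section i: w_i * Cov(X_i, composite) / Var(composite).
effective = w * (cov @ w) / composite_var

# Reliability of the weighted composite: 1 minus the weighted error variance
# as a share of composite variance.
composite_rel = 1 - (w**2 * np.diag(cov) * (1 - rel)).sum() / composite_var

print("effective weights:", np.round(effective, 3), "sum =", round(float(effective.sum()), 3))
print("composite reliability:", round(float(composite_rel), 3))
```

The effective weights always sum to 1, so a highly variable or highly correlated section can carry far more of the composite than its nominal weight suggests.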
Peer reviewed
PDF full text available on ERIC
Haberman, Shelby J. – ETS Research Report Series, 2004
Statistical and measurement properties are examined for features used in essay assessment to determine the generalizability of the features across populations, prompts, and individuals. Data are employed from TOEFL® and GMAT® examinations and from writing for Criterion.
Descriptors: Language Tests, English (Second Language), Second Language Learning, Business Administration Education
Peer reviewed
PDF full text available on ERIC
Broer, Markus; Lee, Yong-Won; Rizavi, Saba; Powers, Don – ETS Research Report Series, 2005
Three polytomous differential item functioning (DIF) detection techniques--the Mantel test, logistic regression, and polySTAND--were used to identify GRE® Analytical Writing prompts ("Issue" and "Argument") that are differentially difficult for (a) female test takers; (b) African American, Asian, and Hispanic test takers; and (c) test takers whose strongest…
Descriptors: Culture Fair Tests, Item Response Theory, Test Items, Cues
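Of the three procedures named in the entry above, the Mantel test is the most compact to write down: within each level of the matching variable, the focal group's observed sum of item scores is compared with its conditional expectation, and the stratum contributions are pooled into a one-degree-of-freedom chi-square. The sketch below is a from-scratch illustration with invented data; it is not the analysis code used in the ETS report.

```python
import numpy as np
from scipy.stats import chi2

def mantel_dif(scores, group, stratum):
    """Mantel chi-square for polytomous DIF.

    scores  : item scores (e.g., 0-6 essay ratings)
    group   : 0 = reference, 1 = focal
    stratum : matching-variable level for each examinee
    """
    scores = np.asarray(scores, float)
    group = np.asarray(group)
    stratum = np.asarray(stratum)

    obs = exp = var = 0.0
    for k in np.unique(stratum):
        s, g = scores[stratum == k], group[stratum == k]
        n = len(s)
        n_f = int((g == 1).sum())
        n_r = n - n_f
        if n < 2 or n_f == 0 or n_r == 0:
            continue  # stratum carries no information about group differences
        obs += s[g == 1].sum()                      # focal group's observed score sum
        exp += n_f * s.sum() / n                    # its expectation under no DIF
        var += (n_r * n_f) / (n**2 * (n - 1)) * (n * (s**2).sum() - s.sum()**2)

    stat = (obs - exp) ** 2 / var
    return stat, chi2.sf(stat, df=1)               # 1-df chi-square and p-value

# Tiny invented example: 0-3 item scores matched on a coarse score band,
# with the focal group's scores nudged downward to simulate DIF.
rng = np.random.default_rng(1)
n = 400
stratum = rng.integers(0, 4, size=n)
group = rng.integers(0, 2, size=n)
scores = np.clip(stratum + rng.integers(-1, 2, size=n) - group * (rng.random(n) < 0.3), 0, 3)
print(mantel_dif(scores, group, stratum))
```

Logistic-regression DIF and standardization-based procedures such as polySTAND would instead model or tabulate the item score as a function of the matching variable and group membership, but the matching-then-pooling logic is the same.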