ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	6

Descriptor

Language Tests	91
Test Reliability	73
Test Validity	49
Language Proficiency	47
English (Second Language)	43
Second Language Learning	27
Test Construction	26
Foreign Countries	24
Second Language Instruction	23
Interrater Reliability	22
Higher Education	21
Comparative Analysis	18
Testing	17
Rating Scales	16
Oral Language	12
Cloze Procedure	11
Test Format	11
Interviews	10
Scoring	10
Second Languages	10
Spanish	9
Speech Skills	9
Standardized Tests	9
Student Evaluation	9
Test Items	9
More ▼

Source

Online Submission	4
Research-publishing.net	2
Academic Medicine	1
Journal of Communication…	1
Language Learning	1
Language Testing	1

Publication Type

Speeches/Meeting Papers	91
Reports - Research	46
Reports - Evaluative	23
Tests/Questionnaires	8
Reports - Descriptive	6
Opinion Papers	5
Information Analyses	4
Journal Articles	4
Guides - Classroom - Teacher	2
Collected Works - Serials	1
Guides - Non-Classroom	1
Historical Materials	1
More ▼

Education Level

Higher Education	2
Postsecondary Education	2

Audience

Practitioners	7
Teachers	6
Researchers	3
Administrators	1

Location

Japan	5
Australia	3
Netherlands	3
Spain	3
Algeria	1
California	1
Canada	1
Connecticut	1
Cyprus	1
France	1
Ireland	1
Israel	1
Texas	1
United Kingdom (Great Britain)	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

Test of English as a Foreign…	6
International English…	2
Language Assessment Scales	2
Test of English for…	2
ACTFL Oral Proficiency…	1
Alabama High School…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 91 results Save | Export

Testing the Reliability of the New General Service List Test (NGSLT) in Order to Better Evaluate Japanese University Students' Written Receptive Vocabulary Levels

Peer reviewed
PDF on ERIC

Download full text

Nadasdy, Paul; Aizawa, Kazumi; Iso, Tatsuo – Research-publishing.net, 2018

The New General Service List Test (NGSLT) (Stoeckel & Bennett, 2015) was designed as a diagnostic test to measure students' written receptive vocabulary knowledge. This test battery was developed based upon the New General Service List (NGSL) (Browne, 2013), which makes it appealing to teachers in Japan, and especially those who see vocabulary…

Descriptors: Test Reliability, Receptive Language, Vocabulary, Language Tests

Developing a Vocabulary Size Test Measuring Two Aspects of Receptive Vocabulary Knowledge: Visual versus Aural

Peer reviewed
PDF on ERIC

Download full text

Aizawa, Kazumi; Iso, Tatsuo; Nadasdy, Paul – Research-publishing.net, 2017

Testing learners' English proficiency is central to university English classes in Japan. This study developed and implemented a set of parallel online receptive aural and visual vocabulary tests that would predict learners' English proficiency. The tests shared the same target words and choices--the main difference was the presentation of the…

Descriptors: Receptive Language, English (Second Language), Second Language Learning, Word Frequency

Designing a VOIP Based Language Test

Download full text

Garcia Laborda, Jesus; Magal Royo, Teresa; Otero de Juan, Nuria; Gimenez Lopez, Jose L. – Online Submission, 2015

Assessing speaking is one of the most difficult tasks in computer based language testing. Many countries all over the world face the need to implement standardized language tests where speaking tasks are commonly included. However, a number of problems make them rather impractical such as the costs, the personnel involved, the length of time for…

Descriptors: Test Construction, Telecommunications, Computer Mediated Communication, Computer Assisted Testing

Second Language Assessment for Classroom Teachers

Download full text

Tran, Thu H. – Online Submission, 2012

The vast majority of second language teachers feels confident about their instructional performance and does not usually have much difficulty with their teaching thanks to their professional training and accumulated classroom experience. Nonetheless, many second language teachers may not have received sufficient training in test development to…

Descriptors: Second Language Instruction, Language Tests, Test Construction, Test Validity

A Generalizability Investigation of Cognitive Demand and Rigor Ratings of Items and Standards in an Alignment Study

Download full text

Lombardi, Allison; Seburn, Mary; Conley, David; Snow, Eric – Online Submission, 2010

In alignment studies, expert raters evaluate assessment items against standards and ratings are used to compute various alignment indices. Questions about rater reliability, however, are often ignored or inadequately addressed. This paper reports the results of a generalizability theory study of cognitive demand and rigor ratings of assessment…

Descriptors: Generalizability Theory, Test Items, College Entrance Examinations, Readiness

Lecture:"Where We Have Been and Where We Should Go"

Peer reviewed

Direct link

Stansfield, Charles W. – Language Testing, 2008

In this speech, the author covers a lot of ground. In the first half of his speech, the author gives a brief summary of the last 40 years of the history of language testing, from his perspective. The author reviews these years more or less by decade. Additionally, he discusses the evolution of the profession of language testing during this period,…

Descriptors: History, Testing, Language Tests, Role

Who Is To Judge How Well Others Speak? An Experiment with the ACTFL/ETS Oral Proficiency Scale.

Download full text

Barnwell, David – 1986

A study examined inter-rater reliability on the American Council on the Teaching of Foreign Languages/Educational Testing Service (ACTFL/ETS) oral language proficiency rating scale. Seven raters, all elementary or intermediate college Spanish teachers given only brief formal training in the use of the scale, evaluated recorded interviews with…

Descriptors: College Faculty, Higher Education, Interrater Reliability, Language Teachers

Investigating Variability in Tasks and Rater Judgments in a Performance Test of Foreign Language Speaking.

Download full text

Bachman, Lyle F.; And Others – 1993

This paper outlines the development of a performance assessment measure of language speaking ability, the Language Ability Assessment System (LAAS), which is highly reliable and can be examined for reliability through modern measurement theories, such as generalizability theory (G-theory) and the many-facet Rasch theory. LAAS was developed to…

Descriptors: College Students, Higher Education, Interrater Reliability, Language Proficiency

Exploring Rater Behaviour with Rasch Techniques.

Download full text

McNamara, T. F.; Adams, R. J. – 1991

A preliminary study is reported of the use of new multifaceted Rasch measurement mechanisms for investigating rater characteristics in language testing. Ratings from four judges of scripts from 50 candidates taking the International English Language Testing System test, a test of English for Academic Purposes, are analyzed. The analysis…

Descriptors: Comparative Analysis, English (Second Language), Foreign Countries, Interrater Reliability

Characteristics of the Test Components of the IELTS Battery: Australian Trial Data.

Download full text

Griffin, Patrick – 1990

Results of the International English Language Testing System (IELTS) battery trials in Australia are reported. The IELTS tests of productive language skills use direct assessment strategies and subjective scoring according to detailed guidelines. The receptive skills tests use indirect assessment strategies and clerical scoring procedures.…

Descriptors: English (Second Language), Foreign Countries, Grammar, Interrater Reliability

Sampling Reliability in Spontaneous Language Sampling.

Fujiki, Martin; Brinton, Bonnie – 1985

To determine how many occurrences of a syntactic structure are necessary to provide sampling reliability, two one-half hour spontaneous language samples were elicited from each of 15 language disordered students (5 to 6 years old). Sessions were divided into two periods, one for the child's telling about pictures and toys and the other for…

Descriptors: Language Handicaps, Language Tests, Sampling, Syntax

Sources of Variability in the Angoff Standard-Setting Process.

Download full text

Halpin, Glennelle; McLean, James E. – 1991

Although the standard-setting method of W. H. Angoff (1971) has broad-based support in the research literature, inconsistencies in the resulting standards do occur. Sources of these inconsistencies are examined in a study of judges, competencies (items), rounds (replications), and the interactions among them. A modified Angoff approach was used to…

Descriptors: Analysis of Variance, Error of Measurement, Evaluators, High Schools

The Stability of Oral Proficiency Assessment on the Oral Interview Testing Procedures.

Peer reviewed

Shohamy, Elana – Language Learning, 1983

Reports on study in which students of Hebrew as a second language took four versions of oral proficiency test. Results indicate that different speech styles and topics significantly affected students' scores, and correlational analyses between pairs pointed to low reliability and lack of stability of the test. Urges caution in making decisions…

Descriptors: Hebrew, Interviews, Language Proficiency, Language Tests

Decision Dependability of Subtests, Tests, and the Overall TOEFL Test Battery.

Download full text

Brown, James Dean; Ross, Jacqueline A. – 1993

This study investigates the Test of English as a Foreign Language (TOEFL), in particular the relative contributions to score dependability (analogous to classical theory reliability) of various numbers of items and subtests as well as the decision dependability at different cut points. Research questions that apply to the overall TOEFL battery and…

Descriptors: English (Second Language), Language Tests, Statistical Analysis, Test Reliability

Of English Marks and American Reviewers.

Download full text

Spolsky, Bernard – 1990

A discussion of the differences between the Test of English as a Foreign Language (TOEFL), an American test battery, and the Cambridge English Examinations (Cambridge), a British battery, focuses on the different approaches to language test development embodied in the tests as the source of difficulty in translating between them for individual…

Descriptors: Comparative Analysis, Cultural Differences, English (Second Language), Foreign Countries

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7

Stansfield, Charles W.	4
Brown, James Dean	3
Aizawa, Kazumi	2
Ingram, D. E.	2
Iso, Tatsuo	2
Nadasdy, Paul	2
Oller, John W., Jr.	2
Rice, William K., Jr.	2
Ross, Steven	2
Spolsky, Bernard	2
Templin, Stephen A.	2
Aaronson, May	1
Adams, R. J.	1
Alderson, J. Charles	1
Amado, Alfred J.	1
Bachman, Lyle F.	1
Barnwell, David	1
Barnwell, David Patrick	1
Berkoff, Nelson A.	1
Bobie, Allen	1
Bordie, John G.	1
Brindley, Geoff	1
Brinton, Bonnie	1
Buell, James G.	1
More ▼