Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 5 |
Descriptor
Item Analysis | 51 |
Test Construction | 51 |
Test Results | 51 |
Test Reliability | 17 |
Multiple Choice Tests | 14 |
Test Items | 14 |
Test Validity | 14 |
Test Interpretation | 13 |
Achievement Tests | 12 |
Research Reports | 9 |
Standardized Tests | 9 |
More ▼ |
Source
Author
Publication Type
Education Level
Elementary Education | 2 |
Elementary Secondary Education | 2 |
Higher Education | 2 |
Postsecondary Education | 2 |
Adult Education | 1 |
Grade 6 | 1 |
High Schools | 1 |
Secondary Education | 1 |
Audience
Administrators | 1 |
Counselors | 1 |
Policymakers | 1 |
Teachers | 1 |
Location
Canada | 2 |
Australia | 1 |
California | 1 |
Connecticut | 1 |
Europe | 1 |
Florida | 1 |
Israel | 1 |
Thailand | 1 |
United Kingdom (England) | 1 |
West Germany | 1 |
Laws, Policies, & Programs
Education Amendments 1974 | 1 |
National Defense Education Act | 1 |
Assessments and Surveys
Iowa Tests of Basic Skills | 2 |
College Level Academic Skills… | 1 |
National Assessment of… | 1 |
Peabody Individual… | 1 |
Praxis Series | 1 |
Program for International… | 1 |
SAT (College Admission Test) | 1 |
Stanford Achievement Tests | 1 |
What Works Clearinghouse Rating
Cheewasukthaworn, Kanchana – PASAA: Journal of Language Teaching and Learning in Thailand, 2022
In 2016, the Office of the Higher Education Commission issued a directive requiring all higher education institutions in Thailand to have their students take a standardized English proficiency test. According to the directive, the test's results had to align with the Common European Framework of Reference for Languages (CEFR). In response to this…
Descriptors: Test Construction, Standardized Tests, Language Tests, English (Second Language)
Moses, Tim; Liu, Jinghua; Tan, Adele; Deng, Weiling; Dorans, Neil J. – ETS Research Report Series, 2013
In this study, differential item functioning (DIF) methods utilizing 14 different matching variables were applied to assess DIF in the constructed-response (CR) items from 6 forms of 3 mixed-format tests. Results suggested that the methods might produce distinct patterns of DIF results for different tests and testing programs, in that the DIF…
Descriptors: Test Construction, Multiple Choice Tests, Test Items, Item Analysis
Monk, Janice J.; Stallings, William M. – J Educ Res, 1970
Descriptors: Item Analysis, Test Construction, Test Results
Simon, George B. – J Educ Meas, 1969
Descriptors: Item Analysis, Measurement Instruments, Test Construction, Test Results
Lee, Saekyun H.; Han, Hyunjoo – Applied Language Learning, 2007
This study investigated some issues regarding the validity of the Scholastic Achievement Test (SAT) Subject Test: Korean with Listening. The SAT Korean has been administered just once a year since its inception in 1997. As of March 2006, it had been administered nine times. However, SAT foreign language tests are not as rigorously researched as…
Descriptors: Test Results, Second Language Learning, Language Tests, Academic Achievement
US Citizenship and Immigration Services, 2008
"Naturalization Test Redesign Project: Civics Item Selection Analysis" provides an overview of the development of content items for the U.S. history and government (civics) portion of the redesigned naturalization test. This document also reviews the process used to gather and analyze data from multiple studies to determine which civics…
Descriptors: History, Test Items, Citizenship, Individual Testing
Marso, Ronald N. – J Educ Meas, 1970
It was found test item arrangement does not significantly influence test results. (CK)
Descriptors: Hypothesis Testing, Item Analysis, Test Construction, Test Results
Shoemaker, David M. – J Exp Educ, 1970
Descriptors: Analysis of Variance, Educational Research, Item Analysis, Test Construction
Schmeiser, Cynthia Board; Whitney, Douglas R. – 1973
Violations of four selected principles of writing multiple-choice items were introduced into an undergraduate religion course mid-term examination. Three of the flaws significantly increased test difficulty. KR-sub-20 values were lower for all of the tests containing the flawed items than for the "good" versions of the items but significantly so…
Descriptors: Item Analysis, Multiple Choice Tests, Research Reports, Test Construction
Montague, Margariete A. – 1972
This study investigated the feasibility of concurrently and randomly sampling examinees and items in order to estimate group achievement. Seven 32-item tests reflecting a 640-item universe of simple open sentences were used such that item selection (random, systematic) and assignment (random, systematic) of items (four, eight, sixteen) to forms…
Descriptors: Analysis of Variance, Elementary School Students, Group Testing, Item Analysis

Board, Cynthia; Whitney, Douglas R. – Journal of Educational Measurement, 1972
For the principles studied here, poor item-writing practices serve to obscure (or attentuate) differences between good and poor students. (Authors)
Descriptors: College Students, Item Analysis, Multiple Choice Tests, Test Construction
Hills, John R. – 1984
The literature on item bias, i.e., the question of whether some items in tests favor one cultural group over another cultural group due to irrelevant factors, is reviewed and evaluated. All known references through 1981 are described including a large number of unpublished reports. Each method is described and the criticisms that have appeared in…
Descriptors: Evaluation Methods, Item Analysis, Racial Differences, Test Bias

Browning, Robert; And Others – Psychology in the Schools, 1979
Effects that item order and basal and ceiling rules have on test means, variances, and internal consistency estimates for the Peabody Individual Achievement Test mathematics and reading recognition subtests were examined. Items on the math and reading recognition subtests were significantly easier or harder than test placements indicated. (Author)
Descriptors: Achievement Tests, Elementary Education, Individual Testing, Item Analysis
Pyrczak, Fred – 1973
The general purpose of this study was to determine the effects of similarities between stems and keyed choices on test difficulty. Unlike previous investigations of this undesirable characteristic of some multiple-choice items, the present study employed items that were unintentionally faulty and samples of examinees who were highly experienced…
Descriptors: Item Analysis, Multiple Choice Tests, Research Reports, Test Construction

Fricke, Reiner; Luhmann, Reinhold – Studies in Educational Evaluation, 1983
On the basis of the characteristics of criterion-referenced tests, the contribution of German research to the development and application of criterion-referenced tests is discussed. (PN)
Descriptors: Criterion Referenced Tests, Item Analysis, Measurement Techniques, Models