Viruru, Radhika – International Journal of Educational Policy, Research, and Practice: Reconceptualizing Childhood Studies, 2006
In this article, the author examines the construction of standardized testing not only as a cultural product, but as an imperialistic product. She both conducts a postcolonial discussion of the critique of testing and examines the ways that children of color are represented in the content. Recognizing first that the increased use of tests is part…
Descriptors: Racial Bias, Testing, Standardized Tests, Content Analysis

Burton, Nancy – Educational Measurement: Issues and Practice, 1996
The effects of recent changes to the Scholastic Assessment Tests (SAT) on mathematics performance are being studied using data from 1993 and later. Early results show a relative gain for women in the verbal area but not in mathematics. Expected trends, including an effect from increased calculator use, are discussed. (SLD)
Descriptors: Achievement Gains, College Entrance Examinations, Mathematics Achievement, Performance Factors

Crocker, Linda – Applied Measurement in Education, 1997
The experience of the National Board for Professional Teaching Standards illustrates how issues of assessing the content representativeness of performance assessment can be addressed to ensure validity for certification procedures. Explores the challenges of collecting validation evidence when expert judgments of content are used. (SLD)
Descriptors: Content Validity, Credentials, Data Collection, Evaluation Methods

Sykes, Robert C.; Fitzpatrick, Anne R. – Journal of Educational Measurement, 1992
Explanations for an observed change in Rasch item parameters ("b" values) from consecutive administrations of a professional licensing examination were investigated. Analysis of covariance indicated that the change was not related to item position or type. It is hypothesized that the change is attributable to shifts in curriculum…
Descriptors: Analysis of Covariance, Change, Curriculum, Higher Education

Haney, Walt; Fowler, Clarke; Wheelock, Anne; Bebell, Damian; Malec, Nicole – Education Policy Analysis Archives, 1999
Using data from state and academic reports, an independent committee of researchers has evaluated the Massachusetts Teacher Tests. Scores are found to be highly unreliable, and the tests are found to contain questionable content. Suspending use of the tests is recommended. (SLD)
Descriptors: Beginning Teachers, Elementary Secondary Education, State Programs, Teacher Evaluation

O'Neil, Timothy; Sireci, Stephen G.; Huff, Kristen L. – Educational Assessment, 2004
Educational tests used for accountability purposes must represent the content domains they purport to measure. When such tests are used to monitor progress over time, the consistency of the test content across years is important for ensuring that observed changes in test scores are due to student achievement rather than to changes in what the test…
Descriptors: Test Items, Cognitive Ability, Test Content, Science Teachers

Wiliam, Dylan – Review of Research in Education, 2010
The idea that validity should be considered a property of inferences, rather than of assessments, has developed slowly over the past century. In early writings about the validity of educational assessments, validity was defined as a property of an assessment. The most common definition was that an assessment was valid to the extent that it…
Descriptors: Educational Assessment, Validity, Inferences, Construct Validity

Allen, Nancy L.; And Others – 1992
Many testing programs include a section of optional questions in addition to mandatory parts of a test. These optional parts of a test are not often truly parallel to one another, and groups of examinees selecting each optional test section are not equivalent to one another. This paper provides a general method based on missing-data methods for…
Descriptors: Comparative Testing, Estimation (Mathematics), Graphs, Scaling

Melnick, Steven A.; Henk, William A. – 1997
This paper compares two methods of establishing content validity, a forced-choice judgmental review and a latent category judgmental review. It also compares content validity evidence with the results of a scale reliability analysis and makes recommendations about the two content validity procedures. Two different groups of graduate students enrolled…
Descriptors: Classification, Comparative Analysis, Content Validity, Graduate Students

Herman, Joan L. – Educational Leadership, 1992
Summarizes research supporting current beliefs in testing, identifies good assessment qualities, and reviews the current knowledge of test design. Standardized tests negatively affect academic program quality. Alternative assessments must be judged by their validity, reliability, consequences, fairness, generalizability, cognitive complexity,…
Descriptors: Accountability, Alternative Assessment, Cost Effectiveness, Educational Testing

Yan, Jin; Huizhong, Yang – Language, Culture and Curriculum, 2006
The College English Test (CET), designed in accordance with the requirements of the National College English Teaching Syllabus and as a result of the need for China's reform and its open-door policy in the 1980s, is the world's largest language test administered nationwide. Owing to its scientific approach, consistent marking, rigorous…
Descriptors: Test Content, Language Tests, Graduates, Educational Change

van der Linden, Wim J.; Reese, Lynda M. – 1997
A model for constrained computerized adaptive testing is proposed in which the information in the test at the ability estimate is maximized subject to a large variety of possible constraints on the contents of the test. At each item-selection step, a full test is first assembled to have maximum information at the current ability estimate fixing…
Descriptors: Ability, Adaptive Testing, Computer Assisted Testing, Computer Simulation

Tambini, Robert F. – 1999
The quality and the effectiveness of the 1992 New Jersey Grade 8 Early Warning Test (NJEWT) are assessed. Standardized tests possess clear advantages for educators, especially in the case of administration and scoring, but there are clear disadvantages as well, including the possibility of bias. Four criteria are applied to the NJEWT: adequacy,…
Descriptors: Achievement Tests, Grade 8, Junior High School Students, Junior High Schools

Chen, Shu-Ying; Ankenman, Robert D. – Journal of Educational Measurement, 2004
The purpose of this study was to compare the effects of four item selection rules--(1) Fisher information (F), (2) Fisher information with a posterior distribution (FP), (3) Kullback-Leibler information with a posterior distribution (KP), and (4) completely randomized item selection (RN)--with respect to the precision of trait estimation and the…
Descriptors: Test Length, Adaptive Testing, Computer Assisted Testing, Test Selection

Wu, Yuh-Yin; Guei, I-Fen – 2000
A study was conducted to investigate: (1) the relationships between the results from various forms of assessment and the patterns of correlation across content areas; (2) how cognitive components correlate with the test results from different classroom assessments; and (3) how content areas affected the relationships. Data were collected from a…
Descriptors: Cognitive Processes, Cognitive Tests, Correlation, Elementary School Students