Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 22 |
Since 2006 (last 20 years) | 82 |
Descriptor
Test Content | 173 |
Test Construction | 61 |
Test Items | 53 |
Test Validity | 40 |
Test Format | 31 |
Test Reliability | 30 |
Foreign Countries | 29 |
Elementary Secondary Education | 28 |
Scores | 27 |
Test Use | 26 |
Scoring | 25 |
More ▼ |
Source
Author
Sireci, Stephen G. | 4 |
Donovan, Jenny | 3 |
Lennon, Melissa | 3 |
Baker, Eva L. | 2 |
Breithaupt, Krista | 2 |
Cui, Zhongmin | 2 |
Geisinger, Kurt F. | 2 |
Hutton, Penny | 2 |
Kingsbury, G. Gage | 2 |
Kolen, Michael J. | 2 |
LeMahieu, Paul G. | 2 |
More ▼ |
Publication Type
Education Level
Secondary Education | 19 |
High Schools | 18 |
Elementary Secondary Education | 17 |
Higher Education | 15 |
Postsecondary Education | 10 |
Elementary Education | 8 |
Grade 6 | 6 |
Grade 8 | 6 |
Grade 4 | 5 |
Grade 10 | 4 |
Grade 7 | 4 |
More ▼ |
Location
Australia | 6 |
California | 6 |
China | 6 |
United States | 4 |
Canada | 2 |
Delaware | 2 |
France | 2 |
Germany | 2 |
Illinois | 2 |
Japan | 2 |
Pennsylvania | 2 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 4 |
Individuals with Disabilities… | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Sireci, Stephen G.; Parker, Polly – Educational Measurement: Issues and Practice, 2006
The psychometric literature is replete with comprehensive discussions of test validity, test validation, and the characteristics of quality assessment programs. The most authoritative source for guidance regarding sound test development and evaluation practices is the Standards for Educational and Psychological Testing. However, the Standards are…
Descriptors: Psychometrics, Test Validity, Educational Testing, Psychological Testing
McCall, Cecelia – 1989
While the charge that standardized tests are biased is not new, critics (including feminists) recently have made accusations of gender bias. One argument for the superior performance of males on the Scholastic Aptitude Tests (SAT), the Preliminary Scholastic Aptitude Test/National Merit Scholarship Qualifying Test (PSAT/NMSQT), and the American…
Descriptors: Elementary Secondary Education, Reading Achievement, Sex Differences, Test Bias
Thomas, Leslie; Kalohn, John C. – 1996
Test specifications dictate the kind of content that should be included on each form of an examination, and the relative weight that each content domain should contribute to the determination of examinees' test scores by specifying the proportion of items to be included in each content area. This paper addresses a step in the development of…
Descriptors: Job Analysis, Licensing Examinations (Professions), Mathematical Models, Research Methodology
Kromrey, Jeffrey D.; Parshall, Cynthia G.; Yi, Qing – 1998
The effects of anchor test characteristics in the accuracy and precision of test equating in the "common items, nonequivalent groups design" were studied. The study also considered the effects of nonparallel based and new forms on the equating solution, and it investigated the effects of differential weighting on the success of equating…
Descriptors: Equated Scores, High Schools, Item Response Theory, Monte Carlo Methods
Leung, Chi-Keung; Chang, Hua-Hua; Hau, Kit-Tai – 2001
The multistage alpha-stratified computerized adaptive testing (CAT) design advocated a new philosophy of pool management and item selection using low discriminating items first. It has been demonstrated through simulation studies to be effective both in reducing item overlap rate and enhancing pool utilization with certain pool types. Based on…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Selection
The Effects of Content Homogeneity and Equating Method on the Accuracy of Common-Item Test Equating.
Yang, Wen-Ling – 2000
This study investigated whether equating accuracy improves with an anchor test that is more representative of its corresponding total test and whether such content effect depends on the particular equating method used. Scoring outcomes of a professional examination for a medical specialty were used. A total of 1,092 examinees took one form, and…
Descriptors: Equated Scores, Item Response Theory, Licensing Examinations (Professions), Physicians

Werner, Patrice Holden – Journal of Reading, 1991
Reviews the Ennis-Weir Critical Thinking Essay Test. Notes that the test may be used as an informal diagnostic instrument, an evaluation tool for instructional effectiveness, or as material for teaching critical thinking. (RS)
Descriptors: Critical Thinking, Essay Tests, Higher Education, Instructional Effectiveness

Kingsbury, G. Gage; Zara, Anthony R. – Applied Measurement in Education, 1991
This simulation investigated two procedures that reduce differences between paper-and-pencil testing and computerized adaptive testing (CAT) by making CAT content sensitive. Results indicate that the price in terms of additional test items of using constrained CAT for content balancing is much smaller than that of using testlets. (SLD)
Descriptors: Adaptive Testing, Comparative Analysis, Computer Assisted Testing, Computer Simulation
Marrelli, Anne F. – Performance and Instruction, 1995
Discusses the advantages of using multiple choice questions, highlighting the flexibility of using different variations of questions. Item writing guidelines include information on content, sensitivity, difficulty, irrelevant sources of difficulty, order, misleads, avoidance of clues, and exercises in the application of guidelines. (JKP)
Descriptors: Distractors (Tests), Guidelines, Multiple Choice Tests, Questioning Techniques
Powell, Sara Davis – High School Magazine, 1999
Academically sound methods for preparing students for standardized tests include establishing tests' importance, forming preparation teams, gathering information, aligning curricular and test objectives, teaching test-wiseness skills, informing stakeholders, involving students in preparation plans, infusing curriculum with test content, and…
Descriptors: Ethics, High Schools, Standardized Tests, Student Participation
Green, Anthony B.; Weir, Cyril J. – Language Testing, 2004
Studies of placement tests are typically narrowly concerned with their validation as instruments for the efficient grouping of students. They rarely explore the assumption that placement test content can be related to classroom tasks and so inform instructional decisions. This study focuses on a trial version of the Global Placement Test (GPT), a…
Descriptors: Foreign Countries, Test Format, Instructional Materials, Inferences
Benners, G. Anthony; George-Ezzelle, Carol E. – GED Testing Service, 2006
The present investigation was aimed at exploring the stability of the standard score distributions on the GED (General Educational Development) Tests taken by U.S. high school seniors in equating studies conducted by GED Testing Service during the span of 5 years from 2001 (the norming year) to 2005. Three questions were addressed by this…
Descriptors: Testing, High School Seniors, Scores, Measurement Techniques
Solarsh, Barbara; Alant, Erna – Journal of Communication Disorders, 2006
A culturally appropriate test, The Test of Ability To Explain for Zulu-speaking Children (TATE-ZC), was developed to measure verbal problem solving skills of rural, Zulu-speaking, primary school children. Principles of "non-biased" assessment, as well as emic (culture specific) and etic (universal) aspects of intelligence formed the theoretical…
Descriptors: African Languages, Elementary School Students, Culture Fair Tests, Cultural Relevance
Johanson, George A. – 1993
K. K. Waltman and D. A. Frisbie (1994) observed that teachers and parents often interpret grades given to students in both absolute and relative senses. They conclude that this sort of interpretation is illogical and may indicate misunderstandings in several areas. Absolute and relative methods of assigning letter grades are approached from…
Descriptors: Academic Achievement, Grades (Scholastic), Grading, Performance Based Assessment

Mehrens, William A. – Applied Measurement in Education, 1997
This commentary on articles in this special issue generally agrees with the viewpoints expressed, although it argues that in some cases the authors of these articles should have expanded on certain issues. Many comments relate to the legal defensibility of the positions taken. (SLD)
Descriptors: Certification, Decision Making, Licensing Examinations (Professions), Performance Based Assessment