Livingston, Samuel A. – Journal of Educational Measurement, 1972
This article is a reply to a previous paper (see TM 500 488) interpreting Livingston's original article (see TM 500 487). (CK)
Descriptors: Criterion Referenced Tests, Error of Measurement, Norm Referenced Tests, Test Construction

Schrock, Timothy J.; Mueller, Daniel J. – Journal of Educational Research, 1982
Three item-construction principles for multiple-choice tests were studied to determine how they affected test results for high school students: (1) use of an incomplete sentence stem; (2) location of the blank in the stem; and (3) presence of noncueing material. Differences in item construction had a slight effect on test results. (Authors/CJ)
Descriptors: Cues, High School Students, High Schools, Item Analysis

Thornburg, Hershel – High School Journal, 1981
This article discusses how to develop a conceptual framework for health education on which a needs assessment instrument can be based; how to construct the actual instrument; and how to interpret the cognitive, affective, and behavioral data gathered for curriculum decision making. (Author/SJL)
Descriptors: Administrator Guides, Curriculum Development, Health Education, Needs Assessment

Burton, Nancy – Educational Measurement: Issues and Practice, 1996
The effects of recent changes to the Scholastic Assessment Tests (SAT) on mathematics performance are being studied using data from 1993 and later. Early results show a relative gain for women in the verbal area but not in mathematics. Expected trends, including an effect from increased calculator use, are discussed. (SLD)
Descriptors: Achievement Gains, College Entrance Examinations, Mathematics Achievement, Performance Factors

Yen, Wendy M. – Educational Measurement: Issues and Practice, 1997
The accuracy of statistics based on performance assessments that represent percentages of students reaching standards is explored using data from a large-scale performance assessment, the Maryland School Performance Assessment Program. Results with students in grades 3, 5, and 8 support the accuracy of pooling results to produce the statistics.…
Descriptors: Achievement Tests, Elementary Education, Error of Measurement, Performance Based Assessment

Sukigara, Masune – Educational and Psychological Measurement, 1996
The New Japanese version of the Minnesota Multiphasic Personality Inventory (MMPI) was administered twice to 200 Japanese female college students to verify the equivalence of the computer- and booklet-administered formats. For four scales, scores from the computer version were statistically significantly higher than those from the booklet…
Descriptors: College Students, Computer Assisted Testing, Females, Foreign Countries

Cronbach, Lee J.; And Others – Educational and Psychological Measurement, 1997
Through the standard error, rather than a reliability coefficient, generalizability theory provides an indicator of the uncertainty attached to school and individual scores on performance assessments. Recommendations are made to apply generalizability theory to current performance assessments, emphasizing practices that differ from usual…
Descriptors: Academic Achievement, Error of Measurement, Generalizability Theory, Performance Based Assessment

Linn, Robert L.; Burton, Elizabeth – Educational Measurement: Issues and Practice, 1994
Generalizability of performance-based assessment scores across raters and tasks is examined, focusing on implications of generalizability analyses for specific uses and interpretations of assessment results. Although it seems probable that assessment conditions, task characteristics, and interactions with instructional experiences affect the…
Descriptors: Educational Assessment, Educational Experience, Generalizability Theory, Interaction

Blaney, Paul H.; Cox, Charles L. – Journal of Personality Assessment, 1975
This study assesses the viability of the rating approach compared to the forced choice approach by use of the Activity Preference Questionnaire. (DEP)
Descriptors: Anxiety, College Students, Forced Choice Technique, Higher Education

Haenn, Joseph F.; And Others – 1984
The Maryland State Department of Education issued a request for proposals to develop a score reporting system for the Maryland Functional Testing Program. RMC Research Corporation conducted a review of the extant literature and developed a national survey of non-norm-referenced test score reporting practices. This comprehensive analysis of…
Descriptors: Criterion Referenced Tests, Information Utilization, Minimum Competency Testing, Scores

Kastrinos, William; Erk, Frank C. – American Biology Teacher, 1974
Describes the structure of the Advanced Placement (AP) examination in biology, and presents in entirety the objective portion of the examination administered in May 1970. (JR)
Descriptors: Advanced Placement Programs, Biology, Evaluation, Multiple Choice Tests

Pratt, Harold; And Others – 1980
Recently the framework of the Concerns-Based Adoption Model was used to guide the implementation of a revised elementary science program in grades 3-6 in the Jefferson County (Colorado) Public Schools. This paper explores how student achievement on a locally developed, criterion-referenced test is related to the length of time the revised science…
Descriptors: Academic Achievement, Curriculum Development, Elementary Education, Elementary School Science

Western Michigan Univ., Kalamazoo. School of Education. – 1977
When survey data are analyzed by computer, a codebook is necessary to show how raw scores are translated to computer-usable form. When data are tabulated by hand, the need for a codebook depends upon the number of respondents, the number and complexity of items, and the types of statistical manipulations required. During the process of coding,…
Descriptors: Codification, Data Processing, Input Output, Item Analysis

Karma, Kai – 1975
This report is the second part of a study designed to construct a test for measuring musical aptitude of persons from various age groups. It covers the construction of the test, material, item analysis, reliability, validity, and possible future steps. The test is composed of musical recordings, determined from pilot studies, that the test groups…
Descriptors: Aptitude, Aptitude Tests, Comparative Testing, Fine Arts

Padia, William L. – 1975
Model identification of time-series data is essential to valid statistical tests of intervention effects. Model identification is, at best, inexact in the social and behavioral sciences where one is often confronted with small numbers of observations. These problems are discussed, and the results of independent identifications of 130 social and…
Descriptors: Evaluation Methods, Identification, Item Analysis, Mathematical Models