Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 9 |
Descriptor
Source
Author
Koretz, Daniel | 3 |
Somers, Marie-Andree | 2 |
Wong, Edmond | 2 |
Zhu, Pei | 2 |
Allen, Nancy L. | 1 |
Anderson, Lorin W. | 1 |
Bennett, Randy Elliot | 1 |
Breyer, F. Jay | 1 |
Brown, Richard S. | 1 |
Carvajal, Jorge | 1 |
Coughlin, Ed | 1 |
More ▼ |
Publication Type
Education Level
Elementary Secondary Education | 3 |
Grade 4 | 2 |
Grade 6 | 2 |
Grade 8 | 2 |
High Schools | 2 |
Secondary Education | 2 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 12 | 1 |
Grade 3 | 1 |
Grade 5 | 1 |
More ▼ |
Audience
Administrators | 1 |
Practitioners | 1 |
Researchers | 1 |
Teachers | 1 |
Location
Vermont | 4 |
Canada | 3 |
Texas | 2 |
Alaska | 1 |
California | 1 |
New York | 1 |
New York (Albany) | 1 |
New York (Buffalo) | 1 |
New York (New York) | 1 |
New York (Rochester) | 1 |
New York (Syracuse) | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
New York State Education Department, 2015
This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and Stakeholder involvement in those processes. By comparing the intent of the NYSAA with its process and design, the validity of the…
Descriptors: Alternative Assessment, Grade 3, Grade 4, Grade 5
Kettler, Ryan J. – Review of Research in Education, 2015
This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…
Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations
Zhu, Pei; Somers, Marie-Andree; Wong, Edmond – Society for Research on Educational Effectiveness, 2010
For this project, the authors use data from four IES-sponsored randomized studies to examine some of the key issues identified in May et. al. (2009). The first set of questions focuses on issues related to using state tests: (1) Do studies meet the assumptions needed for combining impacts on state tests across grades and/or states?; (2) How…
Descriptors: Academic Achievement, Program Effectiveness, State Standards, Testing Programs
Huang, Jinyan – TESOL Journal, 2011
Using generalizability theory, this study examined both the rating variability and reliability of English as a second language (ESL) students' writing in two provincial examinations in Canada. This article discusses expected and unexpected similarities and differences related to rating variability and reliability between the two testing programs.…
Descriptors: Foreign Countries, Generalizability Theory, Test Reliability, Testing Programs
Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013
In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…
Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests
Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010
This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…
Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores
Somers, Marie-Andree; Zhu, Pei; Wong, Edmond – National Center for Education Evaluation and Regional Assistance, 2011
This study examines the practical implications of using state tests to measure student achievement in impact evaluations that span multiple states and grades. In particular, the study examines the sensitivity of impact findings to (1) the type of assessment used to measured achievement (state tests or an external assessment administered by the…
Descriptors: Evaluators, Grades (Scholastic), Academic Achievement, Program Effectiveness
Ezzelle, Carol; Setzer, J. Carl – GED Testing Service, 2009
This manual was written to provide technical information regarding the 2002 Series GED (General Educational Development) Tests. Throughout this manual, documentation is provided regarding the development of the GED Tests, data collection activities, as well as reliability and validity evidence. The purpose of this manual is to provide evidence…
Descriptors: High School Equivalency Programs, Testing Programs, Test Validity, Test Reliability
Brown, Richard S.; Coughlin, Ed – Regional Educational Laboratory Mid-Atlantic, 2007
This report examines the availability and quality of predictive validity data for a selection of benchmark assessments identified by state and district personnel as in use within Mid-Atlantic Region jurisdictions. Based on a review of practices within the school districts in the region, this report details the benchmark assessments being used, in…
Descriptors: Test Content, Academic Achievement, Predictive Validity, Program Effectiveness
Koretz, Daniel; And Others – 1993
The 1992-93 school year was the second year of the implementation of the Vermont assessment program. Evaluation of the 1991-92 year yielded mixed results, with some evidence that the assessment program was having a strong impact on instruction, but other indications that the reliability of the portfolio scoring in both writing and mathematics was…
Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Evaluation Utilization

Sicoly, Fiore – Applied Measurement in Education, 2002
Calculated year-1 to year-2 stability of assessment data from 21 states and 2 Canadian provinces. The median stability coefficient was 0.78 in mathematics and reading, and lower in writing. A stability coefficient of 0.80 is recommended as the standard for large-scale assessments of student performance. (SLD)
Descriptors: Educational Testing, Elementary Secondary Education, Foreign Countries, Mathematics
Easton, John Q.; Washington, Elois D. – 1982
The effects of students taking different levels of the same standardized achievement test were assessed by administering two levels of the same test to each student. The functional level of the test was taken by all students. The second level of testing was randomly assigned at the adjacent higher or lower level of the test. Functional level…
Descriptors: Elementary Education, Pilot Projects, Reading Achievement, Scores

Reckase, Mark D. – Educational Measurement: Issues and Practice, 1995
An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)
Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models
McKenna, Bernard H. – NJEA Review, 1976
Article presented a true story of how two cities ran testing programs and the lessons that can be learned from their failures. (Editor/RK)
Descriptors: Learning Processes, Scores, Standardized Tests, Student Attitudes
Kadamus, James A. – 2001
This field memo is the third in a series of updates from the Deputy Commissioner of the Office of Elementary, Middle, Secondary and Continuing Education to all teachers and administrators of public and nonpublic schools on standards and assessments in New York. This document explains how the New York State Education Department determines if test…
Descriptors: Decision Making, Educational Testing, Elementary Secondary Education, Reliability