ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	9

Descriptor

Scores	32
Testing Programs	32
Test Reliability	24
State Programs	16
Test Validity	14
Elementary Secondary Education	11
Achievement Tests	10
Educational Assessment	9
Academic Achievement	7
Scoring	7
Test Construction	7
Evaluation Methods	6
Performance Based Assessment	6
Standardized Tests	6
Comparative Analysis	5
Mathematics	5
Portfolios (Background…	5
Reliability	5
State Standards	5
Test Results	5
Test Use	5
Educational Testing	4
Foreign Countries	4
Interrater Reliability	4
Item Analysis	4
More ▼

Source

Applied Measurement in…	3
ETS Research Report Series	1
Education Policy Analysis…	1
Educational Measurement:…	1
Educational and Psychological…	1
GED Testing Service	1
Journal of Personnel…	1
NJEA Review	1
National Center for Education…	1
New York State Education…	1
Online Submission	1
Regional Educational…	1
Review of Research in…	1
Society for Research on…	1
TESOL Journal	1
More ▼

Publication Type

Reports - Research	14
Reports - Evaluative	11
Journal Articles	10
Speeches/Meeting Papers	5
Numerical/Quantitative Data	3
Reports - Descriptive	3
Tests/Questionnaires	2
Guides - General	1
Guides - Non-Classroom	1
Information Analyses	1

Education Level

Elementary Secondary Education	3
Grade 4	2
Grade 6	2
Grade 8	2
High Schools	2
Secondary Education	2
Early Childhood Education	1
Elementary Education	1
Grade 12	1
Grade 3	1
Grade 5	1
Grade 7	1
High School Equivalency…	1
Intermediate Grades	1
Junior High Schools	1
Middle Schools	1
Primary Education	1
More ▼

Audience

Administrators	1
Practitioners	1
Researchers	1
Teachers	1

Location

Vermont	4
Canada	3
Texas	2
Alaska	1
California	1
New York	1
New York (Albany)	1
New York (Buffalo)	1
New York (New York)	1
New York (Rochester)	1
New York (Syracuse)	1
United States	1
More ▼

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
SAT (College Admission Test)	2
California Achievement Tests	1
Comprehensive Tests of Basic…	1
General Educational…	1
Iowa Tests of Basic Skills	1
Metropolitan Achievement Tests	1
North Carolina End of Course…	1
SRA Achievement Series	1
Sequential Tests of…	1
Texas Assessment of Academic…	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 32 results Save | Export

New York State Alternate Assessment Technical Report, 2014-15

Download full text

New York State Education Department, 2015

This technical report provides an overview of the New York State Alternate Assessment (NYSAA), including a description of the purpose of the NYSAA, the processes utilized to develop and implement the NYSAA program, and Stakeholder involvement in those processes. By comparing the intent of the NYSAA with its process and design, the validity of the…

Descriptors: Alternative Assessment, Grade 3, Grade 4, Grade 5

Adaptations and Access to Assessment of Common Core Content

Peer reviewed

Direct link

Kettler, Ryan J. – Review of Research in Education, 2015

This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…

Descriptors: Meta Analysis, Literature Reviews, Testing, Testing Accommodations

Using State Tests vs. Study-Administered Tests to Measure Student Achievement: An Empirical Assessment Based on Four Recent Randomized Evaluations of Educational Interventions

Download full text

Zhu, Pei; Somers, Marie-Andree; Wong, Edmond – Society for Research on Educational Effectiveness, 2010

For this project, the authors use data from four IES-sponsored randomized studies to examine some of the key issues identified in May et. al. (2009). The first set of questions focuses on issues related to using state tests: (1) Do studies meet the assumptions needed for combining impacts on state tests across grades and/or states?; (2) How…

Descriptors: Academic Achievement, Program Effectiveness, State Standards, Testing Programs

Generalizability Theory as Evidence of Concerns about Fairness in Large-Scale ESL Writing Assessments

Peer reviewed

Direct link

Huang, Jinyan – TESOL Journal, 2011

Using generalizability theory, this study examined both the rating variability and reliability of English as a second language (ESL) students' writing in two provincial examinations in Canada. This article discusses expected and unexpected similarities and differences related to rating variability and reliability between the two testing programs.…

Descriptors: Foreign Countries, Generalizability Theory, Test Reliability, Testing Programs

Investigating the Suitability of Implementing the "e-rater"® Scoring Engine in a Large-Scale English Language Testing Program. Research Report. ETS RR-13-36

Peer reviewed
PDF on ERIC

Download full text

Zhang, Mo; Breyer, F. Jay; Lorenz, Florian – ETS Research Report Series, 2013

In this research, we investigated the suitability of implementing "e-rater"® automated essay scoring in a high-stakes large-scale English language testing program. We examined the effectiveness of generic scoring and 2 variants of prompt-based scoring approaches. Effectiveness was evaluated on a number of dimensions, including agreement…

Descriptors: Computer Assisted Testing, Computer Software, Scoring, Language Tests

A Comparison of Approaches for Improving the Reliability of Objective Level Scores

Peer reviewed

Direct link

Skorupski, William P.; Carvajal, Jorge – Educational and Psychological Measurement, 2010

This study is an evaluation of the psychometric issues associated with estimating objective level scores, often referred to as "subscores." The article begins by introducing the concepts of reliability and validity for subscores from statewide achievement tests. These issues are discussed with reference to popular scaling techniques, classical…

Descriptors: Testing Programs, Test Validity, Achievement Tests, Scores

Whether and How to Use State Tests to Measure Student Achievement in a Multi-State Randomized Experiment: An Empirical Assessment Based on Four Recent Evaluations. NCEE 2012-4015

Peer reviewed
PDF on ERIC

Download full text

Somers, Marie-Andree; Zhu, Pei; Wong, Edmond – National Center for Education Evaluation and Regional Assistance, 2011

This study examines the practical implications of using state tests to measure student achievement in impact evaluations that span multiple states and grades. In particular, the study examines the sensitivity of impact findings to (1) the type of assessment used to measured achievement (state tests or an external assessment administered by the…

Descriptors: Evaluators, Grades (Scholastic), Academic Achievement, Program Effectiveness

Technical Manual: 2002 Series GED Tests

Download full text

Ezzelle, Carol; Setzer, J. Carl – GED Testing Service, 2009

This manual was written to provide technical information regarding the 2002 Series GED (General Educational Development) Tests. Throughout this manual, documentation is provided regarding the development of the GED Tests, data collection activities, as well as reliability and validity evidence. The purpose of this manual is to provide evidence…

Descriptors: High School Equivalency Programs, Testing Programs, Test Validity, Test Reliability

The Predictive Validity of Selected Benchmark Assessments Used in the Mid-Atlantic Region. Issues & Answers. REL 2007-No. 017

Peer reviewed
PDF on ERIC

Download full text

Brown, Richard S.; Coughlin, Ed – Regional Educational Laboratory Mid-Atlantic, 2007

This report examines the availability and quality of predictive validity data for a selection of benchmark assessments identified by state and district personnel as in use within Mid-Atlantic Region jurisdictions. Based on a review of practices within the school districts in the region, this report details the benchmark assessments being used, in…

Descriptors: Test Content, Academic Achievement, Predictive Validity, Program Effectiveness

Lessons from an Evolving System. Interim Report: The Reliability of Vermont Portfolio Scores in the 1992-93 School Year. Project 3.2 State Accountability Models in Action.

Download full text

Koretz, Daniel; And Others – 1993

The 1992-93 school year was the second year of the implementation of the Vermont assessment program. Evaluation of the 1991-92 year yielded mixed results, with some evidence that the assessment program was having a strong impact on instruction, but other indications that the reliability of the portfolio scoring in both writing and mathematics was…

Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Evaluation Utilization

Stability of School-Level Scores from Large-Scale Student Assessment.

Peer reviewed

Sicoly, Fiore – Applied Measurement in Education, 2002

Calculated year-1 to year-2 stability of assessment data from 21 states and 2 Canadian provinces. The median stability coefficient was 0.78 in mathematics and reading, and lower in writing. A stability coefficient of 0.80 is recommended as the standard for large-scale assessments of student performance. (SLD)

Descriptors: Educational Testing, Elementary Secondary Education, Foreign Countries, Mathematics

The Effects of Functional Level Testing on Five New Standardized Reading Achievement Tests.

Download full text

Easton, John Q.; Washington, Elois D. – 1982

The effects of students taking different levels of the same standardized achievement test were assessed by administering two levels of the same test to each student. The functional level of the test was taken by all students. The second level of testing was randomly assigned at the adjacent higher or lower level of the test. Functional level…

Descriptors: Elementary Education, Pilot Projects, Reading Achievement, Scores

Portfolio Assessment: A Theoretical Estimate of Score Reliability.

Peer reviewed

Reckase, Mark D. – Educational Measurement: Issues and Practice, 1995

An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)

Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models

A Tale of Testing in Two Cities

McKenna, Bernard H. – NJEA Review, 1976

Article presented a true story of how two cities ran testing programs and the lessons that can be learned from their failures. (Editor/RK)

Descriptors: Learning Processes, Scores, Standardized Tests, Student Attitudes

Standards to Assessments: Looking at the Whole Picture. Memorandum.

Download full text

Kadamus, James A. – 2001

This field memo is the third in a series of updates from the Deputy Commissioner of the Office of Elementary, Middle, Secondary and Continuing Education to all teachers and administrators of public and nonpublic schools on standards and assessments in New York. This document explains how the New York State Education Department determines if test…

Descriptors: Decision Making, Educational Testing, Elementary Secondary Education, Reliability

Previous Page | Next Page »

Pages: 1 | 2 | 3

Koretz, Daniel	3
Somers, Marie-Andree	2
Wong, Edmond	2
Zhu, Pei	2
Allen, Nancy L.	1
Anderson, Lorin W.	1
Bennett, Randy Elliot	1
Breyer, F. Jay	1
Brown, Richard S.	1
Carvajal, Jorge	1
Coughlin, Ed	1
Easton, John Q.	1
Ebel, Robert L.	1
Ezzelle, Carol	1
Friedman, Greg	1
Haney, Walt	1
Holland, Paul W.	1
Huang, Jinyan	1
Isham, Steven P.	1
Kadamus, James A.	1
Kettler, Ryan J.	1
Klein, Stephen P.	1
Lorenz, Florian	1
Mandeville, Garrett K.	1
More ▼