ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	9

Descriptor

Reliability	47
Testing Programs	47
State Programs	29
Validity	26
Elementary Secondary Education	21
Test Construction	14
Academic Achievement	11
Scoring	10
Achievement Tests	8
Educational Assessment	8
Evaluation Methods	7
Mathematics	7
Psychometrics	7
Test Use	7
Decision Making	6
Performance Based Assessment	6
Statistical Analysis	6
Student Evaluation	6
Elementary School Students	5
English	5
Error of Measurement	5
Language Arts	5
Scores	5
Test Content	5
Educational Testing	4
More ▼

Source

Applied Measurement in…	3
Behavioral Research and…	2
Educational Assessment	2
American Journal of Education	1
ETS Research Report Series	1
Education Policy Analysis…	1
Educational Measurement:…	1
Educational Researcher	1
Gifted Child Quarterly	1
International Journal of…	1
Journal of Deaf Studies and…	1
Journal of Educational…	1
National Bureau of Economic…	1
Society for Research on…	1
More ▼

Publication Type

Reports - Research	19
Journal Articles	14
Reports - Descriptive	9
Reports - Evaluative	9
Speeches/Meeting Papers	9
Numerical/Quantitative Data	8
Guides - Non-Classroom	5
Opinion Papers	2
Tests/Questionnaires	2
Collected Works - Proceedings	1
Legal/Legislative/Regulatory…	1
Reference Materials -…	1
More ▼

Education Level

Elementary Secondary Education	5
Elementary Education	3
Middle Schools	2
Secondary Education	2
Early Childhood Education	1
Grade 2	1
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Grade 7	1
High Schools	1
Higher Education	1
Junior High Schools	1
Postsecondary Education	1
More ▼

Audience

Administrators	2
Practitioners	2
Parents	1
Teachers	1

Location

United States	3
Canada	2
Kentucky	2
New York	2
Georgia	1
Hawaii	1
Louisiana	1
Maine	1
Maryland	1
Massachusetts	1
North Carolina	1
Oregon	1
Texas	1
Vermont	1
Washington	1
More ▼

Laws, Policies, & Programs

Kentucky Education Reform Act…

Assessments and Surveys

Stanford Achievement Tests	3
Iowa Tests of Basic Skills	2
Massachusetts Comprehensive…	2
National Assessment of…	2
Texas Assessment of Academic…	2
ACT Assessment	1
Cognitive Abilities Test	1
Delaware Student Testing…	1
Early Childhood Longitudinal…	1
General Educational…	1
North Carolina End of Course…	1
SAT (College Admission Test)	1
More ▼

What Works Clearinghouse Rating

Showing 1 to 15 of 47 results Save | Export

The Sensitivity of Teacher Value-Added Scores to the Use of Fall or Spring Test Scores

Peer reviewed

Direct link

Atteberry, Allison; Mangan, Daniel – Educational Researcher, 2020

Papay (2011) noticed that teacher value-added measures (VAMs) from a statistical model using the most common pre/post testing timeframe--current-year spring relative to previous spring (SS)--are essentially unrelated to those same teachers' VAMs when instead using next-fall relative to current-fall (FF). This is concerning since this choice--made…

Descriptors: Correlation, Value Added Models, Pretests Posttests, Decision Making

Multiple Linking in Equating and Random Scale Drift. Research Report. ETS RR-11-46

Peer reviewed
PDF on ERIC

Download full text

Guo, Hongwen; Liu, Jinghua; Dorans, Neil; Feigenbaum, Miriam – ETS Research Report Series, 2011

Maintaining score stability is crucial for an ongoing testing program that administers several tests per year over many years. One way to stall the drift of the score scale is to use an equating design with multiple links. In this study, we use the operational and experimental SAT® data collected from 44 administrations to investigate the effect…

Descriptors: Equated Scores, College Entrance Examinations, Reliability, Testing Programs

Combining Scores in Multiple-Criteria Assessment Systems: The Impact of Combination Rule

Peer reviewed

Direct link

McBee, Matthew T.; Peters, Scott J.; Waterman, Craig – Gifted Child Quarterly, 2014

Best practice in gifted and talented identification procedures involves making decisions on the basis of multiple measures. However, very little research has investigated the impact of different methods of combining multiple measures. This article examines the consequences of the conjunctive ("and"), disjunctive/complementary…

Descriptors: Best Practices, Ability Identification, Academically Gifted, Correlation

Large-Scale Academic Achievement Testing of Deaf and Hard-of-Hearing Students: Past, Present, and Future

Peer reviewed

Direct link

Qi, Sen; Mitchell, Ross E. – Journal of Deaf Studies and Deaf Education, 2012

The first large-scale, nationwide academic achievement testing program using Stanford Achievement Test (Stanford) for deaf and hard-of-hearing children in the United States started in 1969. Over the past three decades, the Stanford has served as a benchmark in the field of deaf education for assessing student academic achievement. However, the…

Descriptors: Testing Programs, Educational Testing, Deafness, Academic Achievement

Using State Tests vs. Study-Administered Tests to Measure Student Achievement: An Empirical Assessment Based on Four Recent Randomized Evaluations of Educational Interventions

Download full text

Zhu, Pei; Somers, Marie-Andree; Wong, Edmond – Society for Research on Educational Effectiveness, 2010

For this project, the authors use data from four IES-sponsored randomized studies to examine some of the key issues identified in May et. al. (2009). The first set of questions focuses on issues related to using state tests: (1) Do studies meet the assumptions needed for combining impacts on state tests across grades and/or states?; (2) How…

Descriptors: Academic Achievement, Program Effectiveness, State Standards, Testing Programs

Consistency of Standard Setting in an Augmented State Testing System

Peer reviewed

Direct link

Lissitz, Robert W.; Wei, Hua – Educational Measurement: Issues and Practice, 2008

In this article we address the issue of consistency in standard setting in the context of an augmented state testing program. Information gained from the external NRT scores is used to help make an informed decision on the determination of cut scores on the state test. The consistency of cut scores on the CRT across grades is maintained by forcing…

Descriptors: Testing Programs, State Programs, Standard Setting, Reliability

The GED. NBER Working Paper No. 16064

Direct link

Heckman, James J.; Humphries, John Eric; Mader, Nicholas S. – National Bureau of Economic Research, 2010

The General Educational Development (GED) credential is issued on the basis of an eight hour subject-based test. The test claims to establish equivalence between dropouts and traditional high school graduates, opening the door to college and positions in the labor market. In 2008 alone, almost 500,000 dropouts passed the test, amounting to 12% of…

Descriptors: Credentials, Testing Programs, Dropouts, Labor Market

Technical Adequacy of the easyCBM Grade 2 Reading Measures. Technical Report #1004

Download full text

Jamgochian, Elisa; Park, Bitnara Jasmine; Nese, Joseph F. T.; Lai, Cheng-Fei; Saez, Leilani; Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2010

In this technical report, we provide reliability and validity evidence for the easyCBM[R] Reading measures for grade 2 (word and passage reading fluency and multiple choice reading comprehension). Evidence for reliability includes internal consistency and item invariance. Evidence for validity includes concurrent, predictive, and construct…

Descriptors: Grade 2, Reading Comprehension, Testing Programs, Reading Fluency

Technical Adequacy of the easyCBM Reading Measures (Grades 3-7), 2009-2010 Version. Technical Report #1005

Download full text

Saez, Leilani; Park, Bitnara; Nese, Joseph F. T.; Jamgochian, Elisa; Lai, Cheng-Fei; Anderson, Daniel; Kamata, Akihito; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2010

In this series of studies, we investigated the technical adequacy of three curriculum-based measures used as benchmarks and for monitoring progress in three critical reading- related skills: fluency, reading comprehension, and vocabulary. In particular, we examined the following easyCBM measurement across grades 3-7 at fall, winter, and spring…

Descriptors: Elementary School Students, Middle School Students, Vocabulary, Reading Comprehension

Reliability and Decision Consistency: An Analysis of Writing Mode at Two Times on a Statewide Test.

Peer reviewed

Hollenbeck, Keith; Tindal, Gerald; Almond, Patricia – Educational Assessment, 1999

Studied the amount of measurement error in a state's performance-based writing task as it relates to high-stakes decision reproducibility. Using 175 eighth-grade writing samples, the study finds moderate correlations between the two raters' scores, with significant differences for the rates for the handwritten, but not the typed, essays.(SLD)

Descriptors: Decision Making, Error of Measurement, Essay Tests, Grade 8

Lessons from an Evolving System. Interim Report: The Reliability of Vermont Portfolio Scores in the 1992-93 School Year. Project 3.2 State Accountability Models in Action.

Download full text

Koretz, Daniel; And Others – 1993

The 1992-93 school year was the second year of the implementation of the Vermont assessment program. Evaluation of the 1991-92 year yielded mixed results, with some evidence that the assessment program was having a strong impact on instruction, but other indications that the reliability of the portfolio scoring in both writing and mathematics was…

Descriptors: Educational Assessment, Elementary Secondary Education, Evaluation Methods, Evaluation Utilization

Defending a State Graduation Test: "GI Forum v. Texas Education Agency." Measurement Perspectives from an External Evaluator.

Peer reviewed

Mehrens, William A. – Applied Measurement in Education, 2000

Presents conclusions of an independent measurement expert that the Texas Assessment of Academic Skills (TAAS) was constructed according to acceptable professional standards and tests curricular material considered by the Texas Board of Education important for graduates to have mastered. Also supports the validity and reliability of the TAAS and…

Descriptors: Curriculum, Psychometrics, Reliability, Standards

The Consistency of DIF/DTF across Different Test Administrations: A Multidimensional Perspective.

Download full text

Flowers, Claudia P.; Oshima, T. C. – 1994

This study was patterned after a previous study by Skaggs and Lissitz (1992) in which inconsistency of differential item functioning (DIF) was reported across test administrations. They suggested multidimensionality of test data as one possible reason for inconsistency. Therefore, in this study, DIF indices which were developed recently with a…

Descriptors: Ethnic Groups, Item Bias, Mathematics, Reliability

Using Traditional Psychometric Methodologies and the Rasch Model in Designing a Test.

Download full text

Crislip, Marian A.; Chin-Chance, Selvin – 2001

This paper discusses the use of two theories of item analysis and test construction, their strengths and weaknesses, and applications to the design of the Hawaii State Test of Essential Competencies (HSTEC). Traditional analyses of the data collected from the HSTEC field test were viewed from the perspectives of item difficulty levels and item…

Descriptors: Difficulty Level, Item Response Theory, Psychometrics, Reliability

Stability of School-Level Scores from Large-Scale Student Assessment.

Peer reviewed

Sicoly, Fiore – Applied Measurement in Education, 2002

Calculated year-1 to year-2 stability of assessment data from 21 states and 2 Canadian provinces. The median stability coefficient was 0.78 in mathematics and reading, and lower in writing. A stability coefficient of 0.80 is recommended as the standard for large-scale assessments of student performance. (SLD)

Descriptors: Educational Testing, Elementary Secondary Education, Foreign Countries, Mathematics

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Tindal, Gerald	3
Alonzo, Julie	2
Anderson, Daniel	2
Jamgochian, Elisa	2
Lai, Cheng-Fei	2
Nese, Joseph F. T.	2
Saez, Leilani	2
Almond, Patricia	1
Atteberry, Allison	1
August, Diane, Ed.	1
Bene, Nancy	1
Brennan, Robert L.	1
Chelimsky, Eleanor	1
Chin-Chance, Selvin	1
Clark, John L. D.	1
Coladarci, Theodore	1
Crislip, Marian A.	1
Dahl, Theodore	1
DeMauro, Gerald E.	1
Dorans, Neil	1
Ediger, Marlow	1
Ellett, Chad D.	1
Feigenbaum, Miriam	1
Fink, C. Dennis	1
More ▼