ERIC - Search Results

Publication Date

In 2025	0
Since 2024	1
Since 2021 (last 5 years)	5
Since 2016 (last 10 years)	28
Since 2006 (last 20 years)	92

Descriptor

Evaluation Methods	136
Scores	136
Test Validity	136
Test Reliability	62
Student Evaluation	38
Foreign Countries	25
Test Construction	23
Standardized Tests	20
Correlation	19
Measurement Techniques	19
Psychometrics	18
Test Items	17
Factor Analysis	15
Achievement Tests	14
Comparative Analysis	14
Educational Assessment	14
Elementary Secondary Education	14
Statistical Analysis	13
Elementary School Students	12
Rating Scales	12
Test Interpretation	12
State Standards	11
Children	10
Test Bias	10
Construct Validity	9
More ▼

Publication Type

Journal Articles	99
Reports - Research	70
Reports - Evaluative	37
Reports - Descriptive	14
Speeches/Meeting Papers	9
Tests/Questionnaires	8
Opinion Papers	6
Dissertations/Theses -…	3
Information Analyses	3
Guides - Non-Classroom	2
Reports - General	2
Books	1
Guides - Classroom - Teacher	1
Guides - General	1
Non-Print Media	1
Numerical/Quantitative Data	1
Reference Materials - General	1
More ▼

Education Level

Elementary Education	25
Higher Education	19
Postsecondary Education	15
Secondary Education	14
Elementary Secondary Education	8
High Schools	7
Middle Schools	6
Early Childhood Education	5
Junior High Schools	5
Grade 6	4
Grade 4	3
Intermediate Grades	3
Primary Education	3
Adult Basic Education	2
Adult Education	2
Grade 2	2
Grade 3	2
Grade 5	2
Grade 1	1
Grade 7	1
Kindergarten	1
Preschool Education	1
More ▼

Audience

Practitioners	4
Teachers	3
Researchers	2
Administrators	1
Community	1
Parents	1
Policymakers	1

Location

Illinois	3
Massachusetts	3
United Kingdom	3
United States	3
Florida	2
Germany	2
Michigan	2
Minnesota	2
North Carolina	2
Texas	2
Washington	2
Arizona	1
Australia	1
California	1
California (Los Angeles)	1
California (San Diego)	1
China	1
Colombia	1
Colorado	1
Denmark	1
Japan	1
Jordan	1
Kenya	1
Maine	1
Maryland	1
More ▼

Laws, Policies, & Programs

Elementary and Secondary…	2
Comprehensive Education…	1
Elementary and Secondary…	1
Every Student Succeeds Act…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 136 results Save | Export

Building Validity Evidence for the Use of Aggregate Scores in Accountability

Direct link

Karen Blackburn Hoeve – ProQuest LLC, 2021

High stakes test-based accountability systems primarily rely on aggregates and derivatives of scores from tests that were originally developed to measure individual student mastery of content specifications. Current validity models do not explicitly address this use of aggregate scores to measure the performance of teachers, administrators, and…

Descriptors: Accountability, Test Validity, High Stakes Tests, Hierarchical Linear Modeling

Disrupted Data: Using Longitudinal Assessment Systems to Monitor Test Score Quality

Peer reviewed

Direct link

An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022

Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…

Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies

Developing a Game-Based Test to Assess Middle School Sixth-Grade Students' Algorithmic Thinking Skills

Peer reviewed
PDF on ERIC

Download full text

Emre Zengin; Yasemin Karal – International Journal of Assessment Tools in Education, 2024

This study was carried out to develop a test to assess algorithmic thinking skills. To this end, the twelve steps suggested by Downing (2006) were adopted. Throughout the test development, 24 middle school sixth-grade students and eight experts in different areas took part as needed in the tasks on the project. The test was given to 252 students…

Descriptors: Grade 6, Algorithms, Thinking Skills, Evaluation Methods

Impact of Superscoring on Subgroup Differences. Issue Brief

Download full text

Mattern, Krista; Radunzel, Justine – ACT, Inc., 2019

When applicants take the ACT® more than once, how do colleges and universities reconcile and make sense of the multiple scores? In terms of validity, fairness, and impact on subgroup differences, are certain score-use polices better than others? The focus of this issue brief is to summarize evidence on the validity and fairness of various…

Descriptors: Scoring, College Entrance Examinations, Test Validity, Evaluation Methods

A Novel Means-End Problem-Solving Assessment Tool for Early Intervention: Evaluation of Validity, Reliability, and Sensitivity

Peer reviewed
PDF on ERIC

Download full text

Direct link

Baraldi Cunha, Andrea; Babik, Iryna; Koziol, Natalie A.; Hsu, Lin-Ya; Nord, Jayden; Harbourne, Regina T.; Westcott-McCoy, Sarah; Dusing, Stacey C.; Bovaird, James A.; Lobo, Michele A. – Grantee Submission, 2021

Purpose: To evaluate the validity, reliability, and sensitivity of the novel Means-End Problem-Solving Assessment Tool (MEPSAT). Methods: Children with typical development and those with motor delay were assessed throughout the first 2 years of life using the MEPSAT. MEPSAT scores were validated against the cognitive and motor subscales of the…

Descriptors: Problem Solving, Early Intervention, Evaluation Methods, Motor Development

Adapting Paper-Based Tests for Computer Administration: Lessons Learned from 30 Years of Mode Effects Studies in Education

Peer reviewed
PDF on ERIC

Download full text

Lynch, Sarah – Practical Assessment, Research & Evaluation, 2022

In today's digital age, tests are increasingly being delivered on computers. Many of these computer-based tests (CBTs) have been adapted from paper-based tests (PBTs). However, this change in mode of test administration has the potential to introduce construct-irrelevant variance, affecting the validity of score interpretations. Because of this,…

Descriptors: Computer Assisted Testing, Tests, Scores, Scoring

Interaction, Change, and the Role of the Historical in Validation: The Case of L2 Dynamic Assessment

Peer reviewed

Direct link

Poehner, Matthew E.; van Compernolle, Rémi A. – Journal of Cognitive Education and Psychology, 2018

This article examines the implications of argument-based validity for the continued development of dynamic assessment (DA) research and practice. We propose that the move toward validation as a process of interpretation and evidence-based argument is commensurable with DA but that fundamental ontological differences with conventional approaches to…

Descriptors: Alternative Assessment, Evaluation Methods, Second Language Learning, Interaction

Validation Methods for Aggregate-Level Test Scale Linking: A Case Study Mapping School District Test Score Distributions to a Common Scale. CEPA Working Paper No. 16-09

Download full text

Reardon, Sean F.; Ho, Andrew D.; Kalogrides, Demetra – Stanford Center for Education Policy Analysis, 2019

Linking score scales across different tests is considered speculative and fraught, even at the aggregate level (Feuer et al., 1999; Thissen, 2007). We introduce and illustrate validation methods for aggregate linkages, using the challenge of linking U.S. school district average test scores across states as a motivating example. We show that…

Descriptors: Test Validity, Evaluation Methods, School Districts, Scores

Using Computer Adaptive Testing to Assess Physics Proficiency and Improve Exam Performance in an Introductory Physics Course

Peer reviewed

Direct link

Morphew, Jason W.; Mestre, Jose P.; Kang, Hyeon-Ah; Chang, Hua-Hua; Fabry, Gregory – Physical Review Physics Education Research, 2018

Prior research has established that students often underprepare for midterm examinations yet remain overconfident in their proficiency. Research concerning the testing effect has demonstrated that utilizing testing as a study strategy leads to higher performance and more accurate confidence compared to more common study strategies such as…

Descriptors: Computer Assisted Testing, Physics, Science Instruction, Introductory Courses

Validation of a Revised Observation-Based Assessment Tool for Children Birth through Kindergarten: The COR Advantage

Peer reviewed

Direct link

Wakabayashi, Tomoko; Claxton, Jill; Smith, Everett V., Jr. – Journal of Psychoeducational Assessment, 2019

The Child Observation Record (COR), initially developed in 1993 by HighScope Educational Research Foundation, is an observation-based instrument that provides systematic assessment of young children's knowledge and abilities in all major areas of development. Teachers or caregivers spend a few minutes each day writing brief notes or…

Descriptors: Observation, Evaluation Methods, Early Childhood Education, Kindergarten

Rapid Eye-Tracking Evaluation of Language in Children and Adolescents Referred for Assessment of Neurodevelopmental Disorders

Peer reviewed

Direct link

Frazier, Thomas W.; Hauschild, Kathryn M.; Klingemier, Eric; Strauss, Mark S.; Hardan, Antonio Y.; Youngstrom, Eric A. – Journal of Intellectual & Developmental Disability, 2020

Background: Language assessment is a key element of evaluations of children and adolescents with neurodevelopmental disorders (NDDs). The present study examined the validity of a gaze-based receptive language index (RLI) in predicting language test results.Method: Participants included toddlers, pre-school, and school age children and adolescents…

Descriptors: Children, Adolescents, Neurological Impairments, Evaluation Methods

Linking and Comparing Short and Full-Length Concept Inventories of Electricity and Magnetism Using Item Response Theory

Peer reviewed

Direct link

Xiao, Yang; Fritchman, Joseph C.; Bao, Jacqueline Y.; Nie, Ying; Han, Jing; Xiong, Jianwen; Xiao, Hua; Bao, Lei – Physical Review Physics Education Research, 2019

In physics education research (PER), concept inventories (CIs) have become standard instruments for assessing students' learning throughout instruction. To promote widespread use of concept inventories, previous studies have developed an approach to split a full length CI into short versions of CIs. This research extends the existing method to…

Descriptors: Physics, Science Instruction, Energy, Magnets

Increasing the Consequential Validity of Reading Assessment Using Dynamic Measurement Modeling: A Comment on Dumas and McNeish (2017)

Peer reviewed

Direct link

Dumas, Denis G.; McNeish, Daniel M. – Educational Researcher, 2018

Dynamic measurement modeling (DMM) has been shown to improve the consequential validity of longitudinal mathematics assessment in the Early Childhood Longitudinal Study-Kindergarten (ECLS-K) database. Here, the authors demonstrate the capability of DMM to similarly improve the consequential validity of ECLS-K reading assessment through the…

Descriptors: Measurement Techniques, Student Evaluation, Alternative Assessment, Evaluation Methods

English Learner Trajectories and Reclassification. Technical Appendices

Direct link

Betts, Julian; Hill, Laura; Bachofer, Karen; Hayes, Joseph; Lee, Andrew; Zau, Andrew – Public Policy Institute of California, 2019

This document includes two technical appendices that accompany the main report, "English Learner Trajectories and Reclassification." The two appendices include: (1) Methodology; and (2) Supporting Tables and Figures. [For the full report, see ED603764.]

Descriptors: English Language Learners, Classification, School Districts, Outcomes of Education

Revealing Hidden Talents: The Development, Use, and Benefit of VESPARCH

Peer reviewed

Direct link

Badger, Julia R.; Mellanby, Jane – British Journal of Educational Psychology, 2018

Background: School attainment tests and Cognitive Abilities Tests are used in the United Kingdom to set targets for educational outcome. Whilst these are good predictors, they depend not only on basic ability but also on learnt knowledge and skills, such as reading. Method and Aims: VESPARCH is an online group test of verbal and spatial reasoning,…

Descriptors: Foreign Countries, Intelligence Tests, Verbal Ability, Spatial Ability

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Educational Measurement:…	4
Journal of Educational…	4
Journal of Psychoeducational…	4
Assessment for Effective…	3
ETS Research Report Series	3
Educational Researcher	3
Educational and Psychological…	3
Journal of Autism and…	3
Practical Assessment,…	3
ProQuest LLC	3
Psychology in the Schools	3
Elementary School Journal	2
Grantee Submission	2
Journal of Chemical Education	2
Measurement:…	2
Phi Delta Kappan	2
Physical Review Physics…	2
ACT, Inc.	1
Advances in Health Sciences…	1
American Journal on…	1
Applied Measurement in…	1
Applied Psychological…	1
Assessment	1
Assessment & Evaluation in…	1
B.C. Journal of Special…	1
More ▼

Erford, Bradley T.	2
Frazier, Thomas W.	2
Kane, Michael T.	2
McIntyre, Nancy	2
Mundy, Peter	2
Novotny, Stephanie	2
Oswald, Tasha	2
Ryser, Gail R.	2
Swain-Lerro, Lindsey	2
Youngstrom, Eric A.	2
Zajic, Matt	2
Abu-Hamour, Bashir	1
Algozzine, Bob	1
Algozzine, Kate	1
Allen, Abigail	1
Amrein-Beardsley, Audrey	1
An, Lily Shiao	1
Anderson, Daniel	1
Anderson, Ronald E.	1
Arjoon, Janelle A.	1
Arter, Judith A.	1
August, Diane	1
Awomolo, Ademola	1
Babik, Iryna	1
More ▼

Autism Diagnostic Observation…	4
National Assessment of…	4
ACT Assessment	3
Bayley Scales of Infant…	2
Teacher Rating Scale	2
Woodcock Johnson Tests of…	2
Beck Anxiety Inventory	1
Beck Depression Inventory	1
Behavioral and Emotional…	1
Child Behavior Checklist	1
Childhood Autism Rating Scale	1
Clinical Evaluation of…	1
College Level Examination…	1
Collegiate Assessment of…	1
Early Childhood Longitudinal…	1
Graduate Management Admission…	1
Maslach Burnout Inventory	1
Massachusetts Comprehensive…	1
Measures of Academic Progress	1
Mullen Scales of Early…	1
National Assessment of Adult…	1
National Teacher Examinations	1
Peabody Individual…	1
Pennsylvania Educational…	1
Preschool Language Scale	1
More ▼