ERIC - Search Results

Publication Date

In 2025	1
Since 2024	3
Since 2021 (last 5 years)	6
Since 2016 (last 10 years)	33
Since 2006 (last 20 years)	232

Descriptor

Scores	363
Reliability	170
Test Reliability	169
Test Validity	89
Validity	80
Measures (Individuals)	75
Psychometrics	64
Correlation	63
Foreign Countries	52
Factor Analysis	48
Error of Measurement	47
Test Items	45
Test Construction	44
Interrater Reliability	41
Comparative Analysis	39
Scoring	37
Academic Achievement	33
Evaluation Methods	29
Item Response Theory	29
Measurement Techniques	29
Construct Validity	28
Statistical Analysis	28
Factor Structure	26
Questionnaires	25
Rating Scales	24

Publication Type

Reports - Evaluative	363
Journal Articles	282
Speeches/Meeting Papers	44
Numerical/Quantitative Data	14
Tests/Questionnaires	9
Information Analyses	7
Opinion Papers	6
Book/Product Reviews	3
Guides - Non-Classroom	3
Reports - Descriptive	2
Reports - Research	1

Education Level

Higher Education	32
Secondary Education	22
Postsecondary Education	20
Elementary Education	17
Elementary Secondary Education	15
Grade 5	14
High Schools	14
Grade 4	12
Grade 8	11
Grade 6	10
Grade 3	9
Grade 7	7
Middle Schools	7
Early Childhood Education	6
Intermediate Grades	5
Junior High Schools	5
Kindergarten	4
Primary Education	4
Grade 1	3
Grade 10	3
Grade 11	3
Grade 2	3
Preschool Education	3
Adult Education	2
Grade 9	2

Audience

Practitioners	1
Researchers	1
Teachers	1

Location

California	6
Australia	5
United Kingdom (England)	5
Vermont	5
Canada	4
China	4
United Kingdom	4
Netherlands	3
Texas	3
Florida	2
Minnesota	2
New Hampshire	2
Rhode Island	2
Spain	2
United Kingdom (Reading)	2
United States	2
Africa	1
Alabama	1
Alaska	1
Asia	1
Bangladesh	1
Brunei	1
Colombia	1
European Union	1
Finland	1

Laws, Policies, & Programs

No Child Left Behind Act 2001	3
Race to the Top	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	2
Meets WWC Standards with or without Reservations	2

Showing 1 to 15 of 363 results

New Tests of Rater Drift in Trend Scoring

Peer reviewed

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

On the Benefits of Using Maximal Reliability in Educational and Behavioral Research

Peer reviewed

Tenko Raykov – Educational and Psychological Measurement, 2024

This note is concerned with the benefits that can result from the use of the maximal reliability and optimal linear combination concepts in educational and psychological research. Within the widely used framework of unidimensional multi-component measuring instruments, it is demonstrated that the linear combination of their components that…

Descriptors: Educational Research, Behavioral Science Research, Reliability, Error of Measurement

Studying Score Stability with a Harmonic Regression Family: A Comparison of Three Approaches to Adjustment of Examinee-Specific Demographic Data

Peer reviewed

Lee, Yi-Hsuan; Haberman, Shelby J. – Journal of Educational Measurement, 2021

For assessments that use different forms in different administrations, equating methods are applied to ensure comparability of scores over time. Ideally, a score scale is well maintained throughout the life of a testing program. In reality, instability of a score scale can result from a variety of causes, some are expected while others may be…

Descriptors: Scores, Regression (Statistics), Demography, Data

Lagged Dependent Variable Predictors, Classical Measurement Error, and Path Dependency: The Conditions under Which Various Estimators Are Appropriate

Peer reviewed

Anders Holm; Anders Hjorth-Trolle; Robert Andersen – Sociological Methods & Research, 2025

Lagged dependent variables (LDVs) are often used as predictors in ordinary least squares (OLS) models in the social sciences. Although several estimators are commonly employed, little is known about their relative merits in the presence of classical measurement error and different longitudinal processes. We assess the performance of four commonly…

Descriptors: Elementary Education, Scores, Error of Measurement, Predictor Variables

Revisiting Rating Scale Development for Rater-Mediated Language Performance Assessments: Modelling Construct and Contextual Choices Made by Scale Developers

Peer reviewed

Knoch, Ute; Deygers, Bart; Khamboonruang, Apichat – Language Testing, 2021

Rating scale development in the field of language assessment is often considered in dichotomous ways: It is assumed to be guided either by expert intuition or by drawing on performance data. Even though quite a few authors have argued that rating scale development is rarely so easily classifiable, this dyadic view has dominated language testing…

Descriptors: Rating Scales, Test Construction, Language Tests, Test Use

Thanks Coefficient Alpha, We Still Need You!

Peer reviewed

Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2019

This note discusses the merits of coefficient alpha and their conditions in light of recent critical publications that miss out on significant research findings over the past several decades. That earlier research has demonstrated the empirical relevance and utility of coefficient alpha under certain empirical circumstances. The article highlights…

Descriptors: Test Validity, Test Reliability, Test Items, Correlation

Annual Changes in High School Average ACT Composite Scores: Celebration, Concern, or Something Else? Technical Brief

Harmston, Matt T.; Camara, Wayne J.; Phillips, Christine K. – ACT, Inc., 2019

Average score change: How big is big? This paper discusses school-level changes in average ACT scores and highlights an interactive tool designed to facilitate score change comparisons.

Descriptors: College Entrance Examinations, High School Students, Scores, Reliability

A Design for Comparing CTT and IRT in Test Assembly, Scoring and Argumentation: Differences among Reliability, Information and Validation

Peer reviewed

Alqarni, Abdulelah Mohammed – Journal on Educational Psychology, 2019

This study compares the psychometric properties of reliability in Classical Test Theory (CTT), item information in Item Response Theory (IRT), and validation from the perspective of modern validity theory for the purpose of bringing attention to potential issues that might exist when testing organizations use both test theories in the same testing…

Descriptors: Test Theory, Item Response Theory, Test Construction, Scoring

Deep Dive into Visual Representation and Interrater Agreement Using Data from a High-School Diving Competition

Peer reviewed

McGee, Monnie – Journal of Statistics Education, 2019

In several sporting events, the winner is chosen on the basis of a subjective score. These sports include gymnastics, ice skating, and diving. Unlike for other subjectively judged sports, diving competitions consist of multiple rounds in quick succession on the same apparatus. These multiple rounds lead to an extra layer of complexity in the data,…

Descriptors: Data Use, Visualization, Interrater Reliability, Introductory Courses

Developing an Innovation Attitude Survey for Middle School Students

Peer reviewed

Christensen, Rhonda; Knezek, Gerald – Journal of Technology Education, 2022

This article describes the development and validation of an Innovation Attitude Survey (IAS) composed of 16 Likert-type items selected to measure middle school students' attitudes toward innovation and leadership in the advancement of new ideas. The goal of developing the IAS was to identify desirable dispositions that may be related to future…

Descriptors: Attitude Measures, Likert Scales, Test Construction, Test Validity

Test Review: TestDaF

Peer reviewed

Norris, John; Drackert, Anastasia – Language Testing, 2018

The Test of German as a Foreign Language (TestDaF) plays a critical role as a standardized test of German language proficiency. Developed and administered by the Society for Academic Study Preparation and Test Development (g.a.s.t.), TestDaF was launched in 2001 and has experienced persistent annual growth, with more than 44,000 test takers in…

Descriptors: German, Second Language Learning, Language Tests, Language Proficiency

Threats to the Validity of the Collegiate Learning Assessment (CLA+) as a Measure of Critical Thinking Skills and Implications for Learning Gain

Peer reviewed

Aloisi, Cesare; Callaghan, A. – Higher Education Pedagogies, 2018

The University of Reading Learning Gain project is a three-year longitudinal project to test and evaluate a range of available methodologies and to draw conclusions on what might be the right combination of instruments for the measurement of Learning Gain in higher education. This paper analyses the validity of a measure of critical thinking…

Descriptors: Foreign Countries, Cognitive Tests, Critical Thinking, Thinking Skills

Increasing the Consequential Validity of Reading Assessment Using Dynamic Measurement Modeling: A Comment on Dumas and McNeish (2017)

Peer reviewed

Dumas, Denis G.; McNeish, Daniel M. – Educational Researcher, 2018

Dynamic measurement modeling (DMM) has been shown to improve the consequential validity of longitudinal mathematics assessment in the Early Childhood Longitudinal Study-Kindergarten (ECLS-K) database. Here, the authors demonstrate the capability of DMM to similarly improve the consequential validity of ECLS-K reading assessment through the…

Descriptors: Measurement Techniques, Student Evaluation, Alternative Assessment, Evaluation Methods

Evaluation of Dimensionality in the Assessment of Internal Consistency Reliability: Coefficient Alpha and Omega Coefficients

Peer reviewed

Green, Samuel B.; Yang, Yanyun – Educational Measurement: Issues and Practice, 2015

In the lead article, Davenport, Davison, Liou, & Love demonstrate the relationship among homogeneity, internal consistency, and coefficient alpha, and also distinguish among them. These distinctions are important because too often coefficient alpha--a reliability coefficient--is interpreted as an index of homogeneity or internal consistency.…

Descriptors: Reliability, Factor Analysis, Computation, Factor Structure

Test Review of the English Public Examination at the Secondary Level in Bangladesh

Peer reviewed

Sultana, Nasreen – Language Testing in Asia, 2018

This paper reviews the most important public English examination (matriculation exam) that students take at the end of their secondary education in Bangladesh. The examination is known as the Secondary School Certificate (SSC), which is taken at the end of Grade 10 in the mainstream education in the country. The score of SSC English examination is…

Descriptors: English (Second Language), Language Tests, Secondary School Students, Scores

Source

Educational and Psychological…	38
Journal of Psychoeducational…	15
Applied Psychological…	11
Applied Measurement in…	10
Language Testing	10
Journal of Educational…	8
Psychological Assessment	8
Measurement and Evaluation in…	6
Assessment	5
Online Submission	5
Phi Delta Kappan	5
Canadian Journal of School…	4
Educational Measurement:…	4
Journal of Child and Family…	4
Language Assessment Quarterly	4
National Center for Education…	4
Partnership for Assessment of…	4
Psychometrika	4
ACT, Inc.	3
Assessment for Effective…	3
Developmental Medicine &…	3
Evaluation & Research in…	3
Journal of Autism and…	3
Journal of Counseling…	3
Psychological Methods	3

Author

Thompson, Bruce	6
Reckase, Mark D.	5
Brennan, Robert L.	4
Lee, Guemin	4
Wainer, Howard	4
Worrell, Frank C.	4
Erford, Bradley T.	3
Petscher, Yaacov	3
Shields, Alan L.	3
Sinharay, Sandip	3
Vacha-Haase, Tammi	3
Zimmerman, Donald W.	3
Abramowitz, Jonathan S.	2
Bennett, Randy Elliot	2
Capraro, Mary Margaret	2
Capraro, Robert M.	2
Caruso, John C.	2
Cormier, Damien C.	2
Eaves, Ronald C.	2
Fan, Xitao	2
Feldt, Leonard S.	2
Floyd, Randy G.	2
Frisbie, David A.	2
Graham, James M.	2

Assessments and Surveys

Wechsler Adult Intelligence…	7
Wechsler Intelligence Scale…	6
Minnesota Multiphasic…	5
ACT Assessment	4
Beck Depression Inventory	4
National Assessment of…	4
SAT (College Admission Test)	4
Woodcock Johnson Tests of…	4
Advanced Placement…	3
Test of English as a Foreign…	3
Early Childhood Longitudinal…	2
General Educational…	2
Mathematics Anxiety Rating…	2
Peabody Developmental Motor…	2
Peabody Picture Vocabulary…	2
Torrance Tests of Creative…	2
Trends in International…	2
Wechsler Memory Scale	2
Work Keys (ACT)	2
ACT Interest Inventory	1
ACTFL Oral Proficiency…	1
Armed Forces Qualification…	1
Bayley Scales of Infant…	1
Behavior Assessment System…	1
Bem Sex Role Inventory	1