ERIC - Search Results

Publication Date

In 2025	1
Since 2024	24
Since 2021 (last 5 years)	93
Since 2016 (last 10 years)	299
Since 2006 (last 20 years)	850

Descriptor

Reliability	1051
Scores	1051
Validity	419
Correlation	277
Foreign Countries	276
Measures (Individuals)	267
Factor Analysis	209
Psychometrics	197
Statistical Analysis	164
Comparative Analysis	135
Questionnaires	114
Factor Structure	109
Error of Measurement	103
Academic Achievement	102
College Students	91
Elementary School Students	90
Evaluation Methods	90
Construct Validity	86
Gender Differences	86
Student Attitudes	69
Test Items	67
English (Second Language)	66
Item Response Theory	64
Models	63
High School Students	60
More ▼

Education Level

Higher Education	193
Postsecondary Education	131
Elementary Education	127
Secondary Education	116
Middle Schools	62
High Schools	60
Junior High Schools	44
Early Childhood Education	41
Elementary Secondary Education	31
Grade 4	26
Grade 8	25
Primary Education	25
Grade 5	23
Grade 6	21
Intermediate Grades	21
Preschool Education	19
Grade 3	18
Kindergarten	17
Grade 7	15
Grade 9	14
Grade 11	13
Grade 10	12
Grade 2	9
Grade 1	8
Grade 12	7
More ▼

Audience

Researchers	10
Teachers	5
Practitioners	3
Administrators	1
Policymakers	1
Students	1

Location

Turkey	40
United States	25
China	21
Canada	20
Australia	18
Florida	16
Pennsylvania	14
California	12
South Korea	12
Netherlands	10
Spain	10
United Kingdom (England)	10
New York	9
Taiwan	9
Greece	8
Iran	8
Portugal	7
Texas	7
United Kingdom	7
Colorado	6
Finland	6
Germany	6
Hong Kong	6
Indonesia	6
Egypt	5
More ▼

Laws, Policies, & Programs

No Child Left Behind Act 2001	12
Race to the Top	6
Americans with Disabilities…	1
Education for All Handicapped…	1
Education of the Handicapped…	1
Elementary and Secondary…	1
Head Start	1
Individuals with Disabilities…	1
Individuals with Disabilities…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1

Showing 1 to 15 of 1,051 results Save | Export

New Tests of Rater Drift in Trend Scoring

Peer reviewed

Direct link

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Trust the "Process"? When Fundamental Motor Skill Scores Are Reliably Unreliable

Peer reviewed

Direct link

Hulteen, Ryan M.; True, Larissa; Kroc, Edward – Measurement in Physical Education and Exercise Science, 2023

The typical process for assessing inter-rater reliability is facilitated by training raters within a research team. Lacking is an understanding if inter-rater reliability scores "between" research teams demonstrate adequate reliability. This study examined inter-rater reliability between 16 researchers who assessed fundamental motor…

Descriptors: Psychomotor Skills, Scores, Reliability, Interrater Reliability

Comparative Judgement in Education Research

Peer reviewed

Direct link

Ian Jones; Ben Davies – International Journal of Research & Method in Education, 2024

Educational researchers often need to construct precise and reliable measurement scales of complex and varied representations such as participants' written work, videoed lesson segments and policy documents. Developing such scales using can be resource-intensive and time-consuming, and the outcomes are not always reliable. Here we present…

Descriptors: Educational Research, Comparative Analysis, Educational Researchers, Measurement

Visualizing Agreement: Bland-Altman Plots as a Supplement to Inter-Rater Reliability Indices

Peer reviewed

Direct link

Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024

Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…

Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques

Reliable for Whom? Inferring and Reporting Reliability across Diverse Populations

Peer reviewed

Direct link

Richard S. Balkin; Quentin Hunter; Bradley T. Erford – Measurement and Evaluation in Counseling and Development, 2024

We describe best practices in reporting reliability estimates in counseling research with consideration to precision, generalization, and diverse populations. We provide a historical context to reporting reliability estimates, the limitations of past practices, and new methods to address reliability generalization. We highlight best practices…

Descriptors: Best Practices, Reliability, Counseling, Research

A Validation and Reliability Study of the Interpersonal Stress Scale-Counselor Scores

Peer reviewed

Direct link

Moore, C. Missy; Crawford, Carey C.; Tertichny, Alissa – Measurement and Evaluation in Counseling and Development, 2023

We examined dimensionality and temporal stability of the Interpersonal Stress Scale-Counselor (ISS-C) scores in a sample of professional counselors (n = 518). Confirmatory factor analyses provided support for a four-factor model previously identified through exploratory factor analysis and a bifactor model. Using a randomized test-retest, temporal…

Descriptors: Counselors, Interpersonal Relationship, Stress Variables, Measures (Individuals)

Benchmark Rating Procedure, Best of Both Worlds? Comparing Procedures to Rate Text Quality in a Reliable and Valid Manner

Peer reviewed

Direct link

Bouwer, Renske; Koster, Monica; van den Bergh, Huub – Assessment in Education: Principles, Policy & Practice, 2023

Assessing students' writing performance is essential to adequately monitor and promote individual writing development, but it is also a challenge. The present research investigates a benchmark rating procedure for assessing texts written by upper-elementary students. In two studies we examined whether a benchmark rating procedure (1) leads to…

Descriptors: Benchmarking, Writing Evaluation, Evaluation Methods, Elementary School Students

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Direct link

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

The Impact of Inconsistent Responders to Mixed-Worded Scales on Inferences in International Large-Scale Assessments

Peer reviewed

Direct link

Steinmann, Isa; Sánchez, Daniel; van Laar, Saskia; Braeken, Johan – Assessment in Education: Principles, Policy & Practice, 2022

Questionnaire scales that are mixed-worded, i.e. include both positively and negatively worded items, often suffer from issues like low reliability and more complex latent structures than intended. Part of the problem might be that some responders fail to respond consistently to the mixed-worded items. We investigated the prevalence and impact of…

Descriptors: Response Style (Tests), Test Items, Achievement Tests, Foreign Countries

Comparing Music Recordings Using Pairwise Comparative Judgement: Exploring the Judge Experience

Download full text

Lucy Chambers; Emma Walland; Jo Ireland – Research Matters, 2024

Comparative Judgement (CJ) is traditionally and primarily used to compare written texts. In this study we explored whether we could extend its use to comparing audio files. We used GCSE Music portfolios which contained a mix of audio recordings, musical scores and text documents. Fifteen judges completed two exercises: one comparing musical…

Descriptors: Evaluative Thinking, Judges, Comparative Analysis, Reliability

The Correlation between Perceptual Ratings and Nasalance Scores in Resonance Disorders: A Systematic Review

Peer reviewed

Direct link

Liu, Yilan; Lee, Sue Ann S.; Chen, Wenjun – Journal of Speech, Language, and Hearing Research, 2022

Introduction: Assessment of resonance characteristics is essential in research and clinical practice in individuals with velopharyngeal impairment. The purpose of this study was to systematically review correlations between auditory perceptual ratings and nasalance scores obtained by a nasometer in individuals with resonance disorders and to…

Descriptors: Correlation, Auditory Perception, Meta Analysis, Guidelines

On the Benefits of Using Maximal Reliability in Educational and Behavioral Research

Peer reviewed

Direct link

Tenko Raykov – Educational and Psychological Measurement, 2024

This note is concerned with the benefits that can result from the use of the maximal reliability and optimal linear combination concepts in educational and psychological research. Within the widely used framework of unidimensional multi-component measuring instruments, it is demonstrated that the linear combination of their components that…

Descriptors: Educational Research, Behavioral Science Research, Reliability, Error of Measurement

Range Restriction Affects Factor Analysis: Normality, Estimation, Fit, Loadings, and Reliability

Peer reviewed

Direct link

Franco-Martínez, Alicia; Alvarado, Jesús M.; Sorrel, Miguel A. – Educational and Psychological Measurement, 2023

A sample suffers range restriction (RR) when its variance is reduced comparing with its population variance and, in turn, it fails representing such population. If the RR occurs over the latent factor, not directly over the observed variable, the researcher deals with an indirect RR, common when using convenience samples. This work explores how…

Descriptors: Factor Analysis, Factor Structure, Scores, Sampling

Measuring Social and Emotional Learning Skills of Preschool Children in Croatia: Initial Validation of the SSIS SEL Brief Scales

Peer reviewed
PDF on ERIC

Download full text

Sanja Tatalovic Vorkapic; Christopher J. Anthony; Stephen N. Elliott; Ilaria Grazzani; Valeria Cavioni – International Journal of Emotional Education, 2024

Although there is increased interest in social and emotional competence and mental health in Croatia, there are currently limited measurement options available for early childhood settings. Thus, the SSIS SEL Brief Scales (SSIS SELb), an efficient measure of social and emotional learning competencies developed in the United States, was translated…

Descriptors: Social Emotional Learning, Preschool Children, Foreign Countries, Resilience (Psychology)

Integrating Bifactor Models into a Generalizability Theory Based Structural Equation Modeling Framework

Peer reviewed

Direct link

Vispoel, Walter P.; Lee, Hyeryung; Xu, Guanlan; Hong, Hyeri – Journal of Experimental Education, 2023

Although generalizability theory (GT) designs have traditionally been analyzed within an ANOVA framework, identical results can be obtained with structural equation models (SEMs) but extended to represent multiple sources of both systematic and measurement error variance, include estimation methods less likely to produce negative variance…

Descriptors: Generalizability Theory, Structural Equation Models, Programming Languages, Scores

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 71

Educational and Psychological…	95
ProQuest LLC	53
Journal of Psychoeducational…	28
Psychological Assessment	23
Online Submission	18
ETS Research Report Series	16
Applied Measurement in…	15
Measurement and Evaluation in…	15
Journal of Educational…	12
Advances in Health Sciences…	11
International Journal of…	11
Educational Measurement:…	10
Grantee Submission	10
Psychology in the Schools	9
Applied Psychological…	8
Assessment	8
Language Assessment Quarterly	8
Regional Educational…	8
School Psychology Quarterly	8
Assessment & Evaluation in…	7
Assessment in Education:…	7
Educational Sciences: Theory…	7
International Journal of…	7
Journal of Counseling…	7
Language Testing	7
More ▼

Thompson, Bruce	17
Henson, Robin K.	11
Haberman, Shelby J.	8
Lee, Yong-Won	7
Vacha-Haase, Tammi	7
Gill, Brian	6
Petscher, Yaacov	6
Sinharay, Sandip	6
Worrell, Frank C.	6
Kantor, Robert	5
Kolen, Michael J.	5
Lee, Guemin	5
Zimmerman, Donald W.	5
Attali, Yigal	4
Capraro, Robert M.	4
Caruso, John C.	4
Cook, Colleen	4
Fan, Xitao	4
Foorman, Barbara R.	4
Lane, Kathleen Lynne	4
Lipscomb, Stephen	4
Lowe, Patricia A.	4
Raykov, Tenko	4
Zumbo, Bruno D.	4
More ▼

Journal Articles	862
Reports - Research	740
Reports - Evaluative	170
Reports - Descriptive	63
Speeches/Meeting Papers	58
Dissertations/Theses -…	55
Tests/Questionnaires	41
Information Analyses	19
Numerical/Quantitative Data	17
Opinion Papers	10
Guides - Non-Classroom	6
Book/Product Reviews	4
Guides - General	2
Books	1
Collected Works - General	1
Collected Works - Serials	1
ERIC Digests in Full Text	1
ERIC Publications	1
Guides - Classroom - Teacher	1
More ▼

Wechsler Intelligence Scale…	12
SAT (College Admission Test)	10
Test of English as a Foreign…	9
Beck Depression Inventory	8
Child Behavior Checklist	6
Childrens Manifest Anxiety…	6
Behavior Assessment System…	5
Minnesota Multiphasic…	5
Strengths and Difficulties…	5
ACT Assessment	4
Learning and Study Strategies…	4
Peabody Picture Vocabulary…	4
Stanford Achievement Tests	4
Torrance Tests of Creative…	4
Advanced Placement…	3
Learning Style Inventory	3
Maslach Burnout Inventory	3
Mathematics Anxiety Rating…	3
Measures of Academic Progress	3
Motivated Strategies for…	3
Social Skills Improvement…	3
Test of English for…	3
Woodcock Johnson Tests of…	3
Working Alliance Inventory	3
Aberrant Behavior Checklist	2
More ▼