ERIC - Search Results

Publication Date

In 2025	0
Since 2024	6
Since 2021 (last 5 years)	25
Since 2016 (last 10 years)	860
Since 2006 (last 20 years)	1810

Descriptor

Statistical Analysis	2527
Reliability	1276
Test Reliability	1071
Foreign Countries	940
Correlation	633
Test Validity	628
Factor Analysis	559
Validity	507
Questionnaires	479
Measures (Individuals)	411
Test Construction	338
Scores	329
Comparative Analysis	321
Psychometrics	293
Student Attitudes	275
College Students	274
Interrater Reliability	267
Likert Scales	267
Gender Differences	222
Evaluation Methods	193
Factor Structure	186
Item Analysis	178
Elementary School Students	175
Measurement Techniques	174
Research Methodology	171

Education Level

Higher Education	662
Postsecondary Education	488
Secondary Education	280
Elementary Education	259
High Schools	126
Middle Schools	122
Elementary Secondary Education	88
Junior High Schools	83
Early Childhood Education	82
Grade 8	49
Primary Education	43
Grade 6	41
Preschool Education	39
Grade 5	38
Intermediate Grades	38
Grade 7	37
Grade 4	33
Grade 3	32
Kindergarten	22
Adult Education	21
Grade 1	21
Grade 2	21
Grade 10	20
Grade 9	20
Grade 11	18

Audience

Researchers	33
Practitioners	20
Teachers	10
Students	8
Administrators	5
Counselors	2
Parents	1
Policymakers	1

Location

Turkey	204
Nigeria	57
Jordan	38
Australia	35
Iran	35
Taiwan	35
Canada	31
China	30
Germany	29
California	28
United Kingdom	25
India	22
Malaysia	22
Netherlands	22
Florida	20
Greece	17
Hong Kong	16
Pennsylvania	16
Saudi Arabia	16
Spain	16
Indonesia	15
Texas	15
Japan	14
New York	14
Sweden	14

Laws, Policies, & Programs

No Child Left Behind Act 2001	8
Individuals with Disabilities…	7
Elementary and Secondary…	2
Individuals with Disabilities…	2
Race to the Top	2
Americans with Disabilities…	1
Debra P v Turlington	1
Reading Excellence Act	1
Rehabilitation Act 1973…	1
Safe and Drug Free Schools…	1

What Works Clearinghouse Rating

Meets WWC Standards without Reservations	1
Meets WWC Standards with or without Reservations	1
Does not meet standards	1

Active filter: Statistical Analysis

Showing 1 to 15 of 2,527 results

New Tests of Rater Drift in Trend Scoring

Peer reviewed

John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024

Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…

Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics

Reliability of Measuring Constructs in Applied Linguistics Research: A Comparative Study of Domestic and International Graduate Theses

Peer reviewed

Razavipour, Kioumars; Raji, Behnaz – Language Testing in Asia, 2022

The credibility of conclusions arrived at in quantitative research depends, to a large extent, on the quality of data collection instruments used to quantify language and non-language constructs. Despite this, research into data collection instruments used in Applied Linguistics and particularly in the thesis genre remains limited. This study…

Descriptors: Applied Linguistics, Test Reliability, Language Tests, Credibility

Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021

Peer reviewed

Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023

Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, "etc.") of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…

Descriptors: Chemistry, Periodicals, Journal Articles, Science Education

Designing Multisite Randomized Trials to Detect (Moderated) Mediation Effects

Peer reviewed

Fangxing Bai; Ben Kelcey; Amota Ataneka; Yanli Xie; Kyle Cox; Nianbo Dong – Society for Research on Educational Effectiveness, 2024

Purpose: Multisite mediation studies are a cornerstone in mapping out developmental processes because they probe the mechanisms of a treatment while creating key opportunities to learn from and about variation in those mechanisms across sites. Despite the prevalence of multisite studies, a significant gap in the literature is how to plan such…

Descriptors: Randomized Controlled Trials, Mediation Theory, Statistical Analysis, Robustness (Statistics)

Agree to Disagree: Multiple Methods to Assess Rater Agreement during Student Teaching

Peer reviewed

Elayne P. Colón; Lori M. Dassa; Thomas M. Dana; Nathan P. Hanson – Action in Teacher Education, 2024

To meet accreditation expectations, teacher preparation programs must demonstrate their candidates are evaluated using summative assessment tools that yield sound, reliable, and valid data. These tools are primarily used by the clinical experience team -- university supervisors and mentor teachers. Institutional beliefs regarding best practices…

Descriptors: Student Teachers, Teacher Interns, Evaluation Methods, Interrater Reliability

Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items

Peer reviewed

Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020

The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…

Descriptors: Test Bias, Interrater Reliability, Responses, Correlation

On the Importance of Coefficient Alpha for Measurement Research: Loading Equality Is Not Necessary for Alpha's Utility as a Scale Reliability Index

Peer reviewed

Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023

The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…

Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques

Improving Peer Assessment Validity and Reliability Through a Fuzzy Coherence Measure

Peer reviewed

El Alaoui, Mohamed – IEEE Transactions on Learning Technologies, 2023

Classical evaluation methods, assessments, exams, and so forth accentuate the perception of one against all, professor versus learners. Including students in the assessment process allows transforming the professor from an opponent to a critical friend, with the role of helping students to recognize both their strengths and weaknesses. However,…

Descriptors: Peer Evaluation, Educational Improvement, Test Validity, Test Reliability

Reconceptualization of Coefficient Alpha Reliability for Test Summed and Scaled Scores

Peer reviewed

Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022

Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…

Descriptors: Reliability, Scores, Scaling, Statistical Analysis

Twenty Years of Network Meta-Analysis: Continuing Controversies and Recent Developments

Peer reviewed

A. E. Ades; Nicky J. Welton; Sofia Dias; David M. Phillippo; Deborah M. Caldwell – Research Synthesis Methods, 2024

Network meta-analysis (NMA) is an extension of pairwise meta-analysis (PMA) which combines evidence from trials on multiple treatments in connected networks. NMA delivers internally consistent estimates of relative treatment efficacy, needed for rational decision making. Over its first 20 years NMA's use has grown exponentially, with applications…

Descriptors: Network Analysis, Meta Analysis, Medicine, Clinical Experience

Application of Model Averaging for Measurement in the Presence of Unknown Familiarization Phase or Fatigue Phase

Peer reviewed

Steven Kim; Stephanie Lara-Sotelo; Eric Martin – Measurement in Physical Education and Exercise Science, 2024

A number of familiarization trials are needed for reliable measurement, particularly for inexperienced subjects. Researchers have studied and developed familiarization protocols that vary by exercise and study population. The pace of familiarization and fatigue may be an individual-level characteristic, so a population-level protocol may not fit…

Descriptors: Familiarity, Physical Education, Fatigue (Biology), Reliability

Six Solutions for More Reliable Infant Research

Peer reviewed

Byers-Heinlein, Krista; Bergmann, Christina; Savalei, Victoria – Infant and Child Development, 2022

Infant research is often underpowered, undermining the robustness and replicability of our findings. Improving the reliability of infant studies offers a solution for increasing statistical power independent of sample size. Here, we discuss two senses of the term reliability in the context of infant research: reliable (large) effects and reliable…

Descriptors: Infants, Research, Reliability, Effect Size

Reliability Evidence for the NC Teacher Evaluation Process Using a Variety of Indicators of Inter-Rater Agreement

Peer reviewed

Holcomb, T. Scott; Lambert, Richard; Bottoms, Bryndle L. – Journal of Educational Supervision, 2022

In this study, various statistical indexes of agreement were calculated using empirical data from a group of evaluators (n = 45) of early childhood teachers. The group of evaluators rated ten fictitious teacher profiles using the North Carolina Teacher Evaluation Process (NCTEP) rubric. The exact and adjacent agreement percentages were calculated…

Descriptors: Interrater Reliability, Teacher Evaluation, Statistical Analysis, Early Childhood Teachers

Large-Sample Variance of Fleiss Generalized Kappa

Peer reviewed

Gwet, Kilem L. – Educational and Psychological Measurement, 2021

Cohen's kappa coefficient was originally proposed for two raters only, and it was later extended to an arbitrarily large number of raters to become what is known as Fleiss' generalized kappa. Fleiss' generalized kappa and its large-sample variance are still widely used by researchers and were implemented in several software packages, including, among…

Descriptors: Sample Size, Statistical Analysis, Interrater Reliability, Computation

A Comparison of Manual versus Automated Quantitative Production Analysis of Connected Speech

Peer reviewed

Fromm, Davida; Katta, Saketh; Paccione, Mason; Hecht, Sophia; Greenhouse, Joel; MacWhinney, Brian; Schnur, Tatiana T. – Journal of Speech, Language, and Hearing Research, 2021

Purpose: Analysis of connected speech in the field of adult neurogenic communication disorders is essential for research and clinical purposes, yet time and expertise are often cited as limiting factors. The purpose of this project was to create and evaluate an automated program to score and compute the measures from the Quantitative Production…

Descriptors: Speech, Automation, Statistical Analysis, Adults

Source

ProQuest LLC	92
Educational and Psychological…	88
Journal of Education and…	52
Online Submission	49
Educational Research and…	43
International Education…	32
Research on Social Work…	32
Journal of Psychoeducational…	31
Journal of Education and…	29
English Language Teaching	26
Journal of Educational…	26
Measurement in Physical…	26
Universal Journal of…	24
ETS Research Report Series	22
Educational Sciences: Theory…	21
Applied Psychological…	19
Measurement and Evaluation in…	19
Eurasian Journal of…	18
Grantee Submission	18
CBE - Life Sciences Education	16
EURASIA Journal of…	16
Psychometrika	15
Higher Education Studies	13
Research Quarterly for…	13
Assessment & Evaluation in…	12

Author

Alonzo, Julie	12
Price, Gary G.	12
Tindal, Gerald	10
Lai, Cheng-Fei	9
Brennan, Robert L.	8
Raykov, Tenko	8
Feldt, Leonard S.	7
Livingston, Samuel A.	7
Park, Bitnara Jasmine	7
Irvin, P. Shawn	6
Anderson, Daniel	5
Gill, Brian	5
Lembke, Erica S.	5
Lord, Frederic M.	5
Marcoulides, George A.	5
Menold, Natalja	5
Subkoviak, Michael J.	5
Zimmerman, Donald W.	5
Abell, Neil	4
Brown, James Dean	4
Forsyth, Robert A.	4
Hambleton, Ronald K.	4
Harris, Chester W.	4
Huynh, Huynh	4

Publication Type

Reports - Research	1872
Journal Articles	1818
Reports - Evaluative	187
Tests/Questionnaires	187
Speeches/Meeting Papers	107
Dissertations/Theses -…	93
Reports - Descriptive	74
Information Analyses	62
Numerical/Quantitative Data	34
Opinion Papers	23
Guides - Non-Classroom	18
Books	12
Guides - Classroom - Learner	6
Guides - General	6
Collected Works - General	4
Collected Works - Proceedings	3
Reference Materials -…	3
Reports - General	3
Book/Product Reviews	2
Collected Works - Serials	2
Guides - Classroom - Teacher	2
Non-Print Media	2
Collected Works - Serial	1
Dissertations/Theses	1
Dissertations/Theses -…	1

Assessments and Surveys

Test of English as a Foreign…	14
Wechsler Intelligence Scale…	14
Strengths and Difficulties…	12
ACT Assessment	11
Woodcock Johnson Tests of…	11
Stanford Achievement Tests	10
Motivated Strategies for…	9
SAT (College Admission Test)	9
Autism Diagnostic Observation…	8
Iowa Tests of Basic Skills	7
Peabody Picture Vocabulary…	7
Program for International…	7
Torrance Tests of Creative…	7
Marlowe Crowne Social…	6
Maslach Burnout Inventory	6
Trends in International…	6
Beck Depression Inventory	5
Dynamic Indicators of Basic…	5
Armed Services Vocational…	4
Behavior Assessment System…	4
California Achievement Tests	4
Child Behavior Checklist	4
Childrens Manifest Anxiety…	4
Early Childhood Longitudinal…	4
Flesch Kincaid Grade Level…	4