ERIC - Search Results

Showing 1 to 15 of 86 results

Visualizing Agreement: Bland-Altman Plots as a Supplement to Inter-Rater Reliability Indices

Peer reviewed

Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024

Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…

Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques

Forming a Team to Ensure High-Quality Measurement in Education Studies. REL 2014-052

Kisker, Ellen Eliason; Boller, Kimberly – Regional Educational Laboratory, 2014

This brief provides tips for forming a team of staff and consultants with the needed expertise to make key measurement decisions that will ensure high-quality data for answering the study's research questions. The brief outlines the main responsibilities of measurement team members. It also describes typical measurement tasks and discusses…

Descriptors: Teamwork, Measurement Techniques, Group Membership, Expertise

Improving Teacher Evaluation Systems: Making the Most of Multiple Measures

Grissom, Jason A., Ed.; Youngs, Peter, Ed. – Teachers College Press, 2015

This is the first book to gather and address what we have learned about the impacts and challenges of data-intensive teacher evaluation systems--a defining characteristic of the current education policy landscape. Expert researchers and practitioners speak to what we know (and what remains to be known) about evaluation measures themselves, the…

Descriptors: Teacher Evaluation, Evaluation Methods, Evaluation Research, Test Validity

Reporting What Readers Need to Know about Education Research Measures: A Guide. REL 2014-064

Peer reviewed

Boller, Kimberly; Kisker, Ellen Eliason – Regional Educational Laboratory, 2014

This guide is designed to help researchers make sure that their research reports include enough information about study measures so that readers can assess the quality of the study's methods and results. The guide also provides examples of write-ups about measures and suggests resources for learning more about these topics. The guide assumes…

Descriptors: Research Reports, Research Methodology, Educational Research, Check Lists

Exploring Student Buy-In: Initial Development of an Instrument to Measure Likelihood of Children's Intervention Usage

Peer reviewed

Briesch, Amy M.; Chafouleas, Sandra M. – Journal of Educational & Psychological Consultation, 2009

It has been suggested that both internal (e.g., acceptability) and external (e.g., feasibility) factors should be taken under consideration in order to fully understand children's usage of interventions designed to improve their behavior. The purpose of this study was to initiate development of a student self-report measure (Children's Usage…

Descriptors: Rating Scales, Intervention, Behavior Modification, Measurement Techniques

Captions, Consistency, Creativity, and the Consensual Assessment Technique: New Evidence of Reliability

Peer reviewed

Kaufman, James C.; Lee, Joohyun; Baer, John; Lee, Soonmook – Thinking Skills and Creativity, 2007

The consensual assessment technique (CAT) is a measurement tool for creativity research in which appropriate experts evaluate creative products [Amabile, T. M. (1996). "Creativity in context: Update to the social psychology of creativity." Boulder, CO: Westview]. However, the CAT is hampered by the time-consuming nature of the products (asking…

Descriptors: Creativity, Reliability, Generalizability Theory, Measurement Techniques

The Weird World, and Equally Weird Measurement Models: Reactive Indicators and the Validity Revolution

Peer reviewed

Hayduk, Leslie A.; Robinson, Hannah Pazderka; Cummings, Greta G.; Boadu, Kwame; Verbeek, Eric L.; Perks, Thomas A. – Structural Equation Modeling: A Multidisciplinary Journal, 2007

Researchers using structural equation modeling (SEM) aspire to learn about the world by seeking models with causal specifications that match the causal forces extant in the world. This quest for a model matching existing worldly causal forces constitutes an ontology that orients, or perhaps reorients, thinking about measurement validity. This…

Descriptors: Validity, Structural Equation Models, Reliability, Causal Models

Identifying the Onset and Offset of Stuttering Events.

Peer reviewed

Ingham, Roger J.; And Others – Journal of Speech and Hearing Research, 1995

Four experienced stuttering researchers viewed videodisks of spontaneous speech from chronic stutterers and attempted to locate the precise onset and offset of individual stuttering events. Results showed interjudge disagreements that challenge the reliability and validity of onset and offset judgments. Highly agreed stuttering events were…

Descriptors: Adults, Clinical Diagnosis, Evaluation Problems, Interrater Reliability

A Review of Coefficient Alpha and Some Basic Tenets of Classical Measurement Theory.

Guthrie, Abbie C. – 2000

Too many researchers speak of "the reliability of the test," thus indicating their basic misunderstanding of reliability. This paper explains classical reliability and the score features that influence coefficient alpha. It explains when coefficient alpha can be negative, even though it is conceptually a variance-accounted-for statistic.…

Descriptors: Effect Size, Measurement Techniques, Reliability, Scores

Assessment Instruments for Measuring Young Children's Social-Emotional Behavioral Development

Peer reviewed

Caselman, Tonia D.; Self, Patricia A. – Children & Schools, 2008

Early identification of social-emotional behavioral problems in infants and preschoolers is critical. Nine parent-report and caregiver/teacher-report instruments measuring preschool social-emotional behavioral problems and strengths are reviewed. Advantages to the use of parent-report and caregiver/teacher-report instruments are that they are easy…

Descriptors: Identification, Psychometrics, Evaluation Methods, Child Caregivers

Use of the Graded Response IRT Model to Assess the Reliability of Direct and Indirect Measures of Writing Assessment.

Ackerman, Terry A. – 1986

The purpose of this paper is to compare the precision of direct and indirect measures of writing assessment using the test information functions from a graded response Item Response Theory (IRT) model. Subjects were 192 sophomore English students from a parochial high school in Wisconsin. Both direct and indirect measures of writing ability were…

Descriptors: Correlation, Essay Tests, High Schools, Interrater Reliability

Interobserver Reliability Using the Phonetic Level Evaluation with Severely and Profoundly Hearing-Impaired Children.

Peer reviewed

Shaw, Stephanie; Coggins, Truman E. – Journal of Speech and Hearing Research, 1991

This study, involving five experienced and trained speech language pathologists, categorized the elicited imitations of five profoundly and five severely prelingually hearing-impaired subjects using the Phonetic Level Evaluation. Failure to obtain acceptably high levels of reliability suggests that this measure may not yet be an accurate and…

Descriptors: Acoustic Phonetics, Articulation (Speech), Congenital Impairments, Deafness

The Measurement of Developmental Variables: An Overview.

Santmire, Toni E. – 1984

The purpose of this paper is to discuss ways in which developmental psychology suffers from the lack of an appropriate technology of measurement and statistical analysis. The paper begins by noting that developmental psychology is the study of change; that individuals develop through a succession of "stages" which are separated by…

Descriptors: Data Analysis, Data Collection, Developmental Psychology, Developmental Stages

Reviewer Standards in Division I Program Selection.

Cason, Carolyn L.; And Others – 1986

Cason and Cason's model of performance rating was used to determine the extent to which variation in reviewer standards affected the reliability and validity of the program review process used to select papers for inclusion in the annual program. Data analyzed were the overall recommendation for acceptance and ratings on seven quality criteria…

Descriptors: Conference Papers, Data Analysis, Educational Research, Evaluation Criteria

Estimating the Reliability of Dynamic Variables Requiring Rater Judgment: A Generalizability Paradigm.

Webber, Larry; And Others – 1986

Generalizability theory, which subsumes classical measurement theory as a special case, provides a general model for estimating the reliability of observational rating data by estimating the variance components of the measurement design. Research data from the "Heart Smart" health intervention program were analyzed as a heuristic tool.…

Descriptors: Behavior Rating Scales, Cardiovascular System, Error of Measurement, Generalizability Theory
