Publication Date
In 2025 | 12 |
Since 2024 | 309 |
Since 2021 (last 5 years) | 1080 |
Since 2016 (last 10 years) | 2588 |
Since 2006 (last 20 years) | 6524 |
Descriptor
Reliability | 9544 |
Validity | 3795 |
Foreign Countries | 2723 |
Measures (Individuals) | 1885 |
Correlation | 1504 |
Factor Analysis | 1438 |
Statistical Analysis | 1276 |
Questionnaires | 1076 |
Scores | 1051 |
Student Attitudes | 998 |
Psychometrics | 958 |
Audience
Researchers | 181 |
Practitioners | 98 |
Teachers | 61 |
Administrators | 42 |
Policymakers | 32 |
Students | 21 |
Counselors | 10 |
Media Staff | 5 |
Community | 1 |
Parents | 1 |
Location
Turkey | 441 |
Australia | 154 |
Canada | 140 |
United States | 124 |
China | 115 |
Taiwan | 103 |
Nigeria | 97 |
United Kingdom | 97 |
California | 93 |
Netherlands | 89 |
United Kingdom (England) | 85 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 3 |
Meets WWC Standards with or without Reservations | 4 |
Does not meet standards | 2 |
Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024
The second version of Kaiser's Measure of Sampling Adequacy (MSA₂) has been widely applied to assess the factorability of data in psychological research. The MSA₂ is developed in the population and little is known about its behavior in finite samples. If estimated MSA₂s are biased due to sampling errors,…
Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias
Matthew K. Burns; Heba Z. Abdelnaby; Jonie B. Welland; Katherine A. Graves; Kari Kurto – Assessment for Effective Intervention, 2024
The current study examined the reliability of The Reading League Curriculum-Evaluation Guidelines (CEGs), which were developed to help school-based teams rate the presence of red flags when considering adopting specific literacy curricula. Coders (n = 30) independently used the CEGs to evaluate a free online English language arts curriculum. The…
Descriptors: English Curriculum, English Instruction, Language Arts, Curriculum Evaluation
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Hulteen, Ryan M.; True, Larissa; Kroc, Edward – Measurement in Physical Education and Exercise Science, 2023
The typical process for assessing inter-rater reliability is facilitated by training raters within a research team. Lacking is an understanding if inter-rater reliability scores "between" research teams demonstrate adequate reliability. This study examined inter-rater reliability between 16 researchers who assessed fundamental motor…
Descriptors: Psychomotor Skills, Scores, Reliability, Interrater Reliability
Wiebe Koopal – Studies in Philosophy and Education, 2024
In this paper I try to 'rethink' consistency as an educational quality for the 3rd millennium, following Italo Calvino's choice to take it up in his lecture series Memos for the Next Millennium, and despite the fact that the (final) lecture devoted to this quality remained unwritten. After reflecting on how consistency already plays a certain role…
Descriptors: Reliability, Education, Instruction, Lecture Method
Riana Nurhayati; Suranto Aw; Siti Irene Astuti Dwiningrum; Mami Hajaroh; Herwin Herwin – International Journal of Educational Methodology, 2024
Evaluation of child-friendly school (CFS) policies is essential to determine the achievements of school efforts in reducing violence cases. This research aims to prove the reliability and validity of CFS policy evaluation instruments in elementary schools with different locations. This investigation uses the Context Input Process Product (CIPP)…
Descriptors: Validity, Reliability, School Policy, Program Evaluation
Wladymir Külkamp; Chris Bishop; Rafael Kons; Lara Antunes; Everton Crivoi Carmo; Deborah Hizume-Kunzler; Juliano Dal Pupo – Measurement in Physical Education and Exercise Science, 2024
The aim of this study was to verify the concurrent validity and the biological error-free reliability of a novel low-cost commercial encoder (Ergonauta I). Validity protocol involved comparisons with a custom system and other encoder commercially available (Vitruve). Reliability protocols involved interdevices and interunit comparisons. No…
Descriptors: Motion, Equipment, Reliability, Equipment Utilization
Tenko Raykov; George Marcoulides; Randall Schumacker – Measurement: Interdisciplinary Research and Perspectives, 2024
An application of Bayesian factor analysis for evaluation of scale reliability is discussed, which is developed within the framework of latent variable modeling. The method permits direct point and interval estimation of the reliability coefficient of multiple-component measuring instruments using Bayesian inference. The approach allows also point…
Descriptors: Reliability, Bayesian Statistics, Measurement Techniques, Computer Software
Tenko Raykov; George Marcoulides; James Anthony; Natalja Menold – Measurement: Interdisciplinary Research and Perspectives, 2024
A Bayesian statistics-based approach is discussed that can be used for direct evaluation of the popular Cronbach's coefficient alpha as an internal consistency index for multiple-component measuring instruments, as well as for testing its identity to scale reliability. The method represents an application of confirmatory factor analysis within the…
Descriptors: Reliability, Factor Analysis, Bayesian Statistics, Measurement Techniques
Nicole Brownlie; Katie Burke; Luke van der Laan – Quality Assurance in Education: An International Perspective, 2024
Purpose: The current literature on school teacher-created summative assessment lacks a clear consensus regarding its definition and key principles. The purpose of this research was therefore to arrive at a cohesive understanding of what constitutes effective summative assessment. Design/methodology/approach: Conducting a systematic literature…
Descriptors: Summative Evaluation, Educational Principles, Progress Monitoring, Teachers
Jamie Amemiya; Gail D. Heyman; Caren M. Walker – Developmental Science, 2024
When making inferences about the mental lives of others (e.g., others' preferences), it is critical to consider the extent to which the choices we observe are constrained. Prior research on the development of this tendency indicates a contradictory pattern: Children show remarkable sensitivity to constraints in traditional experimental paradigms,…
Descriptors: Children, Barriers, Power Structure, Childrens Attitudes
Schmidt, Ellyn M.; Rothenberg, W. Andrew; Davidson, Bridget C.; Barnett, Miya; Jent, Jason; Cadenas, Heleny; Fernandez, Corina; Davis, Eileen – Journal of Behavioral Education, 2023
Measuring classroom behavior among young children is important to guide assessment and intervention decisions, yet there is limited literature on appropriate direct observation tools for this purpose. This article describes the psychometric properties of the Behavior Assessment System for Children, Student Observation System (BASC-3 SOS) with 135…
Descriptors: Young Children, Special Education, Child Behavior, Psychometrics
Toma, Radu Bogdan – Technology, Knowledge and Learning, 2023
The development of computational thinking skills is attracting attention worldwide. The use of visual or block-based coding in primary schools has gained momentum. Yet, students' acceptance of such coding environments has been neglected in the literature. This study presents a measurement instrument that will allow pursuing such an endeavor. The…
Descriptors: Computation, Thinking Skills, Coding, Measurement
Kaila L. Stipancic; Mojgan Golzy; Yunxin Zhao; Louise Pinkerton; Andrea Rohl; Mili Kuruvilla-Dugdale – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Auditory training has been shown to reduce rater variability in perceptual voice assessment. Because rater variability is also a central issue in the auditory-perceptual assessment of dysarthria, this study sought to determine if training produces a meaningful change in rater reliability, criterion validity, and scaling magnitude of four…
Descriptors: Auditory Training, Auditory Perception, Program Effectiveness, Speech Impairments
Süreyya Yörük; Sedat Sen – Creativity Research Journal, 2023
The Creative Achievement Questionnaire (CAQ) is widely used to measure the creative achievement levels of individuals. Previous studies reported a varying range of reliability coefficients for the CAQ. To this date, no study has investigated the variability in the reliability coefficients of the CAQ. A random-effects reliability generalization…
Descriptors: Reliability, Generalization, Meta Analysis, Creativity