Publication Date
In 2025 | 12 |
Since 2024 | 309 |
Since 2021 (last 5 years) | 1080 |
Since 2016 (last 10 years) | 2588 |
Since 2006 (last 20 years) | 6524 |
Descriptor
Reliability | 9544 |
Validity | 3795 |
Foreign Countries | 2723 |
Measures (Individuals) | 1885 |
Correlation | 1504 |
Factor Analysis | 1438 |
Statistical Analysis | 1276 |
Questionnaires | 1076 |
Scores | 1051 |
Student Attitudes | 998 |
Psychometrics | 958 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 181 |
Practitioners | 98 |
Teachers | 61 |
Administrators | 42 |
Policymakers | 32 |
Students | 21 |
Counselors | 10 |
Media Staff | 5 |
Community | 1 |
Parents | 1 |
Location
Turkey | 441 |
Australia | 154 |
Canada | 140 |
United States | 124 |
China | 115 |
Taiwan | 103 |
Nigeria | 97 |
United Kingdom | 97 |
California | 93 |
Netherlands | 89 |
United Kingdom (England) | 85 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 3 |
Meets WWC Standards with or without Reservations | 4 |
Does not meet standards | 2 |
Terra Blevins – ProQuest LLC, 2024
While large language models (LLMs) continue to grow in scale and gain new zero-shot capabilities, their performance for languages beyond English increasingly lags behind. This gap is due to the "curse of multilinguality," where multilingual language models perform worse on individual languages than a monolingual model trained on that…
Descriptors: Multilingualism, Computational Linguistics, Second Languages, Reliability
Miriam C. Boesch; M. Alexandra Da Fonte; Melissa J. Cavagnini; Kaitlyn R. Shaw; Keren E. Deneny; Margaret F. Davis – Journal of Special Education Technology, 2024
Students with complex communication needs have increasingly been using non-dedicated communication systems, such as mobile devices, to support their communication needs. This in turn, has led to an increased used of augmentative and alternative communication apps. The main challenge currently faced is the lack of empirically validated apps and…
Descriptors: Computer Oriented Programs, Evaluation Methods, Augmentative and Alternative Communication, Communication Disorders
Zirou Lin; Hanbing Yan; Li Zhao – Journal of Computer Assisted Learning, 2024
Background: Peer assessment has played an important role in large-scale online learning, as it helps promote the effectiveness of learners' online learning. However, with the emergence of numerical grades and textual feedback generated by peers, it is necessary to detect the reliability of the large amount of peer assessment data, and then develop…
Descriptors: Peer Evaluation, Automation, Grading, Models
Niziolek, Caroline A.; Parrell, Benjamin – Journal of Speech, Language, and Hearing Research, 2021
Purpose: Speakers use auditory feedback to guide their speech output, although individuals differ in the magnitude of their compensatory response to perceived errors in feedback. Little is known about the factors that contribute to the compensatory response or how fixed or flexible they are within an individual. Here, we test whether manipulating…
Descriptors: Acoustics, Speech, Auditory Perception, Reliability
Uyumaz, Gizem; Sirganci, Gözde – International Journal of Contemporary Educational Research, 2021
In this study, the assumption of the equality of psychological distance between categories of rating scale was tested based on the number of categories and ability distributions. Category parameters were estimated by using generalized partial credit model. The data sets based on the conditions of categories counts and ability distributions were…
Descriptors: Rating Scales, Classification, Reliability, Likert Scales
Barrick, Katie; Riegelman, Amy – College & Research Libraries, 2021
In recent years, various disciplines have engaged in efforts to increase research reproducibility including the adoption of replicable search methodologies. With the development of reporting checklists and guidelines for systematic reviews such as the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) Statement, authors…
Descriptors: Search Strategies, Punctuation, Search Engines, Reliability
Siqi Huang – North American Chapter of the International Group for the Psychology of Mathematics Education, 2023
The goal of this paper is twofold. First, the paper clarifies and elaborates on an important theoretical construct called orientation with respect to understanding in mathematics, which denotes the degree to which students exhibit an inclination towards and demonstrate an earnest concern for understanding in mathematical learning. Second, the…
Descriptors: Mathematics Instruction, Teaching Methods, Problem Solving, Reliability
Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023
The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…
Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques
Tsangaridou, Niki; Charalambous, Charalambos Y. – Quest, 2023
Focusing on systematic observation, one of the most potent methods of studying teaching quality, represents one of the numerous contributions of Daryl Siedentop to the profession. While he had a clear focus on issues of validity and reliability concerning systematic observation, over the past decades, attention to such issues appears to have…
Descriptors: Physical Education Teachers, Observation, Validity, Reliability
Poulsen, Mads; Juul, Holger; Elbro, Carsten – Annals of Dyslexia, 2023
Different definitions and tests of dyslexia can cause unfairness and make life difficult for people with dyslexia as well as for the professionals. In 2012, the Danish government decided to support the fight against dyslexia. The government issued a public tender for the development of "a standardized, electronically administered test of…
Descriptors: Dyslexia, National Competency Tests, Foreign Countries, Test Construction
Moore, C. Missy; Crawford, Carey C.; Tertichny, Alissa – Measurement and Evaluation in Counseling and Development, 2023
We examined dimensionality and temporal stability of the Interpersonal Stress Scale-Counselor (ISS-C) scores in a sample of professional counselors (n = 518). Confirmatory factor analyses provided support for a four-factor model previously identified through exploratory factor analysis and a bifactor model. Using a randomized test-retest, temporal…
Descriptors: Counselors, Interpersonal Relationship, Stress Variables, Measures (Individuals)
Bouwer, Renske; Koster, Monica; van den Bergh, Huub – Assessment in Education: Principles, Policy & Practice, 2023
Assessing students' writing performance is essential to adequately monitor and promote individual writing development, but it is also a challenge. The present research investigates a benchmark rating procedure for assessing texts written by upper-elementary students. In two studies we examined whether a benchmark rating procedure (1) leads to…
Descriptors: Benchmarking, Writing Evaluation, Evaluation Methods, Elementary School Students
Tine S. Prøitz – Teaching in Higher Education, 2023
Drawing on the concepts of consistency, this study contributes to the discussion of study programme plans and the links between curriculum elements. The main argument is that a universal requirement of consistency is taken for granted in study programme planning, even though critics have noted a need for closer scrutiny and debate. The literature…
Descriptors: Curriculum Development, Reliability, College Curriculum, Alignment (Education)
Matthew J. Madison; Seungwon Chung; Junok Kim; Laine P. Bradshaw – Grantee Submission, 2023
Recent developments have enabled the modeling of longitudinal assessment data in a diagnostic classification model (DCM) framework. These longitudinal DCMs were developed to provide measures of student growth on a discrete scale in the form of attribute mastery transitions, thereby supporting categorical and criterion-referenced interpretations of…
Descriptors: Models, Cognitive Measurement, Diagnostic Tests, Classification
Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022
Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…
Descriptors: Reliability, Scores, Scaling, Statistical Analysis