Publication Date
In 2025 | 67 |
Since 2024 | 901 |
Since 2021 (last 5 years) | 3415 |
Since 2016 (last 10 years) | 7595 |
Since 2006 (last 20 years) | 14761 |
Descriptor
Test Reliability | 14547 |
Test Validity | 9865 |
Reliability | 9544 |
Foreign Countries | 6751 |
Test Construction | 4608 |
Validity | 4120 |
Measures (Individuals) | 3750 |
Factor Analysis | 3720 |
Psychometrics | 3393 |
Interrater Reliability | 3054 |
Correlation | 3009 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 703 |
Practitioners | 447 |
Teachers | 204 |
Administrators | 121 |
Policymakers | 62 |
Counselors | 42 |
Students | 37 |
Parents | 11 |
Community | 7 |
Media Staff | 5 |
Support Staff | 5 |
More ▼ |
Location
Turkey | 1246 |
Australia | 428 |
Canada | 371 |
China | 329 |
United States | 264 |
United Kingdom | 246 |
Taiwan | 221 |
Netherlands | 217 |
Indonesia | 214 |
California | 208 |
Spain | 201 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 8 |
Meets WWC Standards with or without Reservations | 9 |
Does not meet standards | 6 |
Kunar, Melina A.; Watson, Derrick G. – Cognitive Research: Principles and Implications, 2023
Computer-Aided Detection (CAD) has been proposed to help operators search for cancers in mammograms. Previous studies have found that although accurate CAD leads to an improvement in cancer detection, inaccurate CAD leads to an increase in both missed cancers and false alarms. This is known as the over-reliance effect. We investigated whether…
Descriptors: Assistive Technology, Computer Use, Clinical Diagnosis, Screening Tests
Bryce D. McLeod; Nicole Porter; Aaron Hogue; Emily M. Becker-Haimes; Amanda Jensen-Doss – Grantee Submission, 2023
Objective: The precise measurement of treatment fidelity (quantity and quality in the delivery of treatment strategies in an intervention) is essential for intervention development, evaluation, and implementation. Various informants are used in fidelity assessment (e.g., observers, practitioners [clinicians, teachers], clients), but these…
Descriptors: Measurement, Fidelity, Educational Research, Evidence Based Practice
Mohd Norlizam Mohd Razali; Aida Hanim A. Hamid; Bity Salwana Alias; Azlin Norhaini Mansor – Journal of Education and Learning (EduLearn), 2025
A teacher competency instrument was developed to determine the level of teacher competency in small schools in Peninsular Malaysia. This study was conducted in Perak and Negeri Sembilan to determine the instrument's reliability and validity. Exploratory factor analysis (EFA) and item reliability analysis were used to determine the questionnaire's…
Descriptors: Foreign Countries, Elementary Secondary Education, Small Schools, Rural Schools
Dankiw, Kylie A.; Baldock, Katherine L.; Kumar, Saravana; Tsiros, Margarita D. – Australasian Journal of Early Childhood, 2021
Identifying and describing children's play behaviours is an important component of evaluating child development. The Behaviour Mapping Schedule is a direct observational tool which aims to describe and quantify children's play behaviours but is yet to undergo reliability testing. This study aimed to determine the intra- and inter-rater reliability…
Descriptors: Interrater Reliability, Classification, Child Behavior, Play
Starmer, Heather M.; Arrese, Loni; Langmore, Susan; Ma, Yifei; Murray, Joseph; Patterson, Joanne; Pisegna, Jessica; Roe, Justin; Tabor-Gray, Lauren; Hutcheson, Katherine – Journal of Speech, Language, and Hearing Research, 2021
Purpose: While flexible endoscopic evaluation of swallowing (FEES) is a common clinical procedure used in the head and neck cancer (HNC) population, extant outcome measures for FEES such as bolus-level penetration-aspiration and residue scores are not well suited as global patient-level endpoint measures of dysphagia severity in cooperative group…
Descriptors: Medical Evaluation, Physical Disabilities, Safety, Efficiency
Barbara Jane Cunningham; Peter Rosenbaum; Anastasia Nepotiuk; Nancy Thomas-Stonell – Communication Disorders Quarterly, 2024
This brief report presents interrater reliability data for the Focus on the Outcomes of Communication Under Six (FOCUS-34) between parents, and between parents and speech-language pathologists (SLPs). Reliability for all three raters combined was good to excellent across three assessments. Reliability for pairs of raters was variable but generally…
Descriptors: Interrater Reliability, Outcome Measures, Preschool Children, Parents
Laura Scholes; Sarah McDonald; Garth Stahl; Barbara Comber – British Educational Research Journal, 2024
Sourcing information related to socio-scientific issues requires sophisticated literacies to read and evaluate conflicting accounts often signified by disagreement among experts, multiple solutions or misinformation. Much of the previous work exploring how young people approach conflicting information has tended to focus on students in the…
Descriptors: Middle School Students, Information Sources, Internet, Search Strategies
Joana Soares; Maria do Céu Taveira; Paulo Cardoso; Ana Daniela Silva – International Journal for Educational and Vocational Guidance, 2024
The Student Career Construction Inventory measures students' adapting behaviors. The present study validates this inventory in a sample of 314 Portuguese college students. Measurement confirmatory factorial analysis indicates better fit for the 18-items measurement model, comparing to the 25-items model. Reliability and criterion-related analyses…
Descriptors: College Students, Test Reliability, Test Validity, Vocational Interests
Fuat Ozcan; Ali Meydan – Journal of Education in Science, Environment and Health, 2024
The goal of this study is to create the Zero Waste Attitude Scale, which will be used to determine the zero-waste attitude of social studies teacher candidates and to conduct validity and reliability studies. The data for the study were collected with a 5-point Likert-type form from pre-service teachers studying in the social studies teaching…
Descriptors: Test Construction, Preservice Teachers, Social Studies, Test Validity
Merve Sapmaz Atalar; Gençer Genç; Ahsen Erim; Beyza Pehlivan; Bertug Sakin; Serpil Bulut; Neila J. Donovan – International Journal of Language & Communication Disorders, 2024
Background: Communication of people with Parkinson's disease (PwPD) is negatively affected. For PwPD with communication difficulties, it is important to use self-assessment tools as a primary assessment approach to evaluate their perspectives on communication. It is also important to evaluate PwPDs with self-assessment scales in order to determine…
Descriptors: Communication Skills, Neurological Impairments, Self Evaluation (Individuals), Test Validity
Cameron Downing; Markéta Caravolas – Reading and Writing: An Interdisciplinary Journal, 2024
Spelling and handwriting are related skills which are critical for writing but are typically assessed separately. Doing so makes it more difficult to understand their respective development. We describe the creation and evaluation of a tool for their concurrent assessment: the Spelling and Handwriting Legibility Test (SaHLT). We examined whether…
Descriptors: Spelling, Handwriting, Writing Skills, Test Construction
Ann C. Jolly; Kristen D. Beach; Heather H. Aiken; Steven J. Amendum – Journal of Educational and Psychological Consultation, 2024
The field of education relies heavily on instructional coaches to build teacher capacity in the implementation of evidence-based practices (EBPs). Although observation tools are commonly used to measure the fidelity of implementation by teachers, fewer tools are available to identify specific coaching behaviors used during in situ coaching…
Descriptors: Coaching (Performance), Observation, Research Tools, Reliability
Denise Swanson; Gerald Tindal – Behavioral Research and Teaching, 2024
This technical report provides an authoritative bibliographic resource of all the studies conducted on "easyCBM"® and published on the main website for Behavioral Research and Teaching under Publications (https://brtprojects.org). The "easyCBM"© software is a direct descendent of "Curriculum-based Measurement" (CBM)…
Descriptors: Bibliographies, Computer Software, Test Construction, Test Reliability
Amanda A. Wolkowitz; Russell Smith – Practical Assessment, Research & Evaluation, 2024
A decision consistency (DC) index is an estimate of the consistency of a classification decision on an exam. More specifically, DC estimates the percentage of examinees that would have the same classification decision on an exam if they were to retake the same or a parallel form of the exam again without memory of taking the exam the first time.…
Descriptors: Testing, Test Reliability, Replication (Evaluation), Decision Making
Terra Blevins – ProQuest LLC, 2024
While large language models (LLMs) continue to grow in scale and gain new zero-shot capabilities, their performance for languages beyond English increasingly lags behind. This gap is due to the "curse of multilinguality," where multilingual language models perform worse on individual languages than a monolingual model trained on that…
Descriptors: Multilingualism, Computational Linguistics, Second Languages, Reliability