Publication Date
In 2025 | 51 |
Since 2024 | 535 |
Descriptor
Test Reliability | 535 |
Test Validity | 384 |
Foreign Countries | 270 |
Test Construction | 192 |
Psychometrics | 137 |
Factor Analysis | 124 |
Measures (Individuals) | 118 |
Student Attitudes | 55 |
College Students | 51 |
Questionnaires | 51 |
Evaluation Methods | 50 |
More ▼ |
Source
Author
D. Betsy McCoach | 3 |
Achmad Samsudin | 2 |
Adam Hadiana Aminudin | 2 |
Aisha K. Yousafzai | 2 |
Ali Roohani | 2 |
Amy M. Briesch | 2 |
Ana Daniela Silva | 2 |
Anthony J. Gambino | 2 |
Bowen Xiao | 2 |
Bradley T. Erford | 2 |
Daniel Long | 2 |
More ▼ |
Publication Type
Education Level
Location
Turkey | 52 |
China | 27 |
Indonesia | 22 |
Spain | 14 |
Canada | 10 |
Italy | 9 |
Germany | 8 |
India | 8 |
Sweden | 8 |
United Kingdom | 8 |
Iran | 7 |
More ▼ |
Laws, Policies, & Programs
Every Student Succeeds Act… | 2 |
Head Start | 1 |
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Rehabilitation Act 1973… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Samuel D'Emanuele; Francesca Nardello; Fabrizio Garau; Diego Campaci; Federico Schena; Cantor Tarperi – Measurement in Physical Education and Exercise Science, 2025
The agreement between a wearable inertial sensor (GYKO, G) and the force platform (P) was assessed by evaluating "test-retest" and "inter-rater reliability." Thirty-eight subjects were enrolled; the selected indices of balance were investigated over foot positions and (un)stable conditions. Intraclass correlation coefficient…
Descriptors: Human Posture, Measurement Equipment, Interrater Reliability, Measurement Techniques
Hsin-Yun Lee; You-Lin Chen; Li-Jen Weng – Journal of Experimental Education, 2024
The second version of Kaiser's Measure of Sampling Adequacy (MSA[subscript 2]) has been widely applied to assess the factorability of data in psychological research. The MSA[subscript 2] is developed in the population and little is known about its behavior in finite samples. If estimated MSA[subscript 2]s are biased due to sampling errors,…
Descriptors: Error of Measurement, Reliability, Sampling, Statistical Bias
Angus Kittelman; Sara Izzard; Kent McIntosh; Kelsey R. Morris; Timothy J. Lewis – Assessment for Effective Intervention, 2024
The purpose of this study was to evaluate the psychometric properties of the Self-Assessment Survey (SAS) 4.0, an updated measure assessing implementation fidelity of positive behavioral interventions and supports (PBIS). A total of 627 school personnel from 33 schools in six U.S. states completed the SAS 4.0 during the 2021-2022 school year. We…
Descriptors: Positive Behavior Supports, Teachers, Self Evaluation (Individuals), Test Reliability
Aislinn Ganci; Miran Qazizada; Brianna Fehr; Ana Vucenovic; Edmond Lou; Eric Parent – Measurement in Physical Education and Exercise Science, 2024
Spinal alignment can be assessed without radiation using three-dimensional ultrasound imaging (3DUS). Reliable measurements could inform the ideal arm position for scoliosis radiographs. This study determined the inter-evaluator reliability of axial vertebral rotation (AVR) measurements and sagittal curve angles in healthy females from 3DUS spinal…
Descriptors: Foreign Countries, Young Adults, Adults, Adolescents
Kathryn J. Greenslade; Julia K. Bushell; Emily F. Dillon; Amy E. Ramage – International Journal of Language & Communication Disorders, 2025
Background: Pragmatic communication difficulties encompass many distinct behaviours, including the use of vague and/or insufficient language, a common characteristic following traumatic brain injury (TBI) that negatively impacts psychosocial outcomes. Existing assessments evaluate pragmatic communication broadly, often with only one or two items…
Descriptors: Neurological Impairments, Head Injuries, Language Impairments, Language Tests
Julia Brochey-Taylor; Joseph A. Taylor – Educational Research and Reviews, 2024
The purpose of this synthesis study was to assess the reliability and validity of the Draw-A-Scientist Test (DAST) and its variations across multiple studies, aiming to understand limitations and propose modifications for future application within and beyond the science domain. Given the existence of multiple DAST versions, this study quantified…
Descriptors: Cognitive Tests, Freehand Drawing, Personality Measures, Projective Measures
Arielle Boguslav; Julie Cohen – Journal of Teacher Education, 2024
Teacher preparation programs are increasingly expected to use data on preservice teacher (PST) skills to drive program improvement and provide targeted supports. Observational ratings are especially vital, but also prone to measurement issues. Scores may be influenced by factors unrelated to PSTs' instructional skills, including rater standards.…
Descriptors: Preservice Teachers, Measures (Individuals), Evaluation Problems, Teaching Skills
Riana Nurhayati; Suranto Aw; Siti Irene Astuti Dwiningrum; Mami Hajaroh; Herwin Herwin – International Journal of Educational Methodology, 2024
Evaluation of child-friendly school (CFS) policies is essential to determine the achievements of school efforts in reducing violence cases. This research aims to proving the reliability and validity of CFS policy evaluation instruments in elementary schools with different locations. This investigation uses the Context Input Process Product (CIPP)…
Descriptors: Validity, Reliability, School Policy, Program Evaluation
Swapneel Thite; Jayashri Ravishankar; Inmaculada Tomeo-Reyes; Araceli Martinez Ortiz – European Journal of Engineering Education, 2024
Effectively working in an engineering workplace requires strong teamwork skills, yet the existing literature within various disciplines reveals discrepancies in evaluating these skills. This complicates the design of a generic teamwork peer evaluation tool for engineering students. This study aims to address this gap by introducing the DRIVE…
Descriptors: Scoring Rubrics, Evaluation Methods, Peer Evaluation, Teamwork
Janice Kinghorn; Katherine McGuire; Bethany L. Miller; Aaron Zimmerman – Assessment Update, 2024
In this article, the authors share their reflections on how different experiences and paradigms have broadened their understanding of the work of assessment in higher education. As they collaborated to create a panel for the 2024 International Conference on Assessing Quality in Higher Education, they recognized that they, as assessment…
Descriptors: Higher Education, Assessment Literacy, Evaluation Criteria, Evaluation Methods
Jiayu Zhai; Vahid Aryadoust – Metacognition and Learning, 2024
Metacognitive awareness is essential in regulating second language (L2) listening and has been predominantly assessed by a multidimensional instrument named the Metacognitive Awareness Listening Questionnaire (MALQ). Since previous studies have yielded inconclusive evidence concerning the generalization of MALQ, it is important to examine the…
Descriptors: Metacognition, Second Language Learning, Listening, Test Reliability
Amanda M. Snyder – ProQuest LLC, 2024
The ever-changing advances in technology require digital literacy skills for success in the workplace. To determine the critical digital literacy skills needed in the workplace today, the development of a reliable, valid instrument occurred using the nine steps of scale development by DeVellis and Thorpe (2021). Based on the SkillRise (2020a)…
Descriptors: Digital Literacy, Measures (Individuals), Job Skills, Test Reliability
Paul Alexander Siegel – ProQuest LLC, 2024
While multimodality and multiliteracies has been a concept for 25 years (Kalantzis & Cope, 2023; The New London Group, 1996), research on and application of the concept within text complexity measures has been limited. Attempts to assess multiliteracies and multimodality (Jacobs, 2013; Schmerbeck & Lucht, 2017; Wyatt-Smith & Kimber,…
Descriptors: Multiple Literacies, Learning Modalities, Test Validity, Test Reliability
Ting Ma; Lawrence Jun Zhang; Judy M. Parr – Language Awareness, 2025
Studies have shown that raising L2 learners' metaphor awareness contributes to the acquisition of figurative language, which fosters students' development of language skills. However, the instruments measuring metaphor awareness, in the majority of relevant research, did not seem to have undergone proper methodological procedures for checking…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Figurative Language
Guido Schwarzer; Gerta Rücker; Cristina Semaca – Research Synthesis Methods, 2024
The "LFK" index has been promoted as an improved method to detect bias in meta-analysis. Putatively, its performance does not depend on the number of studies in the meta-analysis. We conducted a simulation study, comparing the "LFK" index test to three standard tests for funnel plot asymmetry in settings with smaller or larger…
Descriptors: Bias, Meta Analysis, Simulation, Evaluation Methods