Publication Date
In 2025 | 0 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 12 |
Since 2016 (last 10 years) | 30 |
Since 2006 (last 20 years) | 72 |
Descriptor
Evaluation Methods | 192 |
Rating Scales | 192 |
Test Reliability | 92 |
Test Validity | 64 |
Reliability | 62 |
Interrater Reliability | 47 |
Measurement Techniques | 37 |
Higher Education | 32 |
Correlation | 29 |
Validity | 26 |
Test Construction | 24 |
Author
Follman, John | 4 |
Cason, Gerald J. | 3 |
Algina, James | 2 |
Alonso, Ariel | 2 |
Cason, Carolyn L. | 2 |
Crawford, Angela R. | 2 |
Erford, Bradley T. | 2 |
Ginns, Paul | 2 |
Houston, Walter M. | 2 |
Johnson, Evelyn S. | 2 |
Laenen, Annouschka | 2 |
Location
United Kingdom (England) | 3 |
Canada | 2 |
Europe | 2 |
United States | 2 |
Arkansas | 1 |
Australia | 1 |
China | 1 |
China (Beijing) | 1 |
Hawaii | 1 |
Illinois | 1 |
Indonesia | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Womens Educational Equity Act | 1 |
Lucy Chambers; Sylvia Vitello; Carmen Vidal Rodeiro – Assessment in Education: Principles, Policy & Practice, 2024
In England, some secondary-level qualifications comprise non-exam assessments which need to undergo moderation before grading. Currently, moderation is conducted at centre (school) level. This raises challenges for maintaining the standard across centres. Recent technological advances enable novel moderation methods that are no longer bound by…
Descriptors: Foreign Countries, Evaluation Methods, Comparative Analysis, Grading
Nicole D. Martin; Stephanie N. Baker; Madeline Haynes; Jayce R. Warner – Computer Science Education, 2024
Background and Context: As computer science (CS) education expands and the need for well-prepared CS teachers grows, understanding what motivates teachers to teach CS can help address challenges to recruiting, preparing, and retaining teachers. Objective: The goal of this work was to develop and validate a scale that measures teachers' motivation…
Descriptors: Computer Science Education, Teacher Motivation, Measurement Techniques, Construct Validity
Weiwei Tong; Prasong Saihong; Kanyarat Sonsupap – International Journal of Language Education, 2024
The main objective of this study is to revise and validate the assessment of self-presentation skills of middle school students. The assessment is based on existing self-assessment scales and adaptively modified for a more accurate assessment of middle school students' self-presentation skills. Considering the characteristics of middle school…
Descriptors: Middle School Students, Self Evaluation (Individuals), Rating Scales, Reliability
Rossin, Emily G.; Bergee, Martin J. – Journal of Research in Music Education, 2021
This is the sixth and culminating study in a series whose purpose has been to acquire a conceptual understanding of school band performance and to develop an assessment based on this understanding. With the present study, we cross-validated and applied a rating scale for school band performance. In the cross-validation phase, college students…
Descriptors: Music Education, Music Activities, Music, Performance
Huiying Cai; Xun Yan – Language Testing, 2024
Rater comments tend to be qualitatively analyzed to indicate raters' application of rating scales. This study applied natural language processing (NLP) techniques to quantify meaningful, behavioral information from a corpus of rater comments and triangulated that information with a many-facet Rasch measurement (MFRM) analysis of rater scores. The…
Descriptors: Natural Language Processing, Item Response Theory, Rating Scales, Writing Evaluation
Williamson, Joanna; Child, Simon – Journal of Vocational Education and Training, 2022
School- and college-based vocational and technical qualifications (VTQs) in England are required to award successful candidates a grade rather than simple pass or fail. Ensuring the reliability and validity of these grades is considered vital, particularly in light of the high-stakes purposes for which school assessment results in England are…
Descriptors: Foreign Countries, Vocational Education, Qualifications, Student Evaluation
Walland, Emma – Research Matters, 2022
In this article, I report on examiners' views and experiences of using Pairwise Comparative Judgement (PCJ) and Rank Ordering (RO) as alternatives to traditional analytical marking for GCSE English Language essays. Fifteen GCSE English Language examiners took part in the study. After each had judged 100 pairs of essays using PCJ and eight packs of…
Descriptors: Essays, Grading, Writing Evaluation, Evaluators
Gordon, Jean K.; Clough, Sharice – Journal of Speech, Language, and Hearing Research, 2022
Purpose: Aphasia fluency is multiply determined by underlying impairments in lexical retrieval, grammatical formulation, and speech production. This poses challenges for establishing a reliable and feasible tool to measure fluency in the clinic. We examine the reliability and validity of perceptual ratings and clinical perspectives on the utility…
Descriptors: Aphasia, Language Fluency, Language Impairments, Evaluation Methods
Wiggins, Holly C.; Roscoe, Eileen M. – Journal of Applied Behavior Analysis, 2020
Although a demand analysis is helpful for identifying potential establishing operations for the functional analysis (FA) demand condition, it may not always be practical due to time constraints. A potential alternative is the Negative Reinforcement Rating Scale (NRRS), an indirect assessment tool that may serve as a time efficient alternative to a…
Descriptors: Functional Behavioral Assessment, Evaluation Methods, Negative Reinforcement, Autism
Koçak, Duygu – International Electronic Journal of Elementary Education, 2020
One of the most commonly used methods for measuring higher-order thinking skills such as problem-solving or written expression is open-ended items. Three main approaches are used to evaluate responses to open-ended items: general evaluation, rating scales, and rubrics. In order to measure and improve problem-solving skills of students, firstly, an…
Descriptors: Interrater Reliability, Item Response Theory, Test Items, Rating Scales
Valeria Cavioni; Luisa Broli; Ilaria Grazzani – International Journal of Emotional Education, 2024
The importance of enhancing social and emotional skills in educational settings has gained prominence, with many countries and organizations embracing the Social and Emotional Learning (SEL) framework to equip individuals with the tools needed for shaping a self-identity, emotional regulation, goal achievement, empathy, nurturing relationships,…
Descriptors: Social Emotional Learning, Guidelines, Educational Policy, Cross Cultural Studies
Crawford, Angela R.; Johnson, Evelyn S.; Moylan, Laura A.; Zheng, Yuzhu – Assessment for Effective Intervention, 2019
This study describes the development and initial psychometric evaluation of the Recognizing Effective Special Education Teachers (RESET) observation instrument. The study uses generalizability theory to compare two versions of a rubric, one with general descriptors of performance levels and one with item-specific descriptors of performance levels,…
Descriptors: Teacher Evaluation, Special Education Teachers, Scoring Rubrics, Observation
Thai, Thuy; Sheehan, Susan – Language Education & Assessment, 2022
In language performance tests, raters are important as their scoring decisions determine which aspects of performance the scores represent; however, raters are considered as one of the potential sources contributing to unwanted variability in scores (Davis, 2012). Although a great number of studies have been conducted to unpack how rater…
Descriptors: Rating Scales, Speech Communication, Second Language Learning, Second Language Instruction
Wakabayashi, Tomoko; Claxton, Jill; Smith, Everett V., Jr. – Journal of Psychoeducational Assessment, 2019
The Child Observation Record (COR), initially developed in 1993 by HighScope Educational Research Foundation, is an observation-based instrument that provides systematic assessment of young children's knowledge and abilities in all major areas of development. Teachers or caregivers spend a few minutes each day writing brief notes or…
Descriptors: Observation, Evaluation Methods, Early Childhood Education, Kindergarten
Erford, Bradley T.; Jackson, Jessica; Bardhoshi, Gerta; Duncan, Kelly; Atalay, Zumra – Measurement and Evaluation in Counseling and Development, 2018
Psychometric meta-analyses and reviews were provided for four commonly used suicidal ideation instruments: the Beck Scale for Suicide Ideation, the Suicide Ideation Questionnaire, the Suicide Probability Scale, and Columbia-Suicide Severity Rating Scale. Practical and technical issues and best use recommendations for screening and outcome…
Descriptors: Suicide, Psychological Patterns, Meta Analysis, Evaluation Methods