Matthew K. Burns; Heba Z. Abdelnaby; Jonie B. Welland; Katherine A. Graves; Kari Kurto – Assessment for Effective Intervention, 2024
The current study examined the reliability of The Reading League Curriculum-Evaluation Guidelines (CEGs), which were developed to help school-based teams rate the presence of red flags when considering adopting specific literacy curricula. Coders (n = 30) independently used the CEGs to evaluate a free online English language arts curriculum. The…
Descriptors: English Curriculum, English Instruction, Language Arts, Curriculum Evaluation
Kathryn J. Greenslade; Julia K. Bushell; Emily F. Dillon; Amy E. Ramage – International Journal of Language & Communication Disorders, 2025
Background: Pragmatic communication difficulties encompass many distinct behaviours, including the use of vague and/or insufficient language, a common characteristic following traumatic brain injury (TBI) that negatively impacts psychosocial outcomes. Existing assessments evaluate pragmatic communication broadly, often with only one or two items…
Descriptors: Neurological Impairments, Head Injuries, Language Impairments, Language Tests
Venkatraman, Yamini; Mahalingam, Shenbagavalli; Boominathan, Prakash – Journal of Speech, Language, and Hearing Research, 2022
Purpose: The Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V) is a standardized instrument used in voice assessment to assess voice quality. It has been translated and culturally adapted in several languages. This study aimed at developing and validating a Tamil version of CAPE-V through auditory perceptual evaluation of remotely…
Descriptors: Sentences, Dravidian Languages, Acoustics, Auditory Perception
Rajeshwari Panigrahi; Khaliq Lubza Nihar; Neha Singh – Higher Learning Research Communications, 2024
Objective: This study aimed to develop and test a scale for measuring the quality of blended learning models in higher education. Methods: This research adopts a sequential mixed-method approach to construct a new measurement scale. The first phase consisted of the inductive approach to identify the items, followed by exploratory factor analysis.…
Descriptors: Blended Learning, Educational Quality, Higher Education, Test Construction
Gunjawate, Dhanshree R.; Ravi, Rohit; Bhagavan, Srividya – Journal of Speech, Language, and Hearing Research, 2020
Purpose: The purpose of this study was to evaluate the reliability and validity of the Kannada version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). Method: The Kannada version of CAPE-V comprises six phrases that are phonetically designed as per the CAPE-V requirements. Sixty-five (21 individuals with dysphonia and 44…
Descriptors: Test Reliability, Test Validity, Dravidian Languages, Voice Disorders
Sojeong Nam; Byeolbee Um; Jeongwoon Jeong; Monique Rodriguez; David Lardier – Measurement and Evaluation in Counseling and Development, 2024
This study aimed to provide meta-analytic reliability information of the Columbia-Suicide Severity Rating Scale (C-SSRS). We implemented systematic search procedures to 35 eligible studies (N = 23,247; mean age = 26.74 years) that reported reliability estimates. The synthesized average values of Cronbach's alpha were 0.88 (95% CI [0.85, 0.92]) for the…
Descriptors: Scores, Test Reliability, Rating Scales, Suicide
Uyumaz, Gizem; Sirganci, Gözde – International Journal of Contemporary Educational Research, 2021
In this study, the assumption of the equality of psychological distance between categories of rating scale was tested based on the number of categories and ability distributions. Category parameters were estimated by using generalized partial credit model. The data sets based on the conditions of categories counts and ability distributions were…
Descriptors: Rating Scales, Classification, Reliability, Likert Scales
Menold, Natalja – Field Methods, 2023
While numerical bipolar rating scales may evoke positivity bias, little is known about the corresponding bias in verbal bipolar rating scales. The choice of verbalization of the middle category may lead to response bias, particularly if it is not in line with the scale polarity. Unipolar and bipolar seven-category rating scales in which the…
Descriptors: Rating Scales, Test Bias, Verbal Tests, Responses
Yesildag Hasancebi, Funda; Yuksel, Busra Tuncay; Mesci, Gunkut – International Journal of Assessment Tools in Education, 2022
The purpose of this study was to develop a reliable and valid rating scale for the use of the assessment and evaluation of lesson plans and teaching practices that are based on argumentation-based inquiry (ABI). The study covered two academic years (four academic semesters). Qualitative and quantitative methods were utilized throughout the…
Descriptors: Foreign Countries, Rating Scales, Test Construction, Test Validity
Catherine P. Bradshaw; Heather L. McDaniel; Chelsea A. Kaihoi; Summer S. Braun; Elise T. Pas; Jessika H. Bottiani; Anne H. Cash; Katrina J. Debnam – Assessment for Effective Intervention, 2024
This article focuses on the psychometric properties and characteristics of the Assessing School Settings: Interactions of Students and Teachers (ASSIST), an observational assessment administered by trained external observers of teacher practices, classroom context, and student behaviors at the classroom level. Study 1 examines variability,…
Descriptors: Psychometrics, Rating Scales, Observation, Classroom Observation Techniques
Lucy Chambers; Sylvia Vitello; Carmen Vidal Rodeiro – Assessment in Education: Principles, Policy & Practice, 2024
In England, some secondary-level qualifications comprise non-exam assessments which need to undergo moderation before grading. Currently, moderation is conducted at centre (school) level. This raises challenges for maintaining the standard across centres. Recent technological advances enable novel moderation methods that are no longer bound by…
Descriptors: Foreign Countries, Evaluation Methods, Comparative Analysis, Grading
D. Betsy McCoach; Scott Peters; Anthony J. Gambino; Daniel Long; Del Siegle – Grantee Submission, 2024
Teacher rating scales (TRS) often play a part in service eligibility decisions for gifted services. Although schools regularly use TRS to identify gifted students either as part of an informal nomination process or through behavioral rating scales, there is little research documenting the between-teacher variance in teacher ratings and the…
Descriptors: Gifted Education, Rating Scales, Academically Gifted, Academic Achievement
María Vallejo-Valdivielso; Pilar de Castro-Manglano; Cristina Vidal-Adroher; Azucena Díez-Suárez; Cesar A. Soutullo – Journal of Attention Disorders, 2024
Objective: To develop a short version of the Spanish 18-item ADHD-Rating Scale IV.es (sADHD-RS-IV.es) to be used as a potential screening tool in pediatric population. Methods: We recruited 652 subjects, ages 6 to 18 (mean ± SD = 11.14 ± 3.27): 518 patients with ADHD (per DSM-IV criteria); and 134 healthy controls. We performed a stepwise logistic…
Descriptors: Rating Scales, Attention Deficit Hyperactivity Disorder, Screening Tests, Children
D. Betsy McCoach; Scott Peters; Anthony J. Gambino; Daniel Long; Del Siegle – Exceptional Children, 2024
Teacher rating scales (TRS) often play a part in service eligibility decisions for gifted services. Although schools regularly use TRS to identify gifted students either as part of an informal nomination process or through behavioral rating scales, there is little research documenting the between-teacher variance in teacher ratings and the…
Descriptors: Gifted Education, Rating Scales, Academically Gifted, Academic Achievement
Lambert, Richard G.; Holcomb, T. Scott; Bottoms, Bryndle – Center for Educational Measurement and Evaluation, 2022
The validity of the Kappa coefficient of chance-corrected agreement has been questioned when the prevalence of specific rating scale categories is low and agreement between raters is high. The researchers proposed the Lambda Coefficient of Rater-Mediated Agreement as an alternative to Kappa to address these concerns. Lambda corrects for chance…
Descriptors: Interrater Reliability, Evaluators, Rating Scales, Teacher Evaluation
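Note: the concern raised in the abstract above (Kappa behaving poorly when one rating category is rare and agreement is high) can be illustrated with a small worked example. The sketch below computes the standard two-rater Cohen's kappa on made-up ratings; the data, the category labels, and the function names are assumptions for illustration only and do not come from the report. The Lambda coefficient itself is not implemented here, because the truncated abstract does not give its formula.

```python
from collections import Counter


def percent_agreement(ratings_a, ratings_b):
    """Share of items on which the two raters chose the same category."""
    return sum(a == b for a, b in zip(ratings_a, ratings_b)) / len(ratings_a)


def cohen_kappa(ratings_a, ratings_b):
    """Standard Cohen's kappa for two raters rating the same items."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must rate the same non-empty set of items")
    n = len(ratings_a)
    observed = percent_agreement(ratings_a, ratings_b)
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    # Chance agreement: product of the raters' marginal proportions,
    # summed over categories.
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        return float("nan")  # kappa is undefined when chance agreement is 1
    return (observed - expected) / (1.0 - expected)


# Hypothetical data: 100 items, the "not met" category is rare (3%).
# The raters agree on 94 items and disagree on 6.
rater_a = ["met"] * 97 + ["not met"] * 3
rater_b = ["met"] * 94 + ["not met"] * 3 + ["met"] * 3

print(f"observed agreement = {percent_agreement(rater_a, rater_b):.2f}")  # 0.94
print(f"Cohen's kappa      = {cohen_kappa(rater_a, rater_b):.3f}")        # -0.031
```

With these made-up ratings the raters agree on 94% of items, yet kappa is slightly negative because the expected chance agreement (0.9418) is already higher than the observed agreement.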