Lucy Chambers; Sylvia Vitello; Carmen Vidal Rodeiro – Assessment in Education: Principles, Policy & Practice, 2024
In England, some secondary-level qualifications comprise non-exam assessments which need to undergo moderation before grading. Currently, moderation is conducted at centre (school) level. This raises challenges for maintaining the standard across centres. Recent technological advances enable novel moderation methods that are no longer bound by…
Descriptors: Foreign Countries, Evaluation Methods, Comparative Analysis, Grading
Kapsner-Smith, Mara R.; Opuszynski, Amanda; Stepp, Cara E.; Eadie, Tanya L. – Journal of Speech, Language, and Hearing Research, 2021
Purpose: The reliability of auditory-perceptual judgments between listeners is a long-standing problem in the assessment of voice disorders. The purpose of this study was to determine whether a relatively novel experimental scaling method, called visual sort and rate (VSR), yielded stronger reliability than the more frequently used method of…
Descriptors: Voice Disorders, Interrater Reliability, Rating Scales, Severity (of Disability)
Seyda Aydin-Karaca; Mustafa Serdar Köksal; Bilkay Bi – Journal of Psychoeducational Assessment, 2024
This study aimed to develop a parent rating scale (PRSG) to screen children for further giftedness identification. The participants of the study were 255 parents of gifted and non-gifted students. The PRSG, consisting of 30 items, was created by consulting parents and reviewing existing instruments in the literature. As…
Descriptors: Rating Scales, Parent Attitudes, Scores, Comparative Analysis
Shannon Ryan; Thomas J. Power; Laura Pendergast; Bridget Poznanski; Jenelle Nissley-Tsiopinis; Howard Abikoff; Richard Gallagher; Katie Tremont; Jaclyn Cacia; Jennifer A. Mautone – Grantee Submission, 2024
Organization, time management, and planning (OTMP) skills are behavioral manifestations of executive functioning linked to academic outcomes. Interventions to improve OTMP skills have shown favorable outcomes. The Children's Organizational Skills Scale parent and teacher forms (COSS-P, COSS-T) are widely used for assessing OTMP skills, but there…
Descriptors: Psychometrics, Rating Scales, Executive Function, Time Management
Shannon Ryan; Thomas J. Power; Laura Pendergast; Bridget Poznanski; Jenelle Nissley-Tsiopinis; Howard Abikoff; Richard Gallagher; Katie Tremont; Jaclyn Cacia; Jennifer A. Mautone – School Mental Health, 2024
Organization, time management, and planning (OTMP) skills are behavioral manifestations of executive functioning linked to academic outcomes. Interventions to improve OTMP skills have shown favorable outcomes. The Children's Organizational Skills Scale parent and teacher forms (COSS-P, COSS-T) are widely used for assessing OTMP skills, but there…
Descriptors: Psychometrics, Rating Scales, Executive Function, Time Management
Walland, Emma – Research Matters, 2022
In this article, I report on examiners' views and experiences of using Pairwise Comparative Judgement (PCJ) and Rank Ordering (RO) as alternatives to traditional analytical marking for GCSE English Language essays. Fifteen GCSE English Language examiners took part in the study. After each had judged 100 pairs of essays using PCJ and eight packs of…
Descriptors: Essays, Grading, Writing Evaluation, Evaluators
Neitzel, Jennifer; Early, Diane; Sideris, John; LaForrett, Doré; Abel, Michael B.; Soli, Margaret; Davidson, Dawn L.; Haboush-Deloye, Amanda; Hestenes, Linda L.; Jenson, Denise; Johnson, Cindy; Kalas, Jennifer; Mamrak, Angela; Masterson, Marie L.; Mims, Sharon U.; Oya, Patti; Philson, Bobbi; Showalter, Megan; Warner-Richter, Mallory; Kortright Wood, Jill – Journal of Early Childhood Research, 2019
The Early Childhood Environment Rating Scales, including the "Early Childhood Environment Rating Scale--Revised" (Harms et al., 2005) and the "Early Childhood Environment Rating Scale, Third Edition" (Harms et al., 2015) are the most widely used observational assessments in early childhood learning environments. The most recent…
Descriptors: Rating Scales, Early Childhood Education, Educational Quality, Scoring
Tian, Meng; Virtanen, Tuomo – ECNU Review of Education, 2021
Purpose: Drawing on distributed leadership and motivation theories, this study investigates teachers' perceptions of resource and agency distributions and identifies the key factors motivating leadership among teachers. Design/Approach/Methods: This quantitative study collected data from 327 teachers in nine schools in Shanghai. Chi-square tests…
Descriptors: Foreign Countries, Principals, Faculty Workload, Teacher Leadership
Manzano, Dexter L. – International Journal of Language Testing, 2022
The increasing popularity of self-assessment prompted several scholars to investigate its effectiveness and accuracy in relation to teacher assessment. However, most of these studies focused only on the consistency estimate perspective. Thus, the current study investigated the interrater reliability between self- and teacher assessment of…
Descriptors: Oral Language, Self Evaluation (Individuals), College Students, Interrater Reliability
Hestenes, Linda L.; Rucker, Lia; Wang, Yudan Chen; Mims, Sharon U.; Hestenes, Stephen E.; Cassidy, Deborah J. – Early Education and Development, 2019
Research Findings: The present study provides an initial descriptive comparison of the Early Childhood Environment Rating Scale-Revised (ECERS-R) and the Early Childhood Environment Rating Scale-Third Edition (ECERS-3) in a relatively large sample in 1 state that uses the Environment Rating Scales within its Quality Rating and Improvement System…
Descriptors: Comparative Analysis, Educational Quality, Rating Scales, Early Childhood Education
Xu, Jing; Jones, Edmund; Laxton, Victoria; Galaczi, Evelina – Assessment in Education: Principles, Policy & Practice, 2021
Recent advances in machine learning have made automated scoring of learner speech widespread, and yet validation research that provides support for applying automated scoring technology to assessment is still in its infancy. Both the educational measurement and language assessment communities have called for greater transparency in describing…
Descriptors: Second Language Learning, Second Language Instruction, English (Second Language), Computer Software
Jeong, Heejeong – Language Testing in Asia, 2019
In writing assessment, finding a valid, reliable, and efficient scale is critical. Appropriate scales increase rater reliability and can also save time and money. This exploratory study compared the effects of a binary scale and an analytic scale across teacher raters and expert raters. The purpose of the study is to find out how different scale…
Descriptors: Writing Evaluation, English (Second Language), Second Language Learning, Second Language Instruction
Seipp, Larry Michael – ProQuest LLC, 2021
Nonmusical factors affect the Virginia Band and Orchestra Directors Association (VBODA) concert performances and subsequent assessment results; namely, school size, ethnicity, and socioeconomic status. A comparison of ratings given by individual trained evaluators demonstrates interrater reliability. A comparison of final ratings given at…
Descriptors: Comparative Analysis, Predictor Variables, Socioeconomic Status, Ethnicity
Park, Mi Sun – Language Assessment Quarterly, 2020
In the present study, I examined the effects of rater characteristics, in particular, raters' familiarity with a foreign accent, on the assessment of second language (L2) pronunciation. Forty-three native English-speaking teachers were divided into three groups according to their reported types of familiarity with Korean accents: heritage,…
Descriptors: Evaluators, Familiarity, Second Language Learning, English (Second Language)
Ozdemir, Hasan Fehmi; Kutlu, Omer; Huang, Shaofu; Crick, Ruth – International Journal of Assessment Tools in Education, 2022
The aim of this study is to adapt the Crick Learning for Resilient Agency (CLARA) to Turkish culture, and to examine the psychometric features of the inventory according to both Classical Test Theory (CTT) and Item Response Theory (IRT). In this respect, it is descriptive survey research. Two different study groups were formed in…
Descriptors: Item Response Theory, Psychometrics, English (Second Language), English Literature