Publication Date
In 2025 | 5 |
Since 2024 | 14 |
Since 2021 (last 5 years) | 29 |
Since 2016 (last 10 years) | 77 |
Since 2006 (last 20 years) | 109 |
Descriptor
Scoring Rubrics | 117 |
Student Evaluation | 117 |
Interrater Reliability | 47 |
Reliability | 43 |
Test Reliability | 42 |
Test Validity | 36 |
Evaluation Methods | 33 |
Foreign Countries | 32 |
Validity | 27 |
Undergraduate Students | 25 |
Test Construction | 21 |
Audience
Teachers | 3 |
Administrators | 1 |
Policymakers | 1 |
Practitioners | 1 |
Location
Australia | 5 |
California | 3 |
Canada | 3 |
Turkey | 3 |
United Kingdom (England) | 3 |
Utah | 3 |
Brazil | 2 |
Colorado | 2 |
Connecticut | 2 |
Florida | 2 |
Iran | 2 |
Laws, Policies, & Programs
Every Student Succeeds Act… | 2 |
Elementary and Secondary… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
National Assessment of… | 3 |
New York State Regents… | 2 |
Flesch Kincaid Grade Level… | 1 |
Florida Comprehensive… | 1 |
Swapneel Thite; Jayashri Ravishankar; Inmaculada Tomeo-Reyes; Araceli Martinez Ortiz – European Journal of Engineering Education, 2024
Effectively working in an engineering workplace requires strong teamwork skills, yet the existing literature within various disciplines reveals discrepancies in evaluating these skills. This complicates the design of a generic teamwork peer evaluation tool for engineering students. This study aims to address this gap by introducing the DRIVE…
Descriptors: Scoring Rubrics, Evaluation Methods, Peer Evaluation, Teamwork
Shasha Chen; Shaohui Chi; Zuhao Wang – Journal of Baltic Science Education, 2025
Interdisciplinary thinking is critical for equipping students to apply scientific knowledge and tackle societal challenges across various disciplines, which has been recognized as a key objective of twenty-first century science education. However, research on effective interdisciplinary assessment in secondary school science education is still…
Descriptors: Thinking Skills, Interdisciplinary Approach, Science Instruction, Grade 7
Joan Li; Nikhil Kumar Jangamreddy; Ryuto Hisamoto; Ruchita Bhansali; Amalie Dyda; Luke Zaphir; Mashhuda Glencross – Australasian Journal of Educational Technology, 2024
Generative artificial intelligence technologies, such as ChatGPT, bring an unprecedented change in education by leveraging the power of natural language processing and machine learning. Employing ChatGPT to assist with marking written assessment presents multiple advantages including scalability, improved consistency, eliminating biases associated…
Descriptors: Higher Education, Artificial Intelligence, Grading, Scoring Rubrics
Shilan Shafiei – Language Testing in Asia, 2024
The present study aimed to develop an analytic assessment rubric for the consecutive interpreting course in the educational setting in the Iranian academic context. To this end, the general procedure of rubric development, including data preparation, selection, and refinement, was applied. The performance criteria were categorized into content,…
Descriptors: Scoring Rubrics, Translation, Language Processing, Second Languages
Saenz, David Arron – Online Submission, 2023
There is a vast body of literature documenting the positive impacts that rater training and calibration sessions have on inter-rater reliability, as research indicates that several factors, including frequency and timing, play crucial roles in ensuring inter-rater reliability. Additionally, an increasing amount of research indicates possible links in…
Descriptors: Interrater Reliability, Scoring, Training, Scoring Rubrics
Wenjing Guo – ProQuest LLC, 2021
Constructed response (CR) items are widely used in large-scale testing programs, including the National Assessment of Educational Progress (NAEP) and many district and state-level assessments in the United States. One unique feature of CR items is that they depend on human raters to assess the quality of examinees' work. The judgment of human…
Descriptors: National Competency Tests, Responses, Interrater Reliability, Error of Measurement
Sievers, Matt; Reemts, Connor; Dickinson, Katherine J.; Mukerji, Joya; Beltran, Ismael Barreras; Theobald, Elli J.; Velasco, Vicente; Freeman, Scott – Biochemistry and Molecular Biology Education, 2023
Researchers have called for undergraduate courses to update teaching frameworks based on the Modern Synthesis with insights from molecular biology, by stressing the molecular underpinnings of variation and adaptation. To support this goal, we developed a modified version of the widely used Assessing Conceptual Reasoning of Natural Selection…
Descriptors: Student Evaluation, Knowledge Level, Molecular Biology, Evolution
Marcelo Fernando Rauber; Christiane Gresse von Wangenheim; Pedro Alberto Barbetta; Adriano Ferreti Borgatto; Ramon Mayor Martins; Jean Carlo Rossa Hauck – Informatics in Education, 2024
The insertion of Machine Learning (ML) in everyday life demonstrates the importance of popularizing an understanding of ML already in school. Accompanying this trend arises the need to assess the students' learning. Yet, so far, few assessments have been proposed, most lacking an evaluation. Therefore, we evaluate the reliability and validity of…
Descriptors: Artificial Intelligence, Measures (Individuals), Test Reliability, Test Validity
Kylie Gorney; Sandip Sinharay – Journal of Educational Measurement, 2025
Although there exists an extensive amount of research on subscores and their properties, limited research has been conducted on categorical subscores and their interpretations. In this paper, we focus on the claim of Feinberg and von Davier that categorical subscores are useful for remediation and instructional purposes. We investigate this claim…
Descriptors: Tests, Scores, Test Interpretation, Alternative Assessment
Slaviša Radovic; Niels Seidel – International Journal of Assessment Tools in Education, 2024
Advanced learning technologies have become a focal point in recent educational research, holding the promise of enhancing students' self-regulated learning (SRL) by facilitating various processes of planning, monitoring, performing, and reflecting upon learning experiences. However, concerns have arisen regarding the efficacy and design of…
Descriptors: Self Management, Educational Technology, Scoring Rubrics, Educational Environment
Ramy Shabara; Khaled ElEbyary; Deena Boraie – Teaching English with Technology, 2024
Although there are claims that ChatGPT, an AI-based language model, is capable of assessing the writing of L2 learners accurately and consistently in the classroom, a number of recent studies have shown discrepancies between AI and human raters. Furthermore, there is a lack of studies investigating the intrareliability of ChatGPT scores.…
Descriptors: Foreign Countries, Artificial Intelligence, Scoring Rubrics, Student Evaluation
Gina Pancorbo; Ricardo Primi; Oliver P. John; Daniel Santos; Filip De Fruyt – Assessment in Education: Principles, Policy & Practice, 2023
Rubrics are popular tools to assess students' social-emotional skills. However, little research has been devoted to investigating whether rubrics' performance level descriptions validly represent the intended skills. In this study, we addressed the validity of Self-management rubrics by examining the presence of construct-irrelevant variance in…
Descriptors: Social Emotional Learning, Skill Development, Scoring Rubrics, Student Evaluation
Maria Blevins; Bryce Hughes; Jennifer Green; Leila Sterman; Shannon Willoughby – Journal of College Science Teaching, 2025
In this work, the authors document an expansion of the Public Speaking Competency Rubric (PSCR). First developed in 2012 by Schreiber et al., the original rubric has only one item related to non-verbal communication. The authors of this work expanded the rubric to include 10 items related to the non-verbal aspects of public speaking and had it…
Descriptors: Test Construction, Public Speaking, Competence, Scoring Rubrics
Williamson, Joanna; Child, Simon – Journal of Vocational Education and Training, 2022
School- and college-based vocational and technical qualifications (VTQs) in England are required to award successful candidates a grade rather than simple pass or fail. Ensuring the reliability and validity of these grades is considered vital, particularly in light of the high-stakes purposes for which school assessment results in England are…
Descriptors: Foreign Countries, Vocational Education, Qualifications, Student Evaluation
Ari, Gökhan – International Journal of Progressive Education, 2021
Writing rubrics have been used in doctoral dissertations in Turkey to assess student writing for nearly twenty years. This study aims to determine which features are assessed in the rubrics used in doctoral dissertations. Twenty-five rubrics were selected to determine the analysis of validity and reliability, rubric dimensions, features of…
Descriptors: Scoring Rubrics, Doctoral Dissertations, Foreign Countries, Student Evaluation