Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 14 |
Since 2006 (last 20 years) | 37 |
Descriptor
Qualitative Research | 47 |
Scoring | 47 |
Foreign Countries | 18 |
Statistical Analysis | 17 |
Interviews | 13 |
Evaluation Methods | 9 |
English (Second Language) | 8 |
Student Evaluation | 8 |
Test Construction | 8 |
College Students | 7 |
Second Language Learning | 7 |
Author
Newhouse, C. Paul | 2 |
Tarricone, Pina | 2 |
Artino, Anthony R., Jr. | 1 |
Avery, Mitchell | 1 |
Babaii, Esmat | 1 |
Baker, Sheldon R. | 1 |
Bao, Xiaoli | 1 |
Bastiaens, Theo | 1 |
Bennett, Randy Elliot | 1 |
Berlin, Rebekah | 1 |
Beutner, Marc | 1 |
Publication Type
Reports - Research | 36 |
Journal Articles | 33 |
Reports - Evaluative | 6 |
Speeches/Meeting Papers | 6 |
Tests/Questionnaires | 5 |
Dissertations/Theses -… | 4 |
Numerical/Quantitative Data | 1 |
Reports - Descriptive | 1 |
Location
Australia | 4 |
Turkey | 2 |
California | 1 |
Finland | 1 |
Germany | 1 |
India | 1 |
Iowa | 1 |
Iran | 1 |
Japan | 1 |
Kentucky | 1 |
Massachusetts (Boston) | 1 |
Laws, Policies, & Programs
Race to the Top | 1 |
Assessments and Surveys
Early Childhood Environment… | 2 |
Flanders System of… | 1 |
Graduate Record Examinations | 1 |
National Assessment of… | 1 |
Strategy Inventory for… | 1 |
Test of English as a Foreign… | 1 |
United States Medical… | 1 |
Berlin, Rebekah; Cohen, Julie – ZDM: The International Journal on Mathematics Education, 2018
In this paper, we analyze mathematics lessons using the Classroom Assessment Scoring System (CLASS), a standardized observation protocol that suggests that high-quality lessons are distinguished by the tenor and frequency of classroom interactions. Because the CLASS focuses on interactions, rather than the specifics of content teaching, it can be…
Descriptors: Educational Quality, Instructional Effectiveness, Mathematics Instruction, Classroom Observation Techniques
Reinertsen, Nathanael – English in Australia, 2018
The difference in how humans read and how Automated Essay Scoring (AES) systems process written language leads to a situation where a portion of student responses will be comprehensible to human markers, but unable to be parsed by AES systems. This paper examines a number of pieces of student writing that were marked by trained human markers, but…
Descriptors: Qualitative Research, Writing Evaluation, Essay Tests, Computer Assisted Testing
Tarricone, Pina; Newhouse, C. Paul – Educational Assessment, 2017
In this article we describe a three-year study that was conducted in three phases to evaluate the feasibility of assessing digitized portfolios of student creative work for high-stakes purposes. The first two phases suggested that creative work could be digitized with adequate fidelity, and that students could submit their own work from schools to…
Descriptors: Scoring, Reliability, Comparative Analysis, Portfolios (Background Materials)
Qudah, Ahmad Hassan – Journal of Education and Practice, 2016
The research aims to identify a specific way to evaluate mathematics learning, yielding a "measuring tool" for learners' achievement in mathematics that reflects their level of understanding through a score (mark) that can be trusted to a high degree. The behavior of the learner can be measured by a professional way to build the…
Descriptors: Mathematics Instruction, Mathematics Teachers, Student Evaluation, Evaluation Methods
Beutner, Marc; Rüscher, Frederike Anna – International Association for Development of the Information Society, 2017
This paper provides insights into the development of a skill matching test which addresses soft skills and integrates videos as media to provide information about situations to be rated. The design of the skill testing and matching tool is situated in the educational ERASMUS+ project SMART, which is presented as well. With a specific view on team work…
Descriptors: Foreign Countries, Test Construction, Testing, Video Technology
Park, Yoon Soo; Hyderi, Abbas; Bordage, Georges; Xing, Kuan; Yudkowsky, Rachel – Advances in Health Sciences Education, 2016
Recent changes to the patient note (PN) format of the United States Medical Licensing Examination have challenged medical schools to improve the instruction and assessment of students taking the Step-2 clinical skills examination. The purpose of this study was to gather validity evidence regarding response process and internal structure, focusing…
Descriptors: Interrater Reliability, Generalizability Theory, Licensing Examinations (Professions), Physicians
Yeates, Peter; O'Neill, Paul; Mann, Karen; Eva, Kevin – Advances in Health Sciences Education, 2013
Assessors' scores in performance assessments are known to be highly variable. Attempted improvements through training or rating format have achieved minimal gains. The mechanisms that contribute to variability in assessors' scoring remain unclear. This study investigated these mechanisms. We used a qualitative approach to study…
Descriptors: Performance Based Assessment, Scores, Evaluators, Scoring
Reeves, Todd D. – Action in Teacher Education, 2017
Current preservice teacher education practice related to data use has been deemed inadequate, in that it is unevenly distributed and often superficial. In response, this article describes a course-based classroom assessment data-literacy experience for preservice elementary teachers. Grounded in extant theory and research concerning data literacy…
Descriptors: Preservice Teachers, Elementary School Teachers, Scoring, Teaching Methods
Zeng, Songtian – ProQuest LLC, 2017
Over 30 states have adopted the Early Childhood Environment Rating Scale-Revised (ECERS-R) as a component of their program quality assessment systems, but the use of ECERS-R on such a large scale has raised important questions about implementation. One of the most pressing questions centers upon decisions users must make between two scoring…
Descriptors: Rating Scales, Scoring, Validity, Comparative Analysis
Moran, Renee M. R. – Educational Studies: Journal of the American Educational Studies Association, 2017
The use of student achievement data to evaluate an individual teacher's effectiveness has become a new focus in educational policy. This article focuses on the underresearched teacher perception of this new policy measure. Drawing on ethnographic research procedures, this article explores how first-grade teachers in one state navigated a new…
Descriptors: Accountability, Teacher Evaluation, Teacher Attitudes, Ethnography
Cornish, Disa Lubker; Losch, Mary E.; Avery, Mitchell – American Journal of Sexuality Education, 2016
Monitoring fidelity of implementation is a critical task when initiating evidence-based programs. This pilot study sought to identify best practices in a fidelity monitoring process and determine the feasibility of continuing a fidelity monitoring process with a multisite, multiprogram initiative. A fidelity log was created for each of 11…
Descriptors: Evidence Based Practice, Pilot Projects, Best Practices, Fidelity
Greene, Barbara A.; Lubin, Ian A.; Slater, Janis L.; Walden, Susan E. – Journal of Science Education and Technology, 2013
Two studies were conducted to examine content knowledge changes following 2 weeks of professional development that included scientific research with university scientists. Engaging teachers in scientific research is considered to be an effective way of encouraging knowledge of both inquiry pedagogy and content knowledge. We used concept maps with…
Descriptors: Scoring, Science Teachers, Concept Mapping, Replication (Evaluation)
Newhouse, C. Paul; Tarricone, Pina – Canadian Journal of Learning and Technology, 2014
High-stakes external assessment for practical courses is fraught with problems impacting on the manageability, validity and reliability of scoring. Alternative approaches to assessment using digital technologies have the potential to address these problems. This paper describes a study that investigated the use of these technologies to create and…
Descriptors: High Stakes Tests, Student Evaluation, Evaluation Methods, Scoring
Han, Turgay; Huang, Jinyan – PASAA: Journal of Language Teaching and Learning in Thailand, 2017
Using generalizability (G-) theory and rater interviews as both quantitative and qualitative approaches, this study examined the impact of scoring methods (i.e., holistic versus analytic scoring) on the scoring variability and reliability of an EFL institutional writing assessment at a Turkish university. Ten raters were invited to rate 36…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Scoring
Güler, Nese – Eurasian Journal of Educational Research, 2014
Problem Statement: The most significant disadvantage of open-ended items that allow the valid measurement of upper level cognitive behaviours, such as synthesis and evaluation, is scoring. The difficulty associated with objectively scoring the answers to the items contributes to the reduction of the reliability of the scores. Moreover, other…
Descriptors: Item Response Theory, Statistics, Scoring, Reliability