NotesFAQContact Us
Collection
Advanced
Search Tips
Publication Date
In 20250
Since 20240
Since 2021 (last 5 years)0
Since 2016 (last 10 years)14
Since 2006 (last 20 years)37
Audience
Laws, Policies, & Programs
Race to the Top1
What Works Clearinghouse Rating
Showing 1 to 15 of 47 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Berlin, Rebekah; Cohen, Julie – ZDM: The International Journal on Mathematics Education, 2018
In this paper, we analyze mathematics lessons using the Classroom Assessment Scoring System (CLASS), a standardized observation protocol that suggests that high-quality lessons are distinguished by the tenor and frequency of classroom interactions. Because the CLASS focuses on interactions, rather than the specifics of content teaching, it can be…
Descriptors: Educational Quality, Instructional Effectiveness, Mathematics Instruction, Classroom Observation Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Reinertsen, Nathanael – English in Australia, 2018
The difference in how humans read and how Automated Essay Scoring (AES) systems process written language leads to a situation where a portion of student responses will be comprehensible to human markers, but unable to be parsed by AES systems. This paper examines a number of pieces of student writing that were marked by trained human markers, but…
Descriptors: Qualitative Research, Writing Evaluation, Essay Tests, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Tarricone, Pina; Newhouse, C. Paul – Educational Assessment, 2017
In this article we describe a three-year study that was conducted in three phases to evaluate the feasibility of assessing digitized portfolios of student creative work for high-stakes purposes. The first two phases suggested that creative work could be digitized with adequate fidelity, and that students could submit their own work from schools to…
Descriptors: Scoring, Reliability, Comparative Analysis, Portfolios (Background Materials)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Qudah, Ahmad Hassan – Journal of Education and Practice, 2016
The research aims to reveal the specific way to evaluate learning mathematics, so that we get the "measuring tool" for the achievement of learners in mathematics that reflect their level of understanding by score (mark), which we trust it with high degree. The behavior of the learner can be measured by a professional way to build the…
Descriptors: Mathematics Instruction, Mathematics Teachers, Student Evaluation, Evaluation Methods
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Beutner, Marc; Rüscher, Frederike Anna – International Association for Development of the Information Society, 2017
This paper provides insights in the development of a skill matching test which addresses soft skills integrated videos as media to provide information about situations to be rated. The design of the skill testing and matching tool is situated in the educational ERASMUS+ project SMART which is presented as well. With a specific view on team work…
Descriptors: Foreign Countries, Test Construction, Testing, Video Technology
Peer reviewed Peer reviewed
Direct linkDirect link
Park, Yoon Soo; Hyderi, Abbas; Bordage, Georges; Xing, Kuan; Yudkowsky, Rachel – Advances in Health Sciences Education, 2016
Recent changes to the patient note (PN) format of the United States Medical Licensing Examination have challenged medical schools to improve the instruction and assessment of students taking the Step-2 clinical skills examination. The purpose of this study was to gather validity evidence regarding response process and internal structure, focusing…
Descriptors: Interrater Reliability, Generalizability Theory, Licensing Examinations (Professions), Physicians
Peer reviewed Peer reviewed
Direct linkDirect link
Yeates, Peter; O'Neill, Paul; Mann, Karen; Eva, Kevin – Advances in Health Sciences Education, 2013
Assessors' scores in performance assessments are known to be highly variable. Attempted improvements through training or rating format have achieved minimal gains. The mechanisms that contribute to variability in assessors' scoring remain unclear. This study investigated these mechanisms. We used a qualitative approach to study…
Descriptors: Performance Based Assessment, Scores, Evaluators, Scoring
Peer reviewed Peer reviewed
Direct linkDirect link
Reeves, Todd D. – Action in Teacher Education, 2017
Current preservice teacher education practice related to data use has been deemed inadequate, in that it is unevenly distributed and often superficial. In response, this article describes a course-based classroom assessment data-literacy experience for preservice elementary teachers. Grounded in extant theory and research concerning data literacy…
Descriptors: Preservice Teachers, Elementary School Teachers, Scoring, Teaching Methods
Zeng, Songtian – ProQuest LLC, 2017
Over 30 states have adopted the Early Childhood Environmental Rating Scale-Revised (ECERS-R) as a component of their program quality assessment systems, but the use of ECERS-R on such a large scale has raised important questions about implementation. One of the most pressing question centers upon decisions users must make between two scoring…
Descriptors: Rating Scales, Scoring, Validity, Comparative Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Moran, Renee M. R. – Educational Studies: Journal of the American Educational Studies Association, 2017
The use of student achievement data to evaluate an individual teacher's effectiveness has become a new focus in educational policy. This article focuses on the underresearched teacher perception of this new policy measure. Drawing on ethnographic research procedures, this article explores how first-grade teachers in one state navigated a new…
Descriptors: Accountability, Teacher Evaluation, Teacher Attitudes, Ethnography
Peer reviewed Peer reviewed
Direct linkDirect link
Cornish, Disa Lubker; Losch, Mary E.; Avery, Mitchell – American Journal of Sexuality Education, 2016
Monitoring fidelity of implementation is a critical task when initiating evidence-based programs. This pilot study sought to identify best practices in a fidelity monitoring process and determine the feasibility of continuing a fidelity monitoring process with a multisite, multiprogram initiative. A fidelity log was created for each of 11…
Descriptors: Evidence Based Practice, Pilot Projects, Best Practices, Fidelity
Peer reviewed Peer reviewed
Direct linkDirect link
Greene, Barbara A.; Lubin, Ian A.; Slater, Janis L.; Walden, Susan E. – Journal of Science Education and Technology, 2013
Two studies were conducted to examine content knowledge changes following 2 weeks of professional development that included scientific research with university scientists. Engaging teachers in scientific research is considered to be an effective way of encouraging knowledge of both inquiry pedagogy and content knowledge. We used concept maps with…
Descriptors: Scoring, Science Teachers, Concept Mapping, Replication (Evaluation)
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Newhouse, C. Paul; Tarricone, Pina – Canadian Journal of Learning and Technology, 2014
High-stakes external assessment for practical courses is fraught with problems impacting on the manageability, validity and reliability of scoring. Alternative approaches to assessment using digital technologies have the potential to address these problems. This paper describes a study that investigated the use of these technologies to create and…
Descriptors: High Stakes Tests, Student Evaluation, Evaluation Methods, Scoring
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Han, Turgay; Huang, Jinyan – PASAA: Journal of Language Teaching and Learning in Thailand, 2017
Using generalizability (G-) theory and rater interviews as both quantitative and qualitative approaches, this study examined the impact of scoring methods (i.e., holistic versus analytic scoring) on the scoring variability and reliability of an EFL institutional writing assessment at a Turkish university. Ten raters were invited to rate 36…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Scoring
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Güler, Nese – Eurasian Journal of Educational Research, 2014
Problem Statement: The most significant disadvantage of open-ended items that allow the valid measurement of upper level cognitive behaviours, such as synthesis and evaluation, is scoring. The difficulty associated with objectively scoring the answers to the items contributes to the reduction of the reliability of the scores. Moreover, other…
Descriptors: Item Response Theory, Statistics, Scoring, Reliability
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4