Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 5 |
Since 2006 (last 20 years) | 23 |
Descriptor
Interrater Reliability | 25 |
Tests | 25 |
Evaluation Methods | 8 |
Foreign Countries | 8 |
Scores | 8 |
Correlation | 7 |
Test Reliability | 7 |
Comparative Analysis | 5 |
Scoring | 5 |
Statistical Analysis | 5 |
Student Attitudes | 5 |
More ▼ |
Source
Author
Ainslie, Martha | 1 |
Anderson, Deborah | 1 |
Arnold, Mariah | 1 |
Atilgan, Hakan | 1 |
Aulie, Vibeke Smith | 1 |
Basokcu, Tahsin Oguz | 1 |
Bednarz, Robert | 1 |
Bell, John | 1 |
Bjornson, Kristie F. | 1 |
Boon, Helen | 1 |
Burmester, Kristen O'Rourke | 1 |
More ▼ |
Publication Type
Education Level
Audience
Location
Turkey | 2 |
United Kingdom | 2 |
United Kingdom (England) | 2 |
United States | 2 |
Asia | 1 |
Australia | 1 |
Brazil | 1 |
Canada | 1 |
Connecticut | 1 |
Denmark | 1 |
Egypt | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Autism Diagnostic Observation… | 1 |
Kaufman Brief Intelligence… | 1 |
Peabody Developmental Motor… | 1 |
What Works Clearinghouse Rating
Cheung, Kason Ka Ching; Tai, Kevin W. H. – Research in Science & Technological Education, 2023
Background: Intercoder reliability is a statistic commonly reported by researchers to demonstrate the rigour of coding procedures during data analysis. Its importance is debatable in the analysis of qualitative interview data. It raises a question on whether researchers should identify the same codes and themes in a transcript or they should…
Descriptors: Interrater Reliability, Data Analysis, Interviews, Research Methodology
Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020
It has become a critical question what the reliability level would be when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…
Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability
Wesolowski, Brian C.; Wind, Stefanie A. – Journal of Educational Measurement, 2019
Rater-mediated assessments are a common methodology for measuring persons, investigating rater behavior, and/or defining latent constructs. The purpose of this article is to provide a pedagogical framework for examining rater variability in the context of rater-mediated assessments using three distinct models. The first model is the observation…
Descriptors: Interrater Reliability, Models, Observation, Measurement
Maxwell, Bruce; Boon, Helen; Tanchuk, Nicolas; Rauwerda, Bryan – Journal of Moral Education, 2021
This article documents the adaptation, piloting and validation of a measure of teachers' ethical sensitivity. To create the test, we modified a measure from dentistry drawing on literature in teacher professional ethics and drew on the expertise of professional ethics scholars and practitioners. Based on the results of Rasch analysis combined with…
Descriptors: Ethics, Moral Values, Scores, Teacher Education Programs
Nieto, Ricardo; Casabianca, Jodi M. – Journal of Educational Measurement, 2019
Many large-scale assessments are designed to yield two or more scores for an individual by administering multiple sections measuring different but related skills. Multidimensional tests, or more specifically, simple structured tests, such as these rely on multiple multiple-choice and/or constructed responses sections of items to generate multiple…
Descriptors: Tests, Scoring, Responses, Test Items
Starling, A. Leyf Peirce; Lo, Ya-Yu; Rivera, Christopher J. – Journal of the American Academy of Special Education Professionals, 2015
This study evaluated the differential effects of three different science teaching methods, namely engineering teaching kit (ETK), explicit instruction (EI), and a combination of the two methods (ETK+EI), in two sixth-grade science classrooms. Twelve students with learning disabilities (LD) and/or attention deficit hyperactivity disorder (ADHD)…
Descriptors: Science Education, Middle School Students, Problem Solving, Tests
Felderman, Theresa A. – Journal of College Teaching & Learning, 2014
Interteaching has shown to be an effective alternative to traditional lecture in a number of studies, but thorough analyses of its components, including frequent exams, is limited. Research suggests that increasing the frequency of exams may improve student learning. This study assessed the effectiveness of interteaching's frequent exams component…
Descriptors: Community Colleges, Tests, Lecture Method, Academic Achievement
Holm, Inger; Tveter, Anne Therese; Aulie, Vibeke Smith; Stuge, Britt – Research in Developmental Disabilities: A Multidisciplinary Journal, 2013
The aim of the present study was to evaluate the intra- and inter-tester reliability of the movement assessment battery for children-second edition (MABC-2), ageband 2. We wanted to analyze the collected data, with adequate statistical methods, to provide relevant recommendations for physical therapists who are interpreting changes in the context…
Descriptors: Physical Therapy, Correlation, Scores, Error of Measurement
Hermans, Heidi; van der Pas, Femke H.; Evenhuis, Heleen M. – Research in Developmental Disabilities: A Multidisciplinary Journal, 2011
Background: In the last decades several instruments measuring anxiety in adults with intellectual disabilities have been developed. Aim: To give an overview of the characteristics and psychometric properties of self-report and informant-report instruments measuring anxiety in this group. Method: Systematic review of the literature. Results:…
Descriptors: Mental Retardation, Learning Disabilities, Interrater Reliability, Measures (Individuals)
Yocum, Allison; McCoy, Sarah Westcott; Bjornson, Kristie F.; Mullens, Pamela; Burton, Gay Naganuma – Physical & Occupational Therapy in Pediatrics, 2010
A standardized protocol for a pediatric heel-rise test was developed and reliability and validity are reported. Fifty-seven children developing typically (CDT) and 34 children with plantar flexion weakness performed three tests: unilateral heel rise, vertical jump, and force measurement using handheld dynamometry. Intraclass correlation…
Descriptors: Test Validity, Test Reliability, Interrater Reliability, Psychomotor Skills
Gustafsson, Jan-Eric; Erickson, Gudrun – Educational Assessment, Evaluation and Accountability, 2013
In the Swedish educational system, teachers have the dual responsibility of assigning final grades and marking their own students' national tests. The Government has mandated the Swedish Schools Inspectorate to remark samples of the national tests to see if teacher marking can be trusted. Reports from this project have concluded that intermarker…
Descriptors: Logical Thinking, Student Evaluation, Inferences, Trust (Psychology)
Zayac, Ryan M.; Paulk, Amber L. – Journal of the Scholarship of Teaching and Learning, 2014
Although previous research has found interteaching to be an effective form of instruction, all of the currently published data have been collected in courses that have allowed for a minimum of 48 hours between class sessions. In the current study, we examined the effectiveness of interteaching compared to traditional lecture during a six week…
Descriptors: School Schedules, Behavior Modification, Teaching Methods, Course Descriptions
Pearl, Amanda M.; Murray, Michael J.; Smith, Laura A.; Arnold, Mariah – Autism: The International Journal of Research and Practice, 2013
There is a paucity of instruments designed to measure social competence of adolescents with autism spectrum disorders. The Social Responsiveness Scale is one of a few that can be used. This study compared differences between mother and father reports of social competence of adolescents. Data were collected from parents of 50 adolescents with and…
Descriptors: Social Behavior, Measurement Techniques, Adolescents, Tests
Kim, Minsung; Bednarz, Robert – Journal of Geography in Higher Education, 2013
This study developed an interview-based critical spatial thinking oral test and used the test to investigate the effects of Geographic Information System (GIS) learning on three components of critical spatial thinking: evaluating data reliability, exercising spatial reasoning, and assessing problem-solving validity. Thirty-two students at a large…
Descriptors: Spatial Ability, Geographic Information Systems, Critical Thinking, Pretests Posttests
Suto, Irenka; Nadas, Rita; Bell, John – Research Papers in Education, 2011
Accurate marking is crucial to the reliability and validity of public examinations, in England and internationally. Factors contributing to accuracy have been conceptualised as affecting either marking task demands or markers' personal expertise. The aim of this empirical study was to develop this conceptualisation through investigating the…
Descriptors: Academic Achievement, Examiners, Biology, Foreign Countries
Previous Page | Next Page »
Pages: 1 | 2