Publication Date
In 2025 | 1 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 6 |
Since 2016 (last 10 years) | 30 |
Since 2006 (last 20 years) | 69 |
Descriptor
Data Analysis | 84 |
Interrater Reliability | 84 |
Foreign Countries | 20 |
Evaluation Methods | 18 |
Observation | 17 |
Scores | 17 |
Coding | 15 |
Correlation | 13 |
Data Collection | 13 |
Comparative Analysis | 12 |
Reliability | 12 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 6 |
Practitioners | 1 |
Location
Australia | 5 |
Netherlands | 4 |
Turkey | 4 |
Canada | 3 |
United Kingdom | 3 |
California | 2 |
China | 2 |
Connecticut | 2 |
Florida | 2 |
Greece | 2 |
India | 2 |
More ▼ |
Laws, Policies, & Programs
Temporary Assistance for… | 1 |
Assessments and Surveys
Iowa Tests of Basic Skills | 1 |
Motivation to Read Profile | 1 |
Program for International… | 1 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 1 |
Meets WWC Standards with or without Reservations | 1 |
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Victoria Reyes; Elizabeth Bogumil; Levin Elias Welch – Sociological Methods & Research, 2024
Transparency is once again a central issue of debate across types of qualitative research. Work on how to conduct qualitative data analysis, on the other hand, walks us through the step-by-step process on how to code and understand the data we've collected. Although there are a few exceptions, less focus is on transparency regarding…
Descriptors: Qualitative Research, Data Analysis, Guides, Databases
Cheung, Kason Ka Ching; Tai, Kevin W. H. – Research in Science & Technological Education, 2023
Background: Intercoder reliability is a statistic commonly reported by researchers to demonstrate the rigour of coding procedures during data analysis. Its importance is debatable in the analysis of qualitative interview data. It raises a question on whether researchers should identify the same codes and themes in a transcript or they should…
Descriptors: Interrater Reliability, Data Analysis, Interviews, Research Methodology
Dart, Evan H.; Radley, Keith C. – Psychology in the Schools, 2023
Single-case design is a research methodology that entails repeated measurement to assess the influence of an independent variable on a dependent variable over time. Data collected in this manner are regularly analyzed using visual analysis of data displayed in a linear graph. Although there is agreement regarding critical elements of visual…
Descriptors: Research Design, Research Methodology, Data Collection, Data Analysis
Xiner Liu; Andres Felipe Zambrano; Ryan S. Baker; Amanda Barany; Jaclyn Ocumpaugh; Jiayi Zhang; Maciej Pankiewicz; Nidhi Nasiar; Zhanlan Wei – Journal of Learning Analytics, 2025
This study explores the potential of the large language model GPT-4 as an automated tool for qualitative data analysis by educational researchers, exploring which techniques are most successful for different types of constructs. Specifically, we assess three different prompt engineering strategies -- Zero-shot, Few-shot, and Fewshot with…
Descriptors: Coding, Artificial Intelligence, Automation, Data Analysis
Kelly Little; Yongyue Qi; Vanessa D. Jewell – Journal of Occupational Therapy Education, 2023
The Occupation-Centered Intervention Assessment (OCIA) was developed as a reflective tool for students to improve their comprehension of occupation-centered practice. Finding new and innovative ways to incorporate occupation-centered assignments can serve as a strategy to develop student integration of occupation-centered practice and allow…
Descriptors: Occupational Therapy, Allied Health Occupations Education, Interrater Reliability, Intervention
Huckabee, Maggie-Lee; McIntosh, Theresa; Fuller, Laura; Curry, Morgan; Thomas, Paige; Walshe, Margaret; McCague, Ellen; Battel, Irene; Nogueira, Dalia; Frank, Ulrike; van den Engel-Hoek, Lenie; Sella-Weiss, Oshrat – International Journal of Language & Communication Disorders, 2018
Background: Clinical swallowing assessment is largely limited to qualitative assessment of behavioural observations. There are limited quantitative data that can be compared with a healthy population for identification of impairment. The Test of Masticating and Swallowing Solids (TOMASS) was developed as a quantitative assessment of solid bolus…
Descriptors: Medical Evaluation, Clinical Diagnosis, Motor Reactions, Reliability
Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018
The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…
Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability
Cascio, M. Ariel; Lee, Eunlye; Vaudrin, Nicole; Freedman, Darcy A. – Field Methods, 2019
In this article, we discuss methodological opportunities related to using a team-based approach for iterative-inductive analysis of qualitative data involving detailed open coding of semistructured interviews and focus groups. Iterative-inductive methods generate rich thematic analyses useful in sociology, anthropology, public health, and many…
Descriptors: Coding, Teamwork, Interrater Reliability, Data Analysis
Lin, Chih-Kai – Language Testing, 2017
Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…
Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy
Tengberg, Michael – Language Assessment Quarterly, 2018
Reading comprehension is often treated as a multidimensional construct. In many reading tests, items are distributed over reading process categories to represent the subskills expected to constitute comprehension. This study explores (a) the extent to which specified subskills of reading comprehension tests are conceptually conceivable to…
Descriptors: Reading Tests, Reading Comprehension, Scores, Test Results
McGee, Monnie – Journal of Statistics Education, 2019
In several sporting events, the winner is chosen on the basis of a subjective score. These sports include gymnastics, ice skating, and diving. Unlike for other subjectively judged sports, diving competitions consist of multiple rounds in quick succession on the same apparatus. These multiple rounds lead to an extra layer of complexity in the data,…
Descriptors: Data Use, Visualization, Interrater Reliability, Introductory Courses
Aragón, Sonia; Lapresa, Daniel; Arana, Javier; Anguera, M. Teresa; Garzón, Belén – Measurement in Physical Education and Exercise Science, 2017
Polar coordinate analysis is a powerful data reduction technique based on the Zsum statistic, which is calculated from adjusted residuals obtained by lag sequential analysis. Its use has been greatly simplified since the addition of a module in the free software program HOISAN for performing the necessary computations and producing…
Descriptors: Physical Activities, Track and Field, Data Analysis, Males
Temel, Gülhan Orekici; Erdogan, Semra; Selvi, Hüseyin; Kaya, Irem Ersöz – Educational Sciences: Theory and Practice, 2016
Studies based on longitudinal data focus on the change and development of the situation being investigated and allow for examining cases regarding education, individual development, cultural change, and socioeconomic improvement in time. However, as these studies require taking repeated measures in different time periods, they may include various…
Descriptors: Investigations, Sample Size, Longitudinal Studies, Interrater Reliability
He, Peng; Liu, Xiufeng; Zheng, Changlong; Jia, Mengying – Chemistry Education Research and Practice, 2016
This study intends to develop a standardized instrument for measuring classroom teaching and learning in secondary chemistry lessons. Based on previous studies and interviews with expert teachers, the progression of five quality levels was constructed hypothetically to represent the quality of chemistry lessons in Chinese secondary schools. The…
Descriptors: Foreign Countries, Secondary School Science, Science Instruction, Chemistry