Publication Date
In 2025 | 2 |
Since 2024 | 33 |
Descriptor
Source
Author
Bridget Poznanski | 2 |
Howard Abikoff | 2 |
Jenelle Nissley-Tsiopinis | 2 |
Laura Pendergast | 2 |
Lucy Chambers | 2 |
Shannon Ryan | 2 |
Thomas J. Power | 2 |
Alain Bengochea | 1 |
Albert Sesé | 1 |
Alberto Fernández-Costales | 1 |
Alexandra Budke | 1 |
More ▼ |
Publication Type
Journal Articles | 31 |
Reports - Research | 28 |
Information Analyses | 3 |
Reports - Descriptive | 2 |
Reports - Evaluative | 2 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 10 |
Postsecondary Education | 10 |
Elementary Education | 5 |
Secondary Education | 5 |
Middle Schools | 4 |
Junior High Schools | 3 |
Early Childhood Education | 2 |
Grade 6 | 1 |
Intermediate Grades | 1 |
Preschool Education | 1 |
Audience
Location
China | 2 |
Spain | 2 |
Africa | 1 |
Australia | 1 |
Ethiopia | 1 |
Germany | 1 |
Ohio (Cincinnati) | 1 |
Singapore | 1 |
United Kingdom (England) | 1 |
Laws, Policies, & Programs
Assessments and Surveys
International English… | 1 |
Self Description Questionnaire | 1 |
What Works Clearinghouse Rating
Ian Jones; Ben Davies – International Journal of Research & Method in Education, 2024
Educational researchers often need to construct precise and reliable measurement scales of complex and varied representations such as participants' written work, videoed lesson segments and policy documents. Developing such scales using can be resource-intensive and time-consuming, and the outcomes are not always reliable. Here we present…
Descriptors: Educational Research, Comparative Analysis, Educational Researchers, Measurement
Manjula Wijewickrema – portal: Libraries and the Academy, 2024
This research compares the performance measures reported by two bibliographic databases relevant to a set of authors who have published in predatory journals. The reliability of decision-making based on the information provided by uncontrolled bibliographic databases is examined to support rational decisions. A sample of authors who published in…
Descriptors: Periodicals, Ethics, Deception, Authors
Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024
The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…
Descriptors: Accuracy, Reliability, Computational Linguistics, Standards
Lucy Chambers; Emma Walland; Jo Ireland – Research Matters, 2024
Comparative Judgement (CJ) is traditionally and primarily used to compare written texts. In this study we explored whether we could extend its use to comparing audio files. We used GCSE Music portfolios which contained a mix of audio recordings, musical scores and text documents. Fifteen judges completed two exercises: one comparing musical…
Descriptors: Evaluative Thinking, Judges, Comparative Analysis, Reliability
Kiri Mealings; Kelly Miles; Joerg M. Buchholz – International Journal of Listening, 2025
A child's ability to comprehend speech in the mainstream classroom is vital for intellectual and social development. However, listening conditions are often sub-optimal; the presence of multiple talkers, high noise levels, and long reverberation times add to the challenge of listening with a developing auditory system. An assessment that captures…
Descriptors: Elementary School Students, Listening Comprehension Tests, Comparative Analysis, Speech Communication
Lucy Chambers; Sylvia Vitello; Carmen Vidal Rodeiro – Assessment in Education: Principles, Policy & Practice, 2024
In England, some secondary-level qualifications comprise non-exam assessments which need to undergo moderation before grading. Currently, moderation is conducted at centre (school) level. This raises challenges for maintaining the standard across centres. Recent technological advances enable novel moderation methods that are no longer bound by…
Descriptors: Foreign Countries, Evaluation Methods, Comparative Analysis, Grading
Antonio P. Gutierrez de Blume; Diana Marcela Montoya Londoño; Virginia Jiménez Rodríguez; Olivia Morán Núñez; Ariel Cuadro; Lilián Daset; Mauricio Molina Delgado; Claudia García de la Cadena; María Beatríz Beltrán Navarro; Aníbal Puente Ferreras; Sebastián Urquijo; Walter Lizandro Arias – Metacognition and Learning, 2024
Metacognition is defined as a higher-order thinking skill that enables individuals to monitor, control, and regulate their thinking and behavior. In education, this skill is important, as learners need to self-regulate their learning behaviors for successful lifelong learning. Thus, it is essential for educators and learners alike to know their…
Descriptors: Metacognition, Measures (Individuals), Psychometrics, Standards
Yuan Tian; Xi Yang; Suhail A. Doi; Luis Furuya-Kanamori; Lifeng Lin; Joey S. W. Kwong; Chang Xu – Research Synthesis Methods, 2024
RobotReviewer is a tool for automatically assessing the risk of bias in randomized controlled trials, but there is limited evidence of its reliability. We evaluated the agreement between RobotReviewer and humans regarding the risk of bias assessment based on 1955 randomized controlled trials. The risk of bias in these trials was assessed via two…
Descriptors: Risk, Randomized Controlled Trials, Classification, Robotics
Qusai Khraisha; Sophie Put; Johanna Kappenberg; Azza Warraitch; Kristin Hadfield – Research Synthesis Methods, 2024
Systematic reviews are vital for guiding practice, research and policy, although they are often slow and labour-intensive. Large language models (LLMs) could speed up and automate systematic reviews, but their performance in such tasks has yet to be comprehensively evaluated against humans, and no study has tested Generative Pre-Trained…
Descriptors: Peer Evaluation, Research Reports, Artificial Intelligence, Computer Software
Teck Kiang Tan – Practical Assessment, Research & Evaluation, 2024
The procedures of carrying out factorial invariance to validate a construct were well developed to ensure the reliability of the construct that can be used across groups for comparison and analysis, yet mainly restricted to the frequentist approach. This motivates an update to incorporate the growing Bayesian approach for carrying out the Bayesian…
Descriptors: Bayesian Statistics, Factor Analysis, Programming Languages, Reliability
Dadi Ramesh; Suresh Kumar Sanampudi – European Journal of Education, 2024
Automatic essay scoring (AES) is an essential educational application in natural language processing. This automated process will alleviate the burden by increasing the reliability and consistency of the assessment. With the advances in text embedding libraries and neural network models, AES systems achieved good results in terms of accuracy.…
Descriptors: Scoring, Essays, Writing Evaluation, Memory
Dae Woong Ham; Luke Miratrix – Grantee Submission, 2024
The consequence of a change in school leadership (e.g., principal turnover) on student achievement has important implications for education policy. The impact of such an event can be estimated via the popular Difference in Difference (DiD) estimator, where those schools with a turnover event are compared to a selected set of schools that did not…
Descriptors: Trend Analysis, Faculty Mobility, Academic Achievement, Principals
Marine Simon; Alexandra Budke – Journal of Geography in Higher Education, 2024
Comparison is an important geographic method and a common task in geography education. Mastering comparison is a complex competency and written comparisons are challenging tasks both for students and assessors. As yet, however, there is no set test for evaluating comparison competency nor tool for enhancing it. Moreover, little is known about…
Descriptors: Geography Instruction, Student Evaluation, Comparative Analysis, Reliability
Seyda Aydin-Karaca; Mustafa Serdar Köksal; Bilkay Bi – Journal of Psychoeducational Assessment, 2024
This study aimed to develop a parent rating scale (PRSG) for screening children for further identification process in terms of giftedness. The participants of the study were 255 parents of gifted and non-gifted students. The PRSG, consisting of 30 items, was created by consulting parents and reviewing instruments existent in the literature. As…
Descriptors: Rating Scales, Parent Attitudes, Scores, Comparative Analysis
Lisa Frances; Frances Quinn; Sue Elliott; Jo Bird – Australian Educational Researcher, 2024
In this article, we explore inconsistencies in the implementation of outdoor learning across Australian early years' education. The benefits of outdoor learning justify regular employment of this pedagogical approach in both early childhood education and primary school settings. Early childhood education services provide daily outdoor learning…
Descriptors: Foreign Countries, Outdoor Education, Program Implementation, Elementary Education