ERIC - Search Results

Publication Date

In 2025	2
Since 2024	33

Descriptor

Comparative Analysis	33
Reliability	25
Foreign Countries	10
Computer Software	9
Scores	9
Artificial Intelligence	8
Evaluators	7
Correlation	6
English (Second Language)	6
Second Language Learning	6
Teaching Methods	6
Test Reliability	6
Undergraduate Students	6
Validity	6
Computational Linguistics	5
Evaluation Methods	5
Psychometrics	5
Second Language Instruction	5
Writing Evaluation	5
Academic Achievement	4
Accuracy	4
Elementary School Students	4
Essays	4
Instructional Effectiveness	4
Rating Scales	4
More ▼

Publication Type

Journal Articles	31
Reports - Research	28
Information Analyses	3
Reports - Descriptive	2
Reports - Evaluative	2
Tests/Questionnaires	1

Education Level

Higher Education	10
Postsecondary Education	10
Elementary Education	5
Secondary Education	5
Middle Schools	4
Junior High Schools	3
Early Childhood Education	2
Grade 6	1
Intermediate Grades	1
Preschool Education	1

Audience

Location

China	2
Spain	2
Africa	1
Australia	1
Ethiopia	1
Germany	1
Ohio (Cincinnati)	1
Singapore	1
United Kingdom (England)	1

Laws, Policies, & Programs

Assessments and Surveys

International English…	1
Self Description Questionnaire	1

What Works Clearinghouse Rating

Showing 1 to 15 of 33 results Save | Export

Comparative Judgement in Education Research

Peer reviewed

Direct link

Ian Jones; Ben Davies – International Journal of Research & Method in Education, 2024

Educational researchers often need to construct precise and reliable measurement scales of complex and varied representations such as participants' written work, videoed lesson segments and policy documents. Developing such scales using can be resource-intensive and time-consuming, and the outcomes are not always reliable. Here we present…

Descriptors: Educational Research, Comparative Analysis, Educational Researchers, Measurement

Reality or Illusion: Comparing Google Scholar and Scopus Data for Predatory Journals

Peer reviewed

Direct link

Manjula Wijewickrema – portal: Libraries and the Academy, 2024

This research compares the performance measures reported by two bibliographic databases relevant to a set of authors who have published in predatory journals. The reliability of decision-making based on the information provided by uncontrolled bibliographic databases is examined to support rational decisions. A sample of authors who published in…

Descriptors: Periodicals, Ethics, Deception, Authors

Accuracy and Reliability of Large Language Models in Assessing Learning Outcomes Achievement across Cognitive Domains

Peer reviewed

Direct link

Swapna Haresh Teckwani; Amanda Huee-Ping Wong; Nathasha Vihangi Luke; Ivan Cherh Chiet Low – Advances in Physiology Education, 2024

The advent of artificial intelligence (AI), particularly large language models (LLMs) like ChatGPT and Gemini, has significantly impacted the educational landscape, offering unique opportunities for learning and assessment. In the realm of written assessment grading, traditionally viewed as a laborious and subjective process, this study sought to…

Descriptors: Accuracy, Reliability, Computational Linguistics, Standards

Comparing Music Recordings Using Pairwise Comparative Judgement: Exploring the Judge Experience

Download full text

Lucy Chambers; Emma Walland; Jo Ireland – Research Matters, 2024

Comparative Judgement (CJ) is traditionally and primarily used to compare written texts. In this study we explored whether we could extend its use to comparing audio files. We used GCSE Music portfolios which contained a mix of audio recordings, musical scores and text documents. Fifteen judges completed two exercises: one comparing musical…

Descriptors: Evaluative Thinking, Judges, Comparative Analysis, Reliability

A Methodological Review of Listening Comprehension Tests for Primary School Children

Peer reviewed

Direct link

Kiri Mealings; Kelly Miles; Joerg M. Buchholz – International Journal of Listening, 2025

A child's ability to comprehend speech in the mainstream classroom is vital for intellectual and social development. However, listening conditions are often sub-optimal; the presence of multiple talkers, high noise levels, and long reverberation times add to the challenge of listening with a developing auditory system. An assessment that captures…

Descriptors: Elementary School Students, Listening Comprehension Tests, Comparative Analysis, Speech Communication

Moderation of Non-Exam Assessments: A Novel Approach Using Comparative Judgement

Peer reviewed

Direct link

Lucy Chambers; Sylvia Vitello; Carmen Vidal Rodeiro – Assessment in Education: Principles, Policy & Practice, 2024

In England, some secondary-level qualifications comprise non-exam assessments which need to undergo moderation before grading. Currently, moderation is conducted at centre (school) level. This raises challenges for maintaining the standard across centres. Recent technological advances enable novel moderation methods that are no longer bound by…

Descriptors: Foreign Countries, Evaluation Methods, Comparative Analysis, Grading

Psychometric Properties of the Metacognitive Awareness Inventory (MAI): Standardization to an International Spanish with 12 Countries

Peer reviewed

Direct link

Antonio P. Gutierrez de Blume; Diana Marcela Montoya Londoño; Virginia Jiménez Rodríguez; Olivia Morán Núñez; Ariel Cuadro; Lilián Daset; Mauricio Molina Delgado; Claudia García de la Cadena; María Beatríz Beltrán Navarro; Aníbal Puente Ferreras; Sebastián Urquijo; Walter Lizandro Arias – Metacognition and Learning, 2024

Metacognition is defined as a higher-order thinking skill that enables individuals to monitor, control, and regulate their thinking and behavior. In education, this skill is important, as learners need to self-regulate their learning behaviors for successful lifelong learning. Thus, it is essential for educators and learners alike to know their…

Descriptors: Metacognition, Measures (Individuals), Psychometrics, Standards

Towards the Automatic Risk of Bias Assessment on Randomized Controlled Trials: A Comparison of RobotReviewer and Humans

Peer reviewed

Direct link

Yuan Tian; Xi Yang; Suhail A. Doi; Luis Furuya-Kanamori; Lifeng Lin; Joey S. W. Kwong; Chang Xu – Research Synthesis Methods, 2024

RobotReviewer is a tool for automatically assessing the risk of bias in randomized controlled trials, but there is limited evidence of its reliability. We evaluated the agreement between RobotReviewer and humans regarding the risk of bias assessment based on 1955 randomized controlled trials. The risk of bias in these trials was assessed via two…

Descriptors: Risk, Randomized Controlled Trials, Classification, Robotics

Can Large Language Models Replace Humans in Systematic Reviews? Evaluating GPT-4's Efficacy in Screening and Extracting Data from Peer-Reviewed and Grey Literature in Multiple Languages

Peer reviewed

Direct link

Qusai Khraisha; Sophie Put; Johanna Kappenberg; Azza Warraitch; Kristin Hadfield – Research Synthesis Methods, 2024

Systematic reviews are vital for guiding practice, research and policy, although they are often slow and labour-intensive. Large language models (LLMs) could speed up and automate systematic reviews, but their performance in such tasks has yet to be comprehensively evaluated against humans, and no study has tested Generative Pre-Trained…

Descriptors: Peer Evaluation, Research Reports, Artificial Intelligence, Computer Software

Frequentist and Bayesian Factorial Invariance Using R

Peer reviewed
PDF on ERIC

Download full text

Teck Kiang Tan – Practical Assessment, Research & Evaluation, 2024

The procedures of carrying out factorial invariance to validate a construct were well developed to ensure the reliability of the construct that can be used across groups for comparison and analysis, yet mainly restricted to the frequentist approach. This motivates an update to incorporate the growing Bayesian approach for carrying out the Bayesian…

Descriptors: Bayesian Statistics, Factor Analysis, Programming Languages, Reliability

Coherence-Based Automatic Short Answer Scoring Using Sentence Embedding

Peer reviewed

Direct link

Dadi Ramesh; Suresh Kumar Sanampudi – European Journal of Education, 2024

Automatic essay scoring (AES) is an essential educational application in natural language processing. This automated process will alleviate the burden by increasing the reliability and consistency of the assessment. With the advances in text embedding libraries and neural network models, AES systems achieved good results in terms of accuracy.…

Descriptors: Scoring, Essays, Writing Evaluation, Memory

Benefits and Costs of Matching Prior to a Difference in Differences Analysis When Parallel Trends Does Not Hold

Peer reviewed

Direct link

Dae Woong Ham; Luke Miratrix – Grantee Submission, 2024

The consequence of a change in school leadership (e.g., principal turnover) on student achievement has important implications for education policy. The impact of such an event can be estimated via the popular Difference in Difference (DiD) estimator, where those schools with a turnover event are compared to a selected set of schools that did not…

Descriptors: Trend Analysis, Faculty Mobility, Academic Achievement, Principals

Students' Comparison Competencies in Geography: Results from an Explorative Assessment Study

Peer reviewed

Direct link

Marine Simon; Alexandra Budke – Journal of Geography in Higher Education, 2024

Comparison is an important geographic method and a common task in geography education. Mastering comparison is a complex competency and written comparisons are challenging tasks both for students and assessors. As yet, however, there is no set test for evaluating comparison competency nor tool for enhancing it. Moreover, little is known about…

Descriptors: Geography Instruction, Student Evaluation, Comparative Analysis, Reliability

Adaptation and Development of Parent Rating Scale for Giftedness

Peer reviewed

Direct link

Seyda Aydin-Karaca; Mustafa Serdar Köksal; Bilkay Bi – Journal of Psychoeducational Assessment, 2024

This study aimed to develop a parent rating scale (PRSG) for screening children for further identification process in terms of giftedness. The participants of the study were 255 parents of gifted and non-gifted students. The PRSG, consisting of 30 items, was created by consulting parents and reviewing instruments existent in the literature. As…

Descriptors: Rating Scales, Parent Attitudes, Scores, Comparative Analysis

Outdoor Learning across the Early Years in Australia: Inconsistencies, Challenges, and Recommendations

Peer reviewed

Direct link

Lisa Frances; Frances Quinn; Sue Elliott; Jo Bird – Australian Educational Researcher, 2024

In this article, we explore inconsistencies in the implementation of outdoor learning across Australian early years' education. The benefits of outdoor learning justify regular employment of this pedagogical approach in both early childhood education and primary school settings. Early childhood education services provide daily outdoor learning…

Descriptors: Foreign Countries, Outdoor Education, Program Implementation, Elementary Education

Previous Page | Next Page »

Pages: 1 | 2 | 3

Advances in Physiology…	2
Grantee Submission	2
Research Synthesis Methods	2
ACT, Inc.	1
Assessment in Education:…	1
Australian Educational…	1
British Journal of…	1
British Journal of…	1
Education and Information…	1
European Journal of Education	1
International Journal of…	1
International Journal of…	1
Journal for Multicultural…	1
Journal of Education and…	1
Journal of Educational and…	1
Journal of Geography in…	1
Journal of Psychoeducational…	1
Journal of Technology and…	1
Language Teaching Research…	1
Language, Speech, and Hearing…	1
Metacognition and Learning	1
Physical Review Physics…	1
Practical Assessment,…	1
Reading & Writing Quarterly	1
Research Matters	1
More ▼

Bridget Poznanski	2
Howard Abikoff	2
Jenelle Nissley-Tsiopinis	2
Laura Pendergast	2
Lucy Chambers	2
Shannon Ryan	2
Thomas J. Power	2
Alain Bengochea	1
Albert Sesé	1
Alberto Fernández-Costales	1
Alexandra Budke	1
Allan S. Cohen	1
Amanda Huee-Ping Wong	1
Amssalu Wondmagegn Getu	1
Andrew R. Thompson	1
Antonio P. Gutierrez de Blume	1
Aníbal Puente Ferreras	1
Ariel Cuadro	1
Azza Warraitch	1
Ben Davies	1
Bilkay Bi	1
Brian Weiler	1
Burkhard Priemer	1
Carmen Vidal Rodeiro	1
Chang Xu	1
More ▼