Publication Date
In 2025 | 3 |
Since 2024 | 29 |
Descriptor
Reliability | 19 |
Test Reliability | 9 |
Evaluation Methods | 8 |
Test Validity | 6 |
Artificial Intelligence | 5 |
Foreign Countries | 5 |
Standards | 5 |
Validity | 5 |
Computer Assisted Testing | 4 |
Error of Measurement | 4 |
Ethics | 4 |
More ▼ |
Source
Author
Amit Sevak | 2 |
Daniel Fishtein | 2 |
Ikkyu Choi | 2 |
Jesse Sparks | 2 |
Teresa Ober | 2 |
Aditya Shah | 1 |
Adriana Lis | 1 |
Ajay Devmane | 1 |
Alexia E Metz | 1 |
Alexis Thomas | 1 |
Ali Lucas Winterburn | 1 |
More ▼ |
Publication Type
Reports - Evaluative | 29 |
Journal Articles | 26 |
Information Analyses | 1 |
Numerical/Quantitative Data | 1 |
Speeches/Meeting Papers | 1 |
Education Level
Higher Education | 7 |
Postsecondary Education | 7 |
Early Childhood Education | 2 |
Elementary Education | 2 |
Adult Education | 1 |
Elementary Secondary Education | 1 |
Secondary Education | 1 |
Audience
Policymakers | 1 |
Teachers | 1 |
Location
Australia | 1 |
District of Columbia | 1 |
Italy | 1 |
Massachusetts | 1 |
Ohio | 1 |
Singapore | 1 |
South Africa | 1 |
Tennessee | 1 |
Texas | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Massachusetts Comprehensive… | 1 |
Program for International… | 1 |
Trends in International… | 1 |
What Works Clearinghouse Rating
John R. Donoghue; Carol Eckerly – Applied Measurement in Education, 2024
Trend scoring constructed response items (i.e. rescoring Time A responses at Time B) gives rise to two-way data that follow a product multinomial distribution rather than the multinomial distribution that is usually assumed. Recent work has shown that the difference in sampling model can have profound negative effects on statistics usually used to…
Descriptors: Scoring, Error of Measurement, Reliability, Scoring Rubrics
Wiebe Koopal – Studies in Philosophy and Education, 2024
In this paper I try to 'rethink' consistency as an educational quality for the 3rd millennium, following Italo Calvino's choice to take it up in his lecture series Memos for the Next Millennium, and despite the fact that the (final) lecture devoted to this quality remained unwritten. After reflecting on how consistency already plays a certain role…
Descriptors: Reliability, Education, Instruction, Lecture Method
Bennett L. Schwartz – Metacognition and Learning, 2024
Retrospective confidence refers to the phenomenological experience of the level of certainty that retrieved information is, in fact, correct. Retrospective confidence judgments are examined across a range of sub-disciplines in psychology from perception to memory research, and in education and legal applications. This paper focuses on…
Descriptors: Memory, Recall (Psychology), Cues, Learning Processes
Atli Harðarson – Educational Philosophy and Theory, 2024
This paper has two aims. One is to draw a distinction between two types of trust. The other is to argue for its applicability in academic discourse on educational policies. One of the two types of trust is "ethical trust" that rests on beliefs about others' ethical virtues. The other is "institutional trust" that typically…
Descriptors: Trust (Psychology), Ethics, Reliability, Schools
Courtney Bell; Jessalynn James; Eric S. Taylor; James Wyckoff – Journal of Policy Analysis and Management, 2025
We study the returns to experience in teaching, estimated using supervisor ratings from classroom observations. We describe the assumptions required to interpret changes in observation ratings over time as the causal effect of experience on performance. We compare two difference-in-differences strategies: the two-way fixed effects estimator common…
Descriptors: Lesson Observation Criteria, Teaching Experience, Teacher Evaluation, Supervisors
Robert H. Eaglen; Steven J. Durning; Holly S. Meyer; Christopher S. Candler – Quality in Higher Education, 2024
Higher education accreditation has spread internationally as a vehicle for quality assurance and improvement but is strongly influenced by accreditation practices in the United States. The organisational structure and processes of seven United States health professions accreditors were analysed to identify common characteristics that reflect…
Descriptors: Accreditation (Institutions), Quality Assurance, Evaluators, Evaluation Methods
Aditya Shah; Ajay Devmane; Mehul Ranka; Prathamesh Churi – Education and Information Technologies, 2024
Online learning has grown due to the advancement of technology and flexibility. Online examinations measure students' knowledge and skills. Traditional question papers include inconsistent difficulty levels, arbitrary question allocations, and poor grading. The suggested model calibrates question paper difficulty based on student performance to…
Descriptors: Computer Assisted Testing, Difficulty Level, Grading, Test Construction
Yuan Tian; Xi Yang; Suhail A. Doi; Luis Furuya-Kanamori; Lifeng Lin; Joey S. W. Kwong; Chang Xu – Research Synthesis Methods, 2024
RobotReviewer is a tool for automatically assessing the risk of bias in randomized controlled trials, but there is limited evidence of its reliability. We evaluated the agreement between RobotReviewer and humans regarding the risk of bias assessment based on 1955 randomized controlled trials. The risk of bias in these trials was assessed via two…
Descriptors: Risk, Randomized Controlled Trials, Classification, Robotics
Tenko Raykov – Educational and Psychological Measurement, 2024
This note is concerned with the benefits that can result from the use of the maximal reliability and optimal linear combination concepts in educational and psychological research. Within the widely used framework of unidimensional multi-component measuring instruments, it is demonstrated that the linear combination of their components that…
Descriptors: Educational Research, Behavioral Science Research, Reliability, Error of Measurement
Maria Mirandi; Adriana Lis; Claudia Mazzeschi; Elisa Delvecchio – European Journal of Developmental Psychology, 2024
Perceived parenting is a crucial and complex factor for the psychological well-being of adolescents. The Adolescent Family Process Short-Form (AFP-SF) investigates the perception of adolescents' maternal and paternal parenting across six dimensions: closeness, support, communication, monitoring, peer approval, and conflict. This was the first…
Descriptors: Foreign Countries, Adolescents, Parent Child Relationship, Adolescent Attitudes
Mansi Wadhwa; Jingwen Zheng; Thomas D. Cook – Review of Educational Research, 2024
Clearinghouses set standards of scientific quality to vet existing research to determine how "evidence-based" an intervention is. This paper examines 12 educational clearinghouses to describe their effectiveness criteria, to estimate how consistently[underlined] they rate the same program, and to probe why their judgments differ. All the…
Descriptors: Clearinghouses, Standards, Evaluation Criteria, Reliability
Steven A. Stolz; Ali Lucas Winterburn; Edward Palmer – Educational Philosophy and Theory, 2024
The recent proliferation of Large Language Models (LLMs) raises questions as to the role of such tools both within an educational learning environment and their epistemic capacity. If, as Alfred North Whitehead remarked, western philosophy indeed 'consists of a series of footnotes to Plato', it would be of doubtless importance to evaluate the…
Descriptors: Artificial Intelligence, Technology Uses in Education, Natural Language Processing, Philosophy
Lisa Frances; Frances Quinn; Sue Elliott; Jo Bird – Australian Educational Researcher, 2024
In this article, we explore inconsistencies in the implementation of outdoor learning across Australian early years' education. The benefits of outdoor learning justify regular employment of this pedagogical approach in both early childhood education and primary school settings. Early childhood education services provide daily outdoor learning…
Descriptors: Foreign Countries, Outdoor Education, Program Implementation, Elementary Education
Marsela Thanasi-Boçe; Julian Hoxha – Education and Information Technologies, 2024
Entrepreneurship education has evolved to meet the demands of a dynamic business environment, necessitating innovative teaching methods to prepare entrepreneurs for market uncertainties. Large Language Models (LLMs) like the Generative Pre-trained Transformer 4 (GPT-4), recognized for their exceptional performance on public datasets, are examined…
Descriptors: Entrepreneurship, Business Administration Education, Technology Integration, Artificial Intelligence
Stephen Downes – International Association for Development of the Information Society, 2024
This paper reports on literature related to the assessment of barriers to educational technology assessment. It surveys the development of technology acceptance models from social cognitive theory and innovation diffusion theory through to a unified theory that considers performance expectancy, effort expectancy, and social influence. Because risk…
Descriptors: Barriers, Educational Technology, Attitudes, Risk Assessment
Previous Page | Next Page »
Pages: 1 | 2