NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 631 to 645 of 26,352 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Tugra Karademir Coskun; Ayfer Alper – Digital Education Review, 2024
This study aims to examine the potential differences between teacher evaluations and artificial intelligence (AI) tool-based assessment systems in university examinations. The research has evaluated a wide spectrum of exams including numerical and verbal course exams, exams with different assessment styles (project, test exam, traditional exam),…
Descriptors: Artificial Intelligence, Visual Aids, Video Technology, Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Alper Börekci; Esra Dalkiran; Zeki Nacakci – International Journal of Music Education, 2024
The Music Performance Self-Efficacy Scale (MPSES) is an important scale designed to reflect the four sources of self-efficacy of Bandura by Zelenak, and has been used in many studies of music education in the international literature in recent years. This study was carried out to ensure the validity and reliability of the Turkish translation of…
Descriptors: Translation, Test Validity, Test Reliability, Music
Peer reviewed Peer reviewed
Direct linkDirect link
Reeta Neittaanmäki; Iasonas Lamprianou – Language Testing, 2024
This article focuses on rater severity and consistency and their relation to different types of rater experience over a long period of time. The article is based on longitudinal data collected from 2009 to 2019 from the second language Finnish speaking subtest in the National Certificates of Language Proficiency in Finland. The study investigated…
Descriptors: Foreign Countries, Interrater Reliability, Error of Measurement, Experience
Lauren Westerberg – ProQuest LLC, 2024
A major challenge to promoting effective early science and engineering education is the lack of reliable and validated assessments that align with current educational guidelines for science and engineering. Existing early science and engineering assessments either cover a narrow range of concepts and practices and/or are not designed in a way to…
Descriptors: Preschool Curriculum, Preschool Education, Preschool Evaluation, Preschool Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Ashani Jayasekera; Laura Stapleton – Society for Research on Educational Effectiveness, 2024
Background: A growing number of surveys are conducted online where respondents can choose to complete the questionnaire (Lehdonvirta et al., 2020). As respondents are self-selected, there is potential that the respondents will not be an accurate representation of the population. For example, white people are disproportionately more likely to…
Descriptors: Online Surveys, Test Construction, Test Validity, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Rebernik, Teja; Jacobi, Jidde; Tiede, Mark; Wieling, Martijn – Journal of Speech, Language, and Hearing Research, 2021
Purpose: This study compares two electromagnetic articulographs manufactured by Northern Digital, Inc.: the NDI Wave System (from 2008) and the NDI Vox-EMA System (from 2020). Method: Four experiments were completed: (1) comparison of statically positioned sensors; (2) tracking dynamic movements of sensors manipulated using a motor-driven LEGO…
Descriptors: Measurement Equipment, Articulation (Speech), Accuracy, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Seeber, Marco; Vlegels, Jef; Reimink, Elwin; Marusic, Ana; Pina, David G. – Research Evaluation, 2021
We have limited understanding of why reviewers tend to strongly disagree when scoring the same research proposal. Thus far, research that explored disagreement has focused on the characteristics of the proposal or the applicants, while ignoring the characteristics of the reviewers themselves. This article aims to address this gap by exploring…
Descriptors: Foreign Countries, Evaluators, Interrater Reliability, Research Proposals
Peer reviewed Peer reviewed
Direct linkDirect link
Hogan, Thomas; DeStefano, Marissa; Gilby, Caitlin; Kosman, Dana; Peri, Joshua – Applied Measurement in Education, 2021
Buros' "Mental Measurements Yearbook (MMY)" has provided professional reviews of commercially published psychological and educational tests for over 80 years. It serves as a kind of conscience for the testing industry. For a random sample of 50 entries in the "19th MMY" (a total of 100 separate reviews) this study determined…
Descriptors: Test Reviews, Interrater Reliability, Psychological Testing, Educational Testing
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Kazimi, Parviz Firudin Oqlu – Journal of Practical Studies in Education, 2021
The reliability of information in the global information space is one of the most important problems of globalization. The credibility of various information resources is currently being studied and considered in different ways. In some cases, the problem of the reliability of information can be assessed as harmful and dangerous. This article,…
Descriptors: Information Sources, Reliability, Credibility, Classification
Lambert, Richard G.; Holcomb, T. Scott; Bottoms, Bryndle L. – Center for Educational Measurement and Evaluation, 2021
The validity of the Kappa coefficient of chance-corrected agreement has been questioned when the prevalence of specific rating scale categories is low and agreement between raters is high. The researchers proposed the Lambda Coefficient of Rater-Mediated Agreement as an alternative to Kappa to address these concerns. Lambda corrects for chance…
Descriptors: Interrater Reliability, Teacher Evaluation, Test Validity, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Sondergeld, Toni A.; Johnson, Carla C. – School Science and Mathematics, 2019
In response to the call for more rigorously validated educational assessments, this study used an iterative multimethod validation process to develop and validate outcomes from the 21st Century Skills Assessment global rating scale. Qualitative and quantitative data sources were used to inform four types of validity evidence: content, response…
Descriptors: 21st Century Skills, Test Construction, Test Validity, Educational Assessment
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Gersten, Russell; Jayanthi, Madhavi; Newman-Gonchar, Rebecca; Anderson, Daniel; Spallone, Samantha; Taylor, Mary Jo – Regional Educational Laboratory Southeast, 2020
Several school districts in Georgia use two teacher-administered diagnostic assessments of student knowledge of mathematics as part of their multi-tiered system of support in grades K-8: the Global Strategy Stage (GloSS; New Zealand Ministry of Education, 2012) and the Individual Knowledge Assessment of Number (IKAN; New Zealand Ministry of…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Reliability, Test Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Regional Educational Laboratory Southeast, 2020
This document are the appendixes for the report, "The Reliability and Consequential Validity of Two Teacher-Administered Student Mathematics Diagnostic Assessments." Rather than relying on occasional testimonials from the field, decisions about using diagnostic assessments across the state should be based on psychometric data from an…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Reliability, Test Validity
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Regional Educational Laboratory Southeast, 2020
Teachers need to assess their students' current level of mathematical understanding to provide appropriate interventions for students who are struggling. Several school districts in Georgia currently use two assessments for this purpose--the Global Strategy Stage (GloSS) and the Individual Knowledge Assessment of Number (IKAN). The IKAN is…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Reliability, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Raymond, Mark R.; Jiang, Zhehan – Educational and Psychological Measurement, 2020
Conventional methods for evaluating the utility of subscores rely on traditional indices of reliability and on correlations among subscores. One limitation of correlational methods is that they do not explicitly consider variation in subtest means. An exception is an index of score profile reliability designated as [G], which quantifies the ratio…
Descriptors: Generalizability Theory, Multivariate Analysis, Scores, Reliability
Pages: 1  |  ...  |  39  |  40  |  41  |  42  |  43  |  44  |  45  |  46  |  47  |  ...  |  1757