Sella-Weiss, Oshrat – International Journal of Language & Communication Disorders, 2023
Background: Quantitative measures can increase precision in describing swallowing function, improve interrater and test-retest reliability, and advance clinical decision-making. The Test of Mastication and Swallowing Solids (TOMASS) and the Timed Water Swallow Test (TWST) are functional tests for swallowing that provide quantitative results. Aims:…
Descriptors: Human Body, Motor Reactions, Tests, Test Reliability
Cheung, Kason Ka Ching; Tai, Kevin W. H. – Research in Science & Technological Education, 2023
Background: Intercoder reliability is a statistic commonly reported by researchers to demonstrate the rigour of coding procedures during data analysis. Its importance is debatable in the analysis of qualitative interview data. It raises a question on whether researchers should identify the same codes and themes in a transcript or they should…
Descriptors: Interrater Reliability, Data Analysis, Interviews, Research Methodology
Atilgan, Hakan; Demir, Elif Kübra; Ogretmen, Tuncay; Basokcu, Tahsin Oguz – International Journal of Progressive Education, 2020
A critical question is what level of reliability can be expected when open-ended questions are used in large-scale selection tests. One of the aims of the present study is to determine what the reliability would be in the event that the answers given by test-takers are scored by experts when open-ended short answer questions are used in…
Descriptors: Foreign Countries, Secondary School Students, Test Items, Test Reliability
Kroc, Edward; Olvera Astivia, Oscar L. – Educational and Psychological Measurement, 2022
Setting cutoff scores is one of the most common practices when using scales to aid in classification purposes. This process is usually done univariately where each optimal cutoff value is decided sequentially, subscale by subscale. While it is widely known that this process necessarily reduces the probability of "passing" such a test,…
Descriptors: Multivariate Analysis, Cutting Scores, Classification, Measurement
Yücel Makaraci; Kazim Nas; Kerem Gündüz; Abdullah Uysal; Samuel T. Orange; Juan D. Ruiz-Cárdenas – Measurement in Physical Education and Exercise Science, 2024
The aim was to determine the validity and test-retest reliability of the Sit to Stand App variables (rising time, vertical velocity, and power) for measuring single-leg sit-to-stand (STS) test compared to those derived from ground reaction force data. Twenty-seven female athletes performed the single-leg STS test over three consecutive sessions…
Descriptors: Computer Simulation, Measurement Techniques, Athletics, Physical Fitness
Tugra Karademir Coskun; Ayfer Alper – Digital Education Review, 2024
This study aims to examine the potential differences between teacher evaluations and artificial intelligence (AI) tool-based assessment systems in university examinations. The research has evaluated a wide spectrum of exams including numerical and verbal course exams, exams with different assessment styles (project, test exam, traditional exam),…
Descriptors: Artificial Intelligence, Visual Aids, Video Technology, Tests
Fergadiotis, Gerasimos; Casilio, Marianne; Dickey, Michael Walsh; Steel, Stacey; Nicholson, Hannele; Fleegle, Mikala; Swiderski, Alexander; Hula, William D. – Journal of Speech, Language, and Hearing Research, 2023
Purpose: Item response theory (IRT) is a modern psychometric framework with several advantageous properties as compared with classical test theory. IRT has been successfully used to model performance on anomia tests in individuals with aphasia; however, all efforts to date have focused on noun production accuracy. The purpose of this study is to…
Descriptors: Item Response Theory, Psychometrics, Verbs, Naming
Rushton, Nicky; Vitello, Sylvia; Suto, Irenka – Research Matters, 2021
It is important to define what an error in a question paper is so that there is a common understanding and to avoid people's own conceptions impacting upon the way in which they write or check question papers. We carried out an interview study to investigate our colleagues' definitions of error. We found that there is no single accepted definition…
Descriptors: Definitions, Tests, Foreign Countries, Problems
Wesolowski, Brian C. – Music Educators Journal, 2020
Validity, reliability, and fairness are three prominent indicators for evaluating the quality of assessment processes. Each of the indicators is most often written about and applied in the context of large-scale assessment. As a result, the technical properties of these indicators make them limited in both their practicality and relevance for…
Descriptors: Music Education, Test Validity, Test Reliability, Student Evaluation
Wesolowski, Brian C.; Wind, Stefanie A. – Journal of Educational Measurement, 2019
Rater-mediated assessments are a common methodology for measuring persons, investigating rater behavior, and/or defining latent constructs. The purpose of this article is to provide a pedagogical framework for examining rater variability in the context of rater-mediated assessments using three distinct models. The first model is the observation…
Descriptors: Interrater Reliability, Models, Observation, Measurement
Miranda, Constanza; Goñi, Julian; Pickenpack, Astrid; Sotomayor, Trinidad – International Journal of Technology and Design Education, 2022
K-12 Engineering Education has placed a lot of attention on students' attitudes or predispositions towards science and technology. However, most assessment methods are focused on STEM as a whole or only on technology. In this article, we will discuss the instrument called Technology and Engineering Attitude Scale (TEAS) which focuses on attitudes…
Descriptors: Elementary Secondary Education, Engineering Education, Test Validity, Foreign Countries
Maxwell, Bruce; Boon, Helen; Tanchuk, Nicolas; Rauwerda, Bryan – Journal of Moral Education, 2021
This article documents the adaptation, piloting and validation of a measure of teachers' ethical sensitivity. To create the test, we modified a measure from dentistry drawing on literature in teacher professional ethics and drew on the expertise of professional ethics scholars and practitioners. Based on the results of Rasch analysis combined with…
Descriptors: Ethics, Moral Values, Scores, Teacher Education Programs
Koçak, Duygu – International Journal of Progressive Education, 2020
The aim of this study was to determine the effect of chance success on test equating. For this purpose, artificially generated data sets with sample sizes of 500 and 1000 were equated using linear equating and equipercentile equating methods. In the simulated data, a total of four cases were created with no…
Descriptors: Test Theory, Equated Scores, Error of Measurement, Sample Size
Liu, Xiaolu; Keating, Xiaofen D. – European Physical Education Review, 2021
Pre-service physical education teachers (PPETs) may be implementing health-related fitness testing (HRFT) in schools in the future. Thus, exploring their attitudes toward HRFT would help us understand physical education (PE) teachers' attitudes toward HRFT. This study investigated PPET attitudes toward HRFT in the USA and the effects of teacher…
Descriptors: Preservice Teachers, Physical Education Teachers, Student Attitudes, Physical Fitness
Alkis Küçükaydin, Mensure; Akkanat, Çigdem – Problems of Education in the 21st Century, 2022
Computational thinking is recognized as a vital skill related to problem-solving in technological and non-technological fields. The existence of different sub-domains related to this skill has been pointed out. Therefore, there is a need for tools that measure these different sub-domains. Because of its structure that includes different skills,…
Descriptors: Elementary School Students, Thinking Skills, Computation, Tests