Publication Date
In 2025 | 67 |
Since 2024 | 901 |
Since 2021 (last 5 years) | 3415 |
Since 2016 (last 10 years) | 7595 |
Since 2006 (last 20 years) | 14761 |
Descriptor
Test Reliability | 14547 |
Test Validity | 9865 |
Reliability | 9544 |
Foreign Countries | 6751 |
Test Construction | 4608 |
Validity | 4120 |
Measures (Individuals) | 3750 |
Factor Analysis | 3720 |
Psychometrics | 3393 |
Interrater Reliability | 3054 |
Correlation | 3009 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 703 |
Practitioners | 447 |
Teachers | 204 |
Administrators | 121 |
Policymakers | 62 |
Counselors | 42 |
Students | 37 |
Parents | 11 |
Community | 7 |
Media Staff | 5 |
Support Staff | 5 |
More ▼ |
Location
Turkey | 1246 |
Australia | 428 |
Canada | 371 |
China | 329 |
United States | 264 |
United Kingdom | 246 |
Taiwan | 221 |
Netherlands | 217 |
Indonesia | 214 |
California | 208 |
Spain | 201 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 8 |
Meets WWC Standards with or without Reservations | 9 |
Does not meet standards | 6 |
Pauline Frizelle; Ana Buckley; Tricia Biancone; Anna Ceroni; Darren Dahly; Paul Fletcher; Dorothy V. M. Bishop; Cristina McKean – Journal of Child Language, 2024
This study reports on the feasibility of using the Test of Complex Syntax- Electronic (TECS-E), as a self-directed app, to measure sentence comprehension in children aged 4 to 5 ½ years old; how testing apps might be adapted for effective independent use; and agreement levels between face-to-face supported computerized and independent computerized…
Descriptors: Language Processing, Computer Software, Language Tests, Syntax
Fumei Liu – Cogent Education, 2024
This paper details how to effectively share three-dimensional geological models using data conversion between two mainstream mining software, Micromine and Surpac. It also discusses the impact of this conversion method on geological integrated exploration decision-making guidance. The current situation primarily manifests in the fact that both…
Descriptors: Computer Software, Geology, Models, Decision Making
Trixy Elizabeth John; Benny Thomas; N. T. Sudhesh; Santhosh Kareepadath Rajan – Asia-Pacific Education Researcher, 2024
In this article, we report the development and psychometric validation of the Teachers' Receptivity to Change Scale (TRCS). The sample included secondary school teachers of Kerala, India. In India, the teachers' receptivity to change becomes important in the context of the newly drafted National Education Policy, (2020) which places teachers' at…
Descriptors: Foreign Countries, Secondary School Teachers, Teacher Response, Test Reliability
Sevil Cicek Ozdemir; Ayten Senturk Erenel – Health Education & Behavior, 2024
It is obvious that current tools in literature that are used to measure female's sexual quality of life focus only on the objective dimension of sexual function, failing to examine quality of life on a multidimensional level. The aim of this research is to examine the validity and reliability of the ADORE for Turkish society. In the methodological…
Descriptors: Turkish, Test Validity, Test Reliability, Females
Jeff Witmer – Journal of Statistics and Data Science Education, 2024
Data reported from memory can be unreliable. A simple activity lets students experience this firsthand.
Descriptors: Memory, Trust (Psychology), Reliability, Class Activities
Rajeshwari Panigrahi; Khaliq Lubza Nihar; Neha Singh – Higher Learning Research Communications, 2024
Objective: This study aimed to develop and test a scale for measuring the quality of blended learning models in higher education. Methods: This research adopts a sequential mixed-method approach to construct a new measurement scale. The first phase consisted of the inductive approach to identify the items, followed by exploratory factor analysis.…
Descriptors: Blended Learning, Educational Quality, Higher Education, Test Construction
Benjamin R. Shear; Derek C. Briggs – Asia Pacific Education Review, 2024
Research in the social and behavioral sciences relies on a wide range of experimental and quasi-experimental designs to estimate the causal effects of specific programs, policies, and events. In this paper we highlight measurement issues relevant to evaluating the validity of causal estimation and generalization. These issues impact all four…
Descriptors: Measurement Techniques, Inferences, COVID-19, Pandemics
Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
Gunjawate, Dhanshree R.; Ravi, Rohit; Bhagavan, Srividya – Journal of Speech, Language, and Hearing Research, 2020
Purpose: The purpose of this study was to evaluate the reliability and validity of the Kannada version of the Consensus Auditory-Perceptual Evaluation of Voice (CAPE-V). Method: The Kannada version of CAPE-V comprises six phrases that are phonetically designed as per the CAPE-V requirements. Sixty-five (21 individuals with dysphonia and 44…
Descriptors: Test Reliability, Test Validity, Dravidian Languages, Voice Disorders
Thompson, W. Jake; Nash, Brooke; Clark, Amy K.; Hoover, Jeffrey C. – Journal of Educational Measurement, 2023
As diagnostic classification models become more widely used in large-scale operational assessments, we must give consideration to the methods for estimating and reporting reliability. Researchers must explore alternatives to traditional reliability methods that are consistent with the design, scoring, and reporting levels of diagnostic assessment…
Descriptors: Diagnostic Tests, Simulation, Test Reliability, Accuracy
Zigler, Christina K.; Lin, Li; McFatrich, Molly; Lucas, Nicole; Gordon, Kelly L.; Jones, Harrison N.; Berent, Allyson; Panagoulias, Jennifer; Evans, Paula; Reeve, Bryce B. – American Journal on Intellectual and Developmental Disabilities, 2023
There is a critical need for high-quality clinical outcome assessments to capture the important aspects of communication ability of individuals with Angelman syndrome (AS). To center the perspective of caregivers, our team developed the novel Observer-Reported Communication Ability (ORCA) measure using best practice guidelines, with the goal of…
Descriptors: Genetic Disorders, Test Validity, Observation, Communication Skills
Ates, Esin; Konal Korkmaz, Ebru; Temel, Ayla Baylk – Journal of School Health, 2023
Background: Appropriate diagnosis of sleep problems is crucial, given the importance of sleep in childhood development. The Sleep Self-Report Scale (SSRS) is used to assess children's sleep problems in the United States and Spain, and this study aimed to expand the usability of this instrument by evaluating its validity and reliability in Turkish…
Descriptors: Foreign Countries, Sleep, Child Health, Test Validity
Weingarden, Merav; Heyd-Metzuyanim, Einat – Journal of Mathematics Teacher Education, 2023
In this study, we examine "what went wrong" in our professional development program for encouraging cognitively demanding instruction, focusing on the difficulties we encountered in using an observational tool for evaluating this type of instruction and reaching inter-rater reliability. We do so through the lens of a discursive theory of…
Descriptors: Mathematics Instruction, Interrater Reliability, Cognitive Processes, Difficulty Level
Cheng, Yao-Chung – Asia-Pacific Education Researcher, 2023
A principals' school management imaginative capability is the cornerstone of visionary leadership. Based on the imagination theory, this study constructed a Principal's School Management Imaginative Capability Scale (PSMICS). Questionnaires were conducted through stratified random sampling. Thirteen hundred and two valid samples were obtained.…
Descriptors: Principals, Leadership, Imagination, Test Construction
Chamba-Eras, Luis; Arruarte, Ana; Elorriaga, Jon A. – IEEE Transactions on Learning Technologies, 2023
In the context of virtual learning communities (VLCs), where the participants may not know each other, it is necessary to have a mechanism to help when deciding who to work with and what reliable contents and information sources are. This study aims to design a generic trust model, named T-VLC, applicable to VLCs, which can be adapted to different…
Descriptors: Communities of Practice, Electronic Learning, Trust (Psychology), Models