Matthew K. Burns; Heba Z. Abdelnaby; Jonie B. Welland; Katherine A. Graves; Kari Kurto – Assessment for Effective Intervention, 2024
The current study examined the reliability of The Reading League Curriculum-Evaluation Guidelines (CEGs), which were developed to help school-based teams rate the presence of red flags when considering adopting specific literacy curricula. Coders (n = 30) independently used the CEGs to evaluate a free online English language arts curriculum. The…
Descriptors: English Curriculum, English Instruction, Language Arts, Curriculum Evaluation
Ichikowitz, Kerri; Bruce, Carolyn; Meitanis, Vanessa; Cheung, Kelly; Kim, Yekyung; Talbourdet, Esther; Newton, Caroline – International Journal of Language & Communication Disorders, 2023
Background: People with aphasia (PWA) can experience functional numeracy difficulties, that is, problems understanding or using numbers in everyday life, which can have numerous negative impacts on their daily lives. There is growing interest in designing functional numeracy interventions for PWA; however, there are limited suitable assessments…
Descriptors: Test Construction, Test Validity, Numeracy, Adults
Riana Nurhayati; Suranto Aw; Siti Irene Astuti Dwiningrum; Mami Hajaroh; Herwin Herwin – International Journal of Educational Methodology, 2024
Evaluation of child-friendly school (CFS) policies is essential to determine the achievements of school efforts in reducing cases of violence. This research aims to prove the reliability and validity of CFS policy evaluation instruments in elementary schools in different locations. This investigation uses the Context Input Process Product (CIPP)…
Descriptors: Validity, Reliability, School Policy, Program Evaluation
Schmidt, Ellyn M.; Rothenberg, W. Andrew; Davidson, Bridget C.; Barnett, Miya; Jent, Jason; Cadenas, Heleny; Fernandez, Corina; Davis, Eileen – Journal of Behavioral Education, 2023
Measuring classroom behavior among young children is important to guide assessment and intervention decisions, yet there is limited literature on appropriate direct observation tools for this purpose. This article describes the psychometric properties of the Behavior Assessment System for Children, Student Observation System (BASC-3 SOS) with 135…
Descriptors: Young Children, Special Education, Child Behavior, Psychometrics
Kristen Bottema-Beutel; Shannon Crowley LaPoint; So Yoon Kim; Sarah Mohiuddin; Qun Yu; Rachael McKinnon – Exceptional Children, 2024
In this secondary analysis of a previously conducted systematic review, we analyze social validity assessments in intervention research for transition-age autistic youth. Social validity is concerned with the acceptability of the intervention goals, the acceptability and feasibility of the intervention procedures, and the perceived importance of…
Descriptors: Autism Spectrum Disorders, Intervention, Validity, Psychometrics
Van Elsen, Joris; Faddar, Jerich; Appels, Lies; De Maeyer, Sven; Vanhoof, Jan; Van Petegem, Peter – School Effectiveness and School Improvement, 2023
In order to support research on school effectiveness, there is a need for valid and reliable instruments to assess policymaking capacities of schools. Increasingly, policymaking is seen as a shared responsibility of the entire pedagogical team of a school. In this article, data were analysed from a sample of 1,696 (care) teachers coordinators and…
Descriptors: Educational Policy, Policy Formation, Questionnaires, School Effectiveness
Sermin Metin; Mehmet Basaran; Merve Yildirim Seheryeli; Emily Relkin; Damla Kalyenci – Journal of Science Education and Technology, 2024
In the early years, it has become essential to support the acquisition of computational thinking, which is seen as a 21st-century skill and new literacy. A valid and reliable measurement tool is needed to develop and evaluate educational practices related to these skills. "TechCheck" is a validated unplugged assessment of computational…
Descriptors: Computation, Thinking Skills, Test Validity, Test Reliability
Tavares, Walter; Kinnear, Benjamin; Schumacher, Daniel J.; Forte, Milena – Advances in Health Sciences Education, 2023
In this perspective, the authors critically examine "rater training" as it has been conceptualized and used in medical education. By "rater training," they mean the educational events intended to "improve" rater performance and contributions during assessment events. Historically, rater training programs have focused…
Descriptors: Medical Education, Interrater Reliability, Evaluation Methods, Training
Erin Johnson; Samantha Barstack; Yikai Xu; Hannah Wise; Bradley T. Erford; Catharina Chang; David Delmonico – Measurement and Evaluation in Counseling and Development, 2025
Problem Statement: Among individuals aged 12 years or older, 14.3% (40.0 million) reported the use of an illicit drug in the previous year. Given the prevalence of drug abuse, it is increasingly important to determine effective screening practices, treatment procedures, and best practices among various subpopulations to identify drug use-related…
Descriptors: Drug Abuse, Screening Tests, Psychometrics, Synthesis
Stefanie A. Wind; Yuan Ge – Measurement: Interdisciplinary Research and Perspectives, 2024
Mixed-format assessments made up of multiple-choice (MC) items and constructed response (CR) items that are scored using rater judgments include unique psychometric considerations. When these item types are combined to estimate examinee achievement, information about the psychometric quality of each component can depend on that of the other. For…
Descriptors: Interrater Reliability, Test Bias, Multiple Choice Tests, Responses
Bronson Hui; Zhiyi Wu – Studies in Second Language Acquisition, 2024
A slowdown or a speedup in response times across experimental conditions can be taken as evidence of online deployment of knowledge. However, response-time difference measures are rarely evaluated on their reliability, and there is no standard practice to estimate it. In this article, we used three open data sets to explore an approach to…
Descriptors: Reliability, Reaction Time, Psychometrics, Criticism
Hüseyin Öztürk; Mustafa Karabulut; Mine Baydan-Aran; Suna Tokgöz-Yilmaz – Journal of Deaf Studies and Deaf Education, 2024
This methodological study aimed to assess the validity and reliability of the Turkish version of the Evaluation of the Impact of Hearing Loss in Adults (ERSA) questionnaire for individuals with treated hearing loss. The study involved 200 participants, and both exploratory factor analysis and confirmatory factor analysis were used to examine…
Descriptors: Turkish, Test Validity, Test Reliability, Hearing Impairments
Practices in Instrument Use and Development in "Chemistry Education Research and Practice" 2010-2021
Lazenby, Katherine; Tenney, Kristin; Marcroft, Tina A.; Komperda, Regis – Chemistry Education Research and Practice, 2023
Assessment instruments that generate quantitative data on attributes (cognitive, affective, behavioral, etc.) of participants are commonly used in the chemistry education community to draw conclusions in research studies or inform practice. Recently, articles and editorials have stressed the importance of providing evidence for the…
Descriptors: Chemistry, Periodicals, Journal Articles, Science Education
Huscroft-D'Angelo, Jacqueline; Wery, Jessica; Martin-Gutel, Jodie D.; Pierce, Corey; Loftin, Kara – Assessment for Effective Intervention, 2022
The Scales for Assessing Emotional Disturbance Screener--Third Edition (SAED-3) is a standardized, norm-referenced measure designed to identify school-age students at risk for emotional and behavioral problems. Four studies are reported to address the psychometric status of the SAED-3 Screener. Study 1 examined the internal consistency of the…
Descriptors: Emotional Disturbances, Test Reliability, Test Validity, Screening Tests
Azaan Vhora; Ryan L. Davies; Kylie Rice – Psychology Learning and Teaching, 2024
Background: Objective Structured Clinical Examinations (OSCEs) are a simulation-based assessment tool used extensively in medical education for evaluating clinical competence. OSCEs are widely regarded as more valid, reliable, and valuable compared to traditional assessment measures, and are now emerging within professional psychology training…
Descriptors: Psychology, Higher Education, Psychometrics, Objective Tests