Publication Date
In 2025 | 103 |
Since 2024 | 950 |
Since 2021 (last 5 years) | 3486 |
Since 2016 (last 10 years) | 7671 |
Since 2006 (last 20 years) | 14844 |
Descriptor
Test Reliability | 14596 |
Test Validity | 9898 |
Reliability | 9570 |
Foreign Countries | 6774 |
Test Construction | 4627 |
Validity | 4130 |
Measures (Individuals) | 3759 |
Factor Analysis | 3728 |
Psychometrics | 3406 |
Interrater Reliability | 3068 |
Correlation | 3013 |
Audience
Researchers | 703 |
Practitioners | 447 |
Teachers | 204 |
Administrators | 121 |
Policymakers | 62 |
Counselors | 42 |
Students | 37 |
Parents | 11 |
Community | 7 |
Media Staff | 5 |
Support Staff | 5 |
Location
Turkey | 1249 |
Australia | 428 |
Canada | 371 |
China | 332 |
United States | 265 |
United Kingdom | 246 |
Taiwan | 222 |
Netherlands | 217 |
Indonesia | 215 |
California | 208 |
Spain | 204 |
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 8 |
Meets WWC Standards with or without Reservations | 9 |
Does not meet standards | 6 |
Jordan B. Westcott; Louis M. Rocconi – Measurement and Evaluation in Counseling and Development, 2025
Objective: This study sought to examine the factor structure, internal consistency, and measurement invariance of the Brief Resilience Scale (BRS) and the Multidimensional Scale of Perceived Social Support (MSPSS) among older sexual minority women with disabilities. Method: Participants (n = 208) consisted of sexual minority women aged 55 and…
Descriptors: Psychometrics, Resilience (Psychology), Social Support Groups, Measures (Individuals)
Esra Sözer Boz – Education and Information Technologies, 2025
International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students
Louise Badham – Oxford Review of Education, 2025
Different sources of assessment evidence are reviewed during International Baccalaureate (IB) grade awarding to convert marks into grades and ensure fair results for students. Qualitative and quantitative evidence are analysed to determine grade boundaries, with statistical evidence weighed against examiner judgement and teachers' feedback on…
Descriptors: Advanced Placement Programs, Grading, Interrater Reliability, Evaluative Thinking
Gael I. Orsmond; Sharada G. Krishnan; Elizabeth G. S. Munsell; Ellen S. Cohn; Wendy J. Coster – Journal of Autism and Developmental Disorders, 2025
Purpose: Research documents poor outcomes for autistic adults in the domains of employment, independent living, and social relationships. Measurement and sample limitations in prior studies may have amplified past estimates of poor outcomes. The goal of the current study was to improve upon past approaches and to create and describe a measurement…
Descriptors: Autism Spectrum Disorders, Young Adults, High School Graduates, Employment
Park, Yeonggwang; Cádiz, Manuel Díaz; Nagle, Kathleen F.; Stepp, Cara E. – Journal of Speech, Language, and Hearing Research, 2020
Purpose: Assessment of strained voice quality is difficult due to the weak reliability of auditory-perceptual evaluation and lack of strong acoustic correlates. This study evaluated the contributions of relative fundamental frequency (RFF) and mid-to-high frequency noise to the perception of strain. Method: Stimuli were created using recordings of…
Descriptors: Acoustics, Audio Equipment, Auditory Perception, Correlation
Kinnear, George; Bennett, Max; Binnie, Rachel; Bolt, Róisín; Zheng, Yinglan – Teaching Mathematics and Its Applications, 2020
The MATH taxonomy classifies questions according to the mathematical skills required to answer them. It was created to aid the development of more balanced assessments in undergraduate mathematics and has since been used to compare different assessment regimes across school and university. To date, there has been no systematic investigation of the…
Descriptors: Taxonomy, Mathematics Instruction, Teaching Methods, Reliability
The AI Teacher Test: Measuring the Pedagogical Ability of Blender and GPT-3 in Educational Dialogues
Tack, Anaïs; Piech, Chris – International Educational Data Mining Society, 2022
How can we test whether state-of-the-art generative models, such as Blender and GPT-3, are good AI teachers, capable of replying to a student in an educational dialogue? Designing an AI teacher test is challenging: although evaluation methods are much-needed, there is no off-the-shelf solution to measuring pedagogical ability. This paper reports…
Descriptors: Artificial Intelligence, Dialogs (Language), Bayesian Statistics, Decision Making
Eryilmaz, Önder – Participatory Educational Research, 2022
Although there is an increasing number of studies concentrating upon education, some researchers have revealed that most studies, including qualitative studies in education, have methodological issues. One of the most common mistakes and neglected issues in qualitative studies is not to ensure the trustworthiness of the research, which indeed is…
Descriptors: Foreign Countries, Doctoral Dissertations, Research Methodology, Credibility
Verhelst, Dries; Vanhoof, Jan; Van Petegem, Peter – Environmental Education Research, 2022
Empirically based tools to map education for sustainable development within school organisations are not readily available, which is both a cause and a consequence of the scarce empirical and quantitative research on school organisations and education for sustainable development. In the present study, the Education for Sustainable Development School…
Descriptors: Test Construction, Test Validity, Sustainable Development, School Organization
Kinarsky, Alana R.; Christie, Christina A. – American Journal of Evaluation, 2022
Since 2007, two taxonomies have been proposed to identify the components of evaluation practice that may be specified in an evaluation policy. Little is known, however, about how these taxonomies align with evaluation policies developed by philanthropic foundations. Through thematic analysis, this article first compares 12 foundation evaluation…
Descriptors: Taxonomy, Evaluation Methods, Philanthropic Foundations, Educational Policy
Ozalp, Ugur; Cetin, Munevver – International Journal of Assessment Tools in Education, 2022
The aim of this study was to develop a scale instrument for measuring academic intellectual capital in the Turkish higher education context depending on student perceptions. The sample consisted of students of higher education institutions in the 2020-2021 academic year. Data were gathered in two stages. Exploratory Factor Analysis (EFA) was…
Descriptors: Measures (Individuals), College Students, Test Validity, Test Reliability
Akdeniz, Seher; Budak, Hatice; Ahçi, Zeynep G. – International Education Studies, 2022
Narcissism in social media reveals itself differently than in daily social interactions. Therefore, the present study aimed to develop a Scale of Narcissism in Social Media through the lens of the Narcissistic Admiration and Rivalry Model and to investigate its psychometric characteristics. The total sample of the study consisted of 740…
Descriptors: Test Construction, Personality Traits, Social Media, Psychometrics
Dambha, Tasneem; Swanepoel, De Wet; Mahomed-Asmail, Faheema; De Sousa, Karina C.; Graham, Marien A.; Smits, Cas – Journal of Speech, Language, and Hearing Research, 2022
Purpose: This study compared the test characteristics, test-retest reliability, and test efficiency of three novel digits-in-noise (DIN) test procedures to a conventional antiphasic 23-trial adaptive DIN (D23). Method: One hundred twenty participants with an average age of 42 years (SD = 19) were included. Participants were tested and retested…
Descriptors: Auditory Tests, Screening Tests, Efficiency, Test Format
Gil-Llario, María Dolores; Flores-Buils, Raquel; Elipe-Miravet, Marcel; Fernández-García, Olga; Ballester-Arnal, Rafael – Journal of Applied Research in Intellectual Disabilities, 2022
Background: This paper presents a description of the development and psychometric properties of a self-report instrument for the assessment of sexual behaviour and concerns of people with mild intellectual disabilities (SEBECOMID-S). Methods and procedures: The study included 281 people with mild intellectual disabilities. The psychometric…
Descriptors: Test Construction, Psychometrics, Measurement Techniques, Sexuality
Levin, Nathan; Baker, Ryan S.; Nasiar, Nidhi; Fancsali, Stephen; Hutt, Stephen – International Educational Data Mining Society, 2022
Research into "gaming the system" behavior in intelligent tutoring systems (ITS) has been around for almost two decades, and detection has been developed for many ITSs. Machine learning models can detect this behavior in both real-time and in historical data. However, intelligent tutoring system designs often change over time, in terms…
Descriptors: Intelligent Tutoring Systems, Artificial Intelligence, Models, Cheating