McLeod, Bryce D.; Sutherland, Kevin S.; Broda, Michael; Granger, Kristen L.; Martinez, Ruben G.; Conroy, Maureen A.; Snyder, Patricia A.; Southam-Gerow, Michael A. – Prevention Science, 2022
Though treatment integrity measurement is important for research intended to promote social and behavioral outcomes of children at risk for emotional and behavioral disorders (EBDs) in early childhood settings, measurement gaps exist in the field. This paper reports on the development and preliminary psychometric assessment of the treatment…
Descriptors: Psychometrics, Measures (Individuals), Fidelity, Integrity
Beula M. Magimairaj; Philip Capin; Sandra L. Gillam; Sharon Vaughn; Greg Roberts; Anna-Maria Fall; Ronald B. Gillam – Grantee Submission, 2022
Purpose: Our aim was to evaluate the psychometric properties of the online administered format of the Test of Narrative Language--Second Edition (TNL-2; Gillam & Pearson, 2017), given the importance of assessing children's narrative ability and considerable absence of psychometric studies of spoken language assessments administered online.…
Descriptors: Computer Assisted Testing, Language Tests, Story Telling, Language Impairments
Adam Brieske-Ulenski; Michelle J. Kelley – Literacy Research and Instruction, 2025
It is well documented that literacy coaches engage in a wide variety of tasks related and unrelated to their role. However, little is known about their efficacy beliefs related to performing these tasks. In this article we describe the development and initial testing of the Literacy Coaching Self-Efficacy Scale that was designed and created to…
Descriptors: Literacy, Literacy Education, Coaching (Performance), Self Efficacy
Ping-Lin Chuang – Language Testing, 2025
This experimental study explores how source use features impact raters' judgment of argumentation in a second language (L2) integrated writing test. One hundred four experienced and novice raters were recruited to complete a rating task that simulated the scoring assignment of a local English Placement Test (EPT). Sixty written responses were…
Descriptors: Interrater Reliability, Evaluators, Information Sources, Primary Sources
Grace C. Tetschner; Sachin Nedungadi – Chemistry Education Research and Practice, 2025
Many undergraduate chemistry students hold alternate conceptions related to resonance--an important and fundamental topic of organic chemistry. To help address these alternate conceptions, an organic chemistry instructor could administer the resonance concept inventory (RCI), which is a multiple-choice assessment that was designed to identify…
Descriptors: Scientific Concepts, Concept Formation, Item Response Theory, Scores
Bang Quan Zheng; Peter M. Bentler – Structural Equation Modeling: A Multidisciplinary Journal, 2025
This paper aims to advocate for a balanced approach to model fit evaluation in structural equation modeling (SEM). The ongoing debate surrounding chi-square test statistics and fit indices has been characterized by ambiguity and controversy. Despite the acknowledged limitations of relying solely on the chi-square test, its careful application can…
Descriptors: Monte Carlo Methods, Structural Equation Models, Goodness of Fit, Robustness (Statistics)
Dandan Tang; Steven M. Boker; Xin Tong – Structural Equation Modeling: A Multidisciplinary Journal, 2025
The replication crisis in social and behavioral sciences has raised concerns about the reliability and validity of empirical studies. While research in the literature has explored contributing factors to this crisis, the issues related to analytical tools have received less attention. This study focuses on a widely used analytical tool -…
Descriptors: Test Validity, Factor Analysis, Replication (Evaluation), Social Science Research
Stacey Havlik; Peter Wiens; Arash Ghafoori; Melissa Jacobowitz; Kelly-Jo Sheback; Hannah Hudson – Journal of Education for Students Placed at Risk, 2025
While many teachers are unaware that students in their classes are experiencing homelessness, others may not know how to support students who are identified as lacking consistent housing (Wright et al., 2019). Thus, there is a critical need to better assess, understand, and enhance teachers' knowledge and attitudes toward homelessness. Therefore,…
Descriptors: Preservice Teachers, Preservice Teacher Education, Homeless People, Student Characteristics
Marcela Alves Sanseverino; Ana Carolina Raabe Abitante; Monique Cristielle Silva da Silva; Liza S. Rovniak; Wagner de Lara Machado – Measurement in Physical Education and Exercise Science, 2025
As part of a validation study of the Exercise Planning and Scheduling (EPS), and Goal-Setting (EGS) Scales, which were translated from English to Brazilian Portuguese, we aim to: present evidence of reliability and validity for the translated scale; and, explore the effects of non-labeled response categories of rating scales. The sample comprised…
Descriptors: Rating Scales, Self Management, Exercise, Test Validity
Lee, Morgan P.; Croteau, Ethan; Gurung, Ashish; Botelho, Anthony F.; Heffernan, Neil T. – International Educational Data Mining Society, 2023
The use of Bayesian Knowledge Tracing (BKT) models in predicting student learning and mastery, especially in mathematics, is a well-established and proven approach in learning analytics. In this work, we report on our analysis examining the generalizability of BKT models across academic years attributed to "detector rot." We compare the…
Descriptors: Bayesian Statistics, Models, Generalizability Theory, Longitudinal Studies
Halvorsen, Marianne Berg; Helverschou, Sissel Berge; Axelsdottir, Brynhildur; Brøndbo, Per Håkan; Martinussen, Monica – Journal of Autism and Developmental Disorders, 2023
There is a need for more knowledge of valid and standardized measures of mental health problems among children and adolescents with intellectual disability (ID). In this study, we systematically reviewed and evaluated the psychometric properties of instruments used to assess general mental health problems in this population. Following PRISMA…
Descriptors: Measures (Individuals), Clinical Diagnosis, Mental Health, Mental Disorders
Stark, Kristabel; Bettini, Elizabeth; Cumming, Michelle; O'Brien, Kristen Merrill; Brunsting, Nelson; Huggins-Manley, Corinne; Binkert, Gino; Shaheen, Tashnuva – Remedial and Special Education, 2023
Special education teachers' (SETs) working conditions play a crucial role in shaping the size, quality, and effectiveness of the U.S. SET workforce and thereby shape the quality of instruction provided to students with disabilities. Valid measures of SETs' working conditions are essential for conducting robust research on how to improve working…
Descriptors: Special Education, Teaching Conditions, Special Education Teachers, Students with Disabilities
Budak, Zeynep; Isikhan, Selen Yilmaz; Batuk, Merve Ozbal – Language, Speech, and Hearing Services in Schools, 2023
Purpose: The aim of this study was to translate the versions of the Hearing Environments and Reflection on Quality of Life (HEAR-QL) into Turkish and investigate the validity and reliability of the Turkish 26-item HEAR-QL (HEAR-QL-26) for children and Turkish 28-item HEAR-QL (HEAR-QL-28) for adolescents. Method: The protocol included translation…
Descriptors: Children, Adolescents, Hearing Impairments, Control Groups
Nguyen-Duc, Thinh; Phuong, Tam T.; Le, Thuy T. B.; Nguyen, Lam T. T. – Learning Organization, 2023
Purpose: The main purpose of this study was to validate the Dimensions of Learning Organization Questionnaire (DLOQ) in a Vietnamese context. Using the DLOQ as a research tool, this study also investigated the impact of demographic features on participants' perceptions of learning organizations (LOs). Design/methodology/approach: Data were…
Descriptors: Foreign Countries, Organizational Culture, Organizational Learning, Questionnaires
Johnson, David A.; Stone, Ashlyn; Marsh, Sarah – Measurement and Evaluation in Counseling and Development, 2023
We evaluated structural, construct, and concurrent validity evidence for State-Interpersonal Reactivity Index scores among 208 telemental health counselors. Confirmatory factor analysis results supported a three-factor model. Partial correlation analyses yielded evidence for construct and concurrent validity evidence for scores. We discuss…
Descriptors: Validity, Scores, Counselors, Health Services