Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 21 |
Since 2006 (last 20 years) | 72 |
Descriptor
Correlation | 107 |
Evaluation Methods | 107 |
Test Validity | 107 |
Test Reliability | 46 |
Foreign Countries | 24 |
Psychometrics | 19 |
Scores | 19 |
Factor Analysis | 18 |
Rating Scales | 18 |
Test Construction | 17 |
Children | 15 |
Author
Clark, John L. D. | 2 |
Clarke, Ben | 2 |
Elliott, Stephen N. | 2 |
Haymond, Kelly | 2 |
Jordan, Nancy C. | 2 |
Klein, Stephen P. | 2 |
Matson, Johnny L. | 2 |
McIntyre, Nancy | 2 |
Mundy, Peter | 2 |
Newman-Gonchar, Rebecca | 2 |
Novotny, Stephanie | 2 |
Education Level
Higher Education | 14 |
Elementary Education | 10 |
Postsecondary Education | 10 |
High Schools | 7 |
Elementary Secondary Education | 6 |
Secondary Education | 6 |
Grade 8 | 4 |
Middle Schools | 4 |
Grade 11 | 3 |
Grade 7 | 3 |
Junior High Schools | 3 |
Audience
Researchers | 2 |
Practitioners | 1 |
Location
Florida | 5 |
Australia | 3 |
Singapore | 3 |
Canada | 2 |
Illinois | 2 |
Japan | 2 |
Massachusetts | 2 |
South Korea | 2 |
Spain | 2 |
Taiwan | 2 |
Wisconsin | 2 |
An, Lily Shiao; Ho, Andrew Dean; Davis, Laurie Laughlin – Educational Measurement: Issues and Practice, 2022
Technical documentation for educational tests focuses primarily on properties of individual scores at single points in time. Reliability, standard errors of measurement, item parameter estimates, fit statistics, and linking constants are standard technical features that external stakeholders use to evaluate items and individual scale scores.…
Descriptors: Documentation, Scores, Evaluation Methods, Longitudinal Studies
Kmail, Zaher M.; Brobbey, Gordon – Journal of the American Academy of Special Education Professionals, 2024
Teacher evaluation has been closely tied to professional development. In special education, professional development experiences are meant to promote special educator learning and implementation of high leverage practices. Yet, the connection between teacher evaluation outcomes and professional development decisions of special educators is largely…
Descriptors: Teacher Evaluation, Special Education Teachers, Teacher Attitudes, Faculty Development
Zheng, Boyang; Sun, Guiping; Wang, Hourong – SAGE Open, 2019
Traditional Chinese medicine (TCM) is an important component of China's medical system. How to educate TCM practitioners in China, therefore, has become a crucial issue. To contribute to this issue, the current research identified the competency model of TCM practitioners in China and developed an evaluation for TCM students. We combined Bloom's…
Descriptors: Medical Students, Correlation, Foreign Countries, Test Reliability
Tavares, Walter; Brydges, Ryan; Myre, Paul; Prpic, Jason; Turner, Linda; Yelle, Richard; Huiskamp, Maud – Advances in Health Sciences Education, 2018
Assessment of clinical competence is complex and inference based. Trustworthy and defensible assessment processes must have favourable evidence of validity, particularly where decisions are considered high stakes. We aimed to organize, collect and interpret validity evidence for a high stakes simulation based assessment strategy for certifying…
Descriptors: Competence, Simulation, Allied Health Personnel, Certification
Walker, Brooke – ProQuest LLC, 2018
Assessing higher-level verbal repertoires of individuals with autism and related intellectual disabilities is crucial due to the language and cognitive deficits experienced by this population, as well as the need for valid assessment tools for data-driven and individualized treatment. In addition, curricula or instructional protocols that…
Descriptors: Autism, Intellectual Disability, Correlation, Intelligence Quotient
Brown, Alan V.; Plonsky, Luke; Teimouri, Yasser – Foreign Language Annals, 2018
Much of applied linguistics research has been concerned with classroom-based second language (L2) development as it offers an ideal setting for examining the institutional ecology of L2 learning and teaching. However, scholars have continued to call for greater attention to the operationalization of constructs, selection of valid assessments, and…
Descriptors: Grades (Scholastic), Second Language Learning, Second Language Instruction, Language Tests
St. Clair, Travis; Hallberg, Kelly; Cook, Thomas D. – Journal of Educational and Behavioral Statistics, 2016
We explore the conditions under which short, comparative interrupted time-series (CITS) designs represent valid alternatives to randomized experiments in educational evaluations. To do so, we conduct three within-study comparisons, each of which uses a unique data set to test the validity of the CITS design by comparing its causal estimates to…
Descriptors: Research Methodology, Randomized Controlled Trials, Comparative Analysis, Time
Lee, Minji K.; Sweeney, Kevin; Melican, Gerald J. – Educational Assessment, 2017
This study investigates the relationships among factor correlations, inter-item correlations, and the reliability estimates of subscores, providing a guideline with respect to psychometric properties of useful subscores. In addition, it compares subscore estimation methods with respect to reliability and distinctness. The subscore estimation…
Descriptors: Scores, Test Construction, Test Reliability, Test Validity
Varela, Otmar; Mead, Esther – Journal of Education for Business, 2018
Popular teamwork assessments have been strongly criticized on the grounds of poor psychometric properties and their disconnect with conceptual models of teamwork. These issues raise concerns with respect to our ability to evaluate efforts devoted to advancing teamwork in academia. We report the development of a teamwork assessment that builds on…
Descriptors: Teamwork, Evaluation Methods, Test Validity, Psychometrics
Mundy, Peter; Novotny, Stephanie; Swain-Lerro, Lindsey; McIntyre, Nancy; Zajic, Matt; Oswald, Tasha – Journal of Autism and Developmental Disorders, 2017
The validity of joint attention assessment in school-aged children with ASD is unclear (Lord, Jones, "Journal of Child Psychology and Psychiatry" 53(5):490-509, 2012). This study examined the feasibility and validity of a parent-report measure of joint attention related behaviors in verbal children and adolescents with ASD. Fifty-two…
Descriptors: Attention, Autism, Pervasive Developmental Disorders, Evaluation Methods
Mundy, Peter; Novotny, Stephanie; Swain-Lerro, Lindsey; McIntyre, Nancy; Zajic, Matt; Oswald, Tasha – Grantee Submission, 2017
The validity of joint attention assessment in school-aged children with ASD is unclear (Lord, Jones, "Journal of Child Psychology and Psychiatry" 53(5):490-509, 2012). This study examined the feasibility and validity of a parent-report measure of joint attention related behaviors in verbal children and adolescents with ASD. Fifty-two…
Descriptors: Attention, Autism, Pervasive Developmental Disorders, Evaluation Methods
Rigney, Alexander M. – Journal of Psychoeducational Assessment, 2019
This report reviews the "Social Skills Improvement System Social-Emotional Learning Edition" (SSIS SEL; Gresham & Elliott, 2017), a multicomponent rating scale that includes a criterion- and norm-referenced measure of social-emotional and academic functioning--based on a reformulation of the "Social Skills Improvement…
Descriptors: Rating Scales, Interpersonal Competence, Social Development, Emotional Development
Minor, Elizabeth Covay; Porter, Andrew C.; Murphy, Joseph; Goldring, Ellen; Elliott, Stephen N. – Educational Assessment, Evaluation and Accountability, 2017
The Vanderbilt Assessment for Leadership in Education (VAL-ED) is a 360-degree learning-centered behaviors principal evaluation tool that includes ratings from the principal, supervisors, and teachers. The current study assesses the test-retest reliability of the VAL-ED for a sample of seven school districts as part of multiple validity and…
Descriptors: Administrator Evaluation, Principals, Test Reliability, Test Validity
Wedman, Jonathan; Lyrén, Per-Erik – Practical Assessment, Research & Evaluation, 2015
When subscores on a test are reported to the test taker, the appropriateness of reporting them depends on whether they provide useful information above what is provided by the total score. Subscores that fail to do so lack adequate psychometric quality and should not be reported. There are several methods for examining the quality of subscores,…
Descriptors: Evaluation Methods, Psychometrics, Scores, Tests
Kim, Youngmi; Kim, Kyeongmo; Lee, Shinhye – Research on Social Work Practice, 2017
Purpose: We tested the reliability and validity of the Self-Efficacy Questionnaire for Children (SEQ-C) in a sample of children living in orphanages in South Korea. Methods: Our study sample consisted of 334 children aged 13-18 obtained using a convenience sampling method. We conducted a confirmatory factor analysis to identify the factor…
Descriptors: Questionnaires, Self Efficacy, Institutionalized Persons, Adolescents