Publication Date
In 2025 | 6 |
Since 2024 | 59 |
Since 2021 (last 5 years) | 268 |
Since 2016 (last 10 years) | 781 |
Since 2006 (last 20 years) | 1698 |
Descriptor
Scores | 2324 |
Test Reliability | 1083 |
Reliability | 1051 |
Test Validity | 596 |
Foreign Countries | 572 |
Correlation | 529 |
Validity | 456 |
Psychometrics | 436 |
Measures (Individuals) | 411 |
Factor Analysis | 392 |
Statistical Analysis | 329 |
More ▼ |
Source
Author
Thompson, Bruce | 21 |
Erford, Bradley T. | 13 |
Henson, Robin K. | 11 |
Zimmerman, Donald W. | 11 |
Haberman, Shelby J. | 10 |
Worrell, Frank C. | 10 |
Lee, Yong-Won | 9 |
Sinharay, Sandip | 9 |
Gill, Brian | 8 |
Petscher, Yaacov | 8 |
Wainer, Howard | 8 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 33 |
Practitioners | 21 |
Teachers | 9 |
Administrators | 4 |
Counselors | 2 |
Parents | 2 |
Policymakers | 2 |
Community | 1 |
Students | 1 |
Location
Turkey | 88 |
Canada | 42 |
China | 37 |
United States | 35 |
Australia | 31 |
Florida | 24 |
Netherlands | 24 |
California | 21 |
Spain | 21 |
United Kingdom | 21 |
United Kingdom (England) | 21 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 3 |
Meets WWC Standards with or without Reservations | 3 |
Does not meet standards | 1 |
Lower, Leeann M.; Newman, Tarkington J.; Anderson-Butcher, Dawn – Research on Social Work Practice, 2017
Purpose: This study examines the psychometric properties of the Teamwork Scale for Youth, an assessment designed to measure youths' perceptions of their teamwork competency. Methods: The Teamwork Scale for Youth was administered to a sample of 460 youths. Confirmatory factor analyses examined the factor structure and measurement invariance of the…
Descriptors: Teamwork, Likert Scales, Youth, Construct Validity
Welsch, Lauren A.; Rutledge, Carolyn; Hoch, Johanna M. – Athletic Training Education Journal, 2017
Context: Athletic trainers are encouraged to work collaboratively with other health care professionals to improve patient outcomes. Interprofessional education (IPE) experiences for practicing clinicians should be developed to improve interprofessional collaborative practice postcertification. An outcome measure, such as the modified Readiness for…
Descriptors: Athletic Coaches, Interprofessional Relationship, Readiness, Allied Health Personnel
Bowen, Naomi – ProQuest LLC, 2017
The purpose of this research was to determine if the Pennsylvania Value-Added Assessment System Average Growth Index (PVAAS AGI) scores, derived from standardized tests and calculated for Pennsylvania schools, provide a valid and reliable assessment of teacher effectiveness, as these scores are currently used to derive 15% of the annual…
Descriptors: Teacher Effectiveness, Teacher Evaluation, Value Added Models, Standardized Tests
Sabbah, Sabah Salman – Arab World English Journal, 2020
This study aimed at investigating the effect of two strategies of teaching reading: 'semantic mapping' and 'question generation' on the reading achievement of a sample of 40 female students enrolling in two classes in Level 2 English as a Second Language Foundation Program at the Community College of Qatar. The researcher of the current study…
Descriptors: Semantics, Reading Achievement, Cognitive Mapping, Foreign Countries
Lopata, Christopher; Donnelly, James P.; Rodgers, Jonathan D.; Thomeer, Marcus L.; Booth, Adam J. – Autism: The International Journal of Research and Practice, 2020
This study assessed the reliability and criterion-related validity of teacher ratings on the Adapted Skillstreaming Checklist for a sample of 133 children, aged 6-11 years, with autism spectrum disorder (without intellectual disability). Internal consistency for the total sample was 0.93. For a subsample, test-retest reliability was very good (r =…
Descriptors: Check Lists, Validity, Reliability, Teacher Attitudes
Jorion, Natalie; Gane, Brian D.; James, Katie; Schroeder, Lianne; DiBello, Louis V.; Pellegrino, James W. – Journal of Engineering Education, 2015
Background: Concept inventories (CIs) are commonly used in engineering disciplines to assess students' conceptual understanding and to evaluate instruction, but educators often use CIs without sufficient evidence that a structured approach has been applied to validate inferences about student thinking. Purpose: We propose an analytic framework for…
Descriptors: Guidelines, Validity, Inferences, Concept Formation
Lawson, Timothy J.; Jordan-Fleming, Mary Kay; Bodle, James H. – Teaching of Psychology, 2015
Critical thinking is widely considered an important skill for psychology majors. However, few measures exist of the types of critical thinking that are specific to psychology majors. Lawson (1999) designed the Psychological Critical Thinking Exam (PCTE) to measure students' ability to "think critically, or evaluate claims, in a way that…
Descriptors: Critical Thinking, Psychology, Majors (Students), Measures (Individuals)
Zhang, Bo; Xiao, Yunnan; Luo, Juan – Language Testing in Asia, 2015
Previous studies comparing holistic scoring to analytic scoring of second language writing have given mixed results. Some of them suffer from methodological drawbacks, such as limited writing sample size, limited number of raters, and lack of direct comparison of the two methods. Based on 300 writing samples graded by 14 raters, this research…
Descriptors: Evaluators, Reliability, Scores, Holistic Approach
Pier, Elizabeth L.; Raclaw, Joshua; Nathan, Mitchell J.; Kaatz, Anna; Carnes, Molly; Ford, Cecilia E. – Wisconsin Center for Education Research, 2015
Grant peer review is a foundational component of scientific research. In the context of grant review meetings, the review process is a collaborative, socially mediated, locally constructed decision-making task. The current study examines how collaborative discussion affects reviewers' scores of grant proposals, how different review panels score…
Descriptors: Participative Decision Making, Videoconferencing, Peer Evaluation, Grants
Benton, Tom – Cambridge Assessment, 2016
The reliability of an assessment is defined as the extent to which candidates' results would remain stable if the entire assessment exercise was repeated. Whilst numerous studies have evaluated the reliability of written examinations, relatively little has been done to quantify the reliability of internal teacher assessment within schools. This is…
Descriptors: Test Reliability, Foreign Countries, History Instruction, English Literature
Lin, Chien-Liang – Journal of Science Education and Technology, 2018
This study sought to develop a self-report instrument to be used in the assessment of the project competences of college students engaged in online project-based learning. Three scales of the KIPSSE instrument developed for this study, namely, the knowledge integration, project skills, and self-efficacy scales, were based on related theories and…
Descriptors: College Students, Online Courses, Student Projects, Active Learning
Parkin, Jason R.; Beaujean, A. Alexander; Firmin, Michael W.; Qiu, Xiao; Firmin, Ruth L. – Journal of Psychoeducational Assessment, 2018
In this study, we examined the factor structure, reliability, and external validity of scores from the Comprehensive Test of Nonverbal Intelligence-Second Edition (CTONI-2) using an independent sample of young adults currently enrolled in a postsecondary institution. Although the subtests appear to be measuring general intelligence, the aggregate…
Descriptors: Nonverbal Tests, Intelligence Tests, Factor Structure, Test Reliability
Aviad-Levitzky, Tami; Laufer, Batia; Goldstein, Zahava – Language Assessment Quarterly, 2019
This article describes the development and validation of the new CATSS (Computer Adaptive Test of Size and Strength), which measures vocabulary knowledge in four modalities -- productive recall, receptive recall, productive recognition, and receptive recognition. In the first part of the paper we present the assumptions that underlie the test --…
Descriptors: Foreign Countries, Test Construction, Test Validity, Test Reliability
Johnston, Lucy; Schluter, Philip J. – Studies in Higher Education, 2017
With increasing competition for postgraduate research scholarships, awarding processes demand attention and scrutiny. We examine inter-rater reliability for two prestigious New Zealand scholarships, the Shirtcliffe Fellowship and the Gordon Watson Scholarship. For each scholarship, five assessors (three academic; two non-academic) independently…
Descriptors: Interrater Reliability, Scholarships, Academic Achievement, Program Proposals
Hays, Danica G.; Wood, Chris – Measurement and Evaluation in Counseling and Development, 2017
We present considerations for validity when a population outside of a normed sample is assessed and those data are interpreted. Using a career group counseling example exploring life satisfaction changes as evidenced by the Quality of Life Inventory (Frisch, 1994), we showcase qualitative and quantitative approaches to explore how normative data…
Descriptors: Data Interpretation, Scores, Quality of Life, Life Satisfaction