Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 5 |
Since 2016 (last 10 years) | 13 |
Since 2006 (last 20 years) | 73 |
Descriptor
Construct Validity | 73 |
Evidence | 73 |
Factor Analysis | 25 |
Validity | 25 |
Scores | 18 |
Psychometrics | 17 |
Measures (Individuals) | 16 |
Reliability | 14 |
Test Validity | 14 |
Models | 12 |
Correlation | 11 |
More ▼ |
Source
Author
Kettler, Ryan J. | 3 |
Gargani, John | 2 |
Albanese, Emiliano | 1 |
Alessandri, Guido | 1 |
Anders, Samantha | 1 |
Ari, Omer | 1 |
Arias, Angel | 1 |
Armijo-Olivo, Susan | 1 |
Armstrong, Norris | 1 |
Armstrong, Stephen A. | 1 |
Balkin, Richard S. | 1 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 1 |
Location
United States | 3 |
Germany | 2 |
Massachusetts | 2 |
Turkey | 2 |
United Kingdom | 2 |
Australia | 1 |
California | 1 |
Canada | 1 |
Cyprus | 1 |
Hungary | 1 |
Idaho | 1 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Rasooli, Amirhossein; Zandi, Hamed; DeLuca, Christopher – School Psychology Review, 2023
Empirical research in education has largely adopted quantitative approaches to measure teachers' and students' perceptions of fairness and justice in classroom contexts. The purpose of this study is to understand the validity evidence of fairness and justice instruments including how fairness and justice have been conceptualized in measures.…
Descriptors: Measurement Techniques, Ethics, Justice, Validity
Shankar, Sneha; Marshall, Sheila K.; Zumbo, Bruno D. – Journal of Psychoeducational Assessment, 2020
Goal attainment scaling (GAS) is an internationally recognized measure that is widely used in educational, counseling, and clinical settings to identify and evaluate relevant goals for an individual. The GAS is an unusual measure because its content, which consists of goals, is formed by the respondent and/or users in the process of completing the…
Descriptors: Goal Orientation, Evaluation Methods, Measures (Individuals), Educational Assessment
Kiera Coulter; Melissa Y. Delgado; Rajni L. Nair; Lorey A. Wheeler; Rayni Thomas – Child & Youth Care Forum, 2024
Background: The positive youth development (PYD) framework orients developmental scholarship to focusing on youth's strengths rather than deficits, however, validity evidence on PYD measures among ethnic-minority youth is limited. Objectives: The objectives of the current study were to (a) examine the factor structure of a PYD measure within a…
Descriptors: Measures (Individuals), Test Construction, Construct Validity, Adolescent Development
Németh, Lilla; Bernáth, László – Educational Assessment, 2023
The Cognitive Test Anxiety Scale (CTAS) is a unidimensional scale designed to measure the cognitive aspect of test anxiety. The instrument has been adapted in several countries, and convincing psychometric properties have been found; however, uncertainties remain regarding its factor structure. Therefore, the aim of this study is twofold: to…
Descriptors: Test Anxiety, Cognitive Processes, Measures (Individuals), Factor Structure
Arias, Angel; Blais, Jean-Guy – Canadian Modern Language Review, 2023
This article draws on argument-based validation to gather and evaluate construct-related evidence (i.e., the explanation inference) of a high-stakes test. The data stemmed from the listening component of a French test used for immigration to Canada through the province of Quebec. An expert panel with varied backgrounds in applied linguistics…
Descriptors: French, Listening Comprehension Tests, Second Language Learning, High Stakes Tests
Albanese, Emiliano; Bütikofer, Lukas; Armijo-Olivo, Susan; Ha, Christine; Egger, Matthias – Research Synthesis Methods, 2020
Background: There is an agreement that the methodological quality of randomized trials should be assessed in systematic reviews, but there is a debate on how this should be done. We conducted a construct validation study of the Physiotherapy Evidence Database (PEDro) scale, which is widely used to assess the quality of trials in physical therapy…
Descriptors: Construct Validity, Physical Therapy, Item Response Theory, Factor Analysis
Bonner, Sarah; Chen, Peggy; Jones, Kristi; Milonovich, Brandon – Applied Measurement in Education, 2021
We describe the use of think alouds to examine substantive processes involved in performance on a formative assessment of computational thinking (CT) designed to support self-regulated learning (SRL). Our task design model included three phases of work on a computational thinking problem: forethought, performance, and reflection. The cognitive…
Descriptors: Formative Evaluation, Thinking Skills, Metacognition, Computer Science Education
Articulating and Evaluating Validity Arguments for the "TOEIC"® Tests. Research Report. ETS RR-17-51
Schmidgall, Jonathan E. – ETS Research Report Series, 2017
This report provides a brief overview of how the "TOEIC"® program has adopted an argument-based approach to validity in order to support the use of the TOEIC tests. This approach emphasizes the need to explicitly state claims about the measurement quality and intended use of a test and to support those claims with evidence. This report…
Descriptors: English (Second Language), Language Tests, Second Language Learning, Test Use
LeClaire, Edgar L.; Nihira, Mikio A.; Hardré, Patricia L. – Advances in Health Sciences Education, 2015
Validity is critical for meaningful assessment of surgical competency. According to the Standards for Educational and Psychological Testing, validation involves the integration of data from well-defined classifications of evidence. In the authoritative framework, data from all classifications support construct validity claims. The two aims of this…
Descriptors: Surgery, Gynecology, Validity, Standards
Huang, Lan – ProQuest LLC, 2015
It is widely believed that subscores can give us more information about an examinee. Thus they can be useful in planning instructional and remedial efforts, or making vocational or academic placement decisions. However, past research has shown that subscores are often not as useful as hoped either because they do not have high reliability or…
Descriptors: Children, Intelligence Tests, Scores, Reliability
Eddy, Colleen M.; Harrell, Pamela; Heitz, Layne – Investigations in Mathematics Learning, 2017
The "AssessToday" observation protocol was created to measure teachers' use of short-cycle formative assessment through observation of teachers in a single instructional period. This classroom observation instrument utilized seven dimensions: "learning target, question quality, nature of questioning, self-evaluation,…
Descriptors: Formative Evaluation, Mathematics Instruction, Construct Validity, Evidence
Sessoms, John; Henson, Robert A. – Measurement: Interdisciplinary Research and Perspectives, 2018
Diagnostic classification models (DCMs) classify examinees based on the skills they have mastered given their test performance. This classification enables targeted feedback that can inform remedial instruction. Unfortunately, applications of DCMs have been criticized (e.g., no validity support). Generally, these evaluations have been brief and…
Descriptors: Literature Reviews, Classification, Models, Criticism
Reddy, Linda A.; Dudek, Christopher M.; Kettler, Ryan J.; Kurz, Alexander; Peters, Stephanie – Educational Assessment, 2016
This study presents the reliability and validity of the Teacher Evaluation Experience Scale--Teacher Form (TEES-T), a multidimensional measure of educators' attitudes and beliefs about teacher evaluation. Confirmatory factor analyses of data from 583 teachers were conducted on the TEES-T hypothesized five-factor model, as well as on alternative…
Descriptors: Teacher Attitudes, Beliefs, Teacher Evaluation, Attitude Measures
Lam, Ricky – Assessment & Evaluation in Higher Education, 2014
Portfolio assessment (PA) has been extensively adopted for writing development in the past three decades. Much research on PA primarily investigates students' and teachers' perceptions of its benefits, and how it influences students' motivation and general writing abilities. Despite its purported effectiveness, not much has been…
Descriptors: Portfolio Assessment, English (Second Language), Independent Study, Feedback (Response)
Souroulla, Andry Vrachimi; Panayiotou, Georgia – Journal of Education and Training Studies, 2017
The Hellenic WISC-III (Wechsler, 1997) is currently the only standardized and officially published tool for the assessment of the intelligence of children and adolescents in Greece. The test is also used with caution in Cyprus, among Greek speakers, but no specific norms exist for use in this country. The purpose of this study was to provide…
Descriptors: Foreign Countries, Children, Intelligence Tests, Greek