Riana Nurhayati; Suranto Aw; Siti Irene Astuti Dwiningrum; Mami Hajaroh; Herwin Herwin – International Journal of Educational Methodology, 2024
Evaluation of child-friendly school (CFS) policies is essential to determine the achievements of school efforts in reducing violence cases. This research aims to prove the reliability and validity of CFS policy evaluation instruments in elementary schools in different locations. This investigation uses the Context Input Process Product (CIPP)…
Descriptors: Validity, Reliability, School Policy, Program Evaluation
Ting Ma; Lawrence Jun Zhang; Judy M. Parr – Language Awareness, 2025
Studies have shown that raising L2 learners' metaphor awareness contributes to the acquisition of figurative language, which fosters students' development of language skills. However, the instruments measuring metaphor awareness, in the majority of relevant research, did not seem to have undergone proper methodological procedures for checking…
Descriptors: English (Second Language), Second Language Learning, Second Language Instruction, Figurative Language
Hashemi Golpayegani, Fatemeh; Hosseinian, Simin; Rezaeian, Hamid; Pourshahriari, Mahsima; Rasouli, Roya – Psychology in the Schools, 2022
Shame is a significant factor for psychological problems in adolescents. The present study aims to assess the Persian version of the Adolescent Shame-Proneness Scale (ASPS) among Iranian adolescents. Participants of this correlation study were 2291 high school students aged 12-18 (1296 girls and 1036 boys), selected through a multistage random…
Descriptors: Foreign Countries, High School Students, Psychological Patterns, Test Validity
Mislevy, Robert J.; Oliveri, Maria Elena – Educational Measurement: Issues and Practice, 2019
In this digital ITEMS module, Dr. Robert [Bob] Mislevy and Dr. Maria Elena Oliveri introduce and illustrate a sociocognitive perspective on educational measurement, which focuses on a variety of design and implementation considerations for creating fair and valid assessments for learners from diverse populations with diverse sociocultural…
Descriptors: Educational Testing, Reliability, Test Validity, Test Reliability
Springer, Mark Christopher; Tyran, Craig K. – Quality Assurance in Education: An International Perspective, 2022
Purpose: This study aims to describe the development and validation of a student survey instrument to assess academic advising services. The instrument was based on the SERVQUAL scale, a well-known instrument for service quality. Design/methodology/approach: A quantitative methodology was used. Data were collected through a structured…
Descriptors: Academic Advising, Quality Assurance, Student Surveys, Test Construction
Ramsey Lee Cardwell – ProQuest LLC, 2022
The emergence of digital-first assessments is prompting reconsideration of, and innovation in, aspects of psychometrics, test validation, and test use. Using the Duolingo English Test (DET) as an example, this three-paper series seeks to address issues concerning the estimation of classification consistency and the reporting of results for such…
Descriptors: Classification, Reliability, Language Proficiency, Computer Assisted Testing
Park, HwaChoon; Hill, Roger B. – Career and Technical Education Research, 2018
The Employability Skills Assessment (ESA) was translated into Korean (KESA) and the construct validity and reliabilities of the KESA were examined to provide Koreans with a scientific research-based work ethic measure. A total of 896 Korean Baby Boomers (1955-1963), Generation X (1964-1981) and Millennials (1982-1999) provided data. Work ethic was…
Descriptors: Foreign Countries, Employment Qualifications, Job Skills, Test Validity
Indrapangastuti, Dewi; Surjono, Herman Dwi; Sugiman; Yanto, Bagus Endri – Journal of Education and e-Learning Research, 2021
This study aims to discover the effectiveness of the blended learning model in mathematics learning to improve the achievement of mathematical concepts. This study employed a quasi-experimental design with a non-equivalent control group. The experimental class was taught through blended learning, while the control class was taught through the…
Descriptors: Blended Learning, Teaching Methods, Mathematical Concepts, Mathematics Instruction
Rezaeian, Mahbubeh; Seyyedrezaei, Seyyed Hassan; Barani, Ghasem; Seyyedrezaei, Zari Sadat – International Journal of Language Testing, 2020
Individuals are controlled by tests in every advanced society when they want to be admitted to educational courses, to proceed from one stage to the next, or to be given a certificate (Shohamy, 2001b). Accordingly, the present study was carried out to construct and validate educational, social, and psychological consequences questionnaires of…
Descriptors: High Stakes Tests, English (Second Language), Second Language Learning, Factor Analysis
Geiger, Tray J.; Amrein-Beardsley, Audrey – AASA Journal of Scholarship & Practice, 2017
In this commentary, we discuss three types of data manipulations that can occur within teacher evaluation methods: artificial inflation, artificial deflation, and artificial conflation. These types of manipulation are more popularly known in the education profession as instances of Campbell's Law (1976), which states that the higher the…
Descriptors: Teacher Evaluation, Evaluation Methods, Data Analysis, Personnel Policy
Kiliçoglu, Gökhan; Kiliçoglu, Derya Yilmaz; Karadag, Engin – Leadership and Policy in Schools, 2019
Educational organizations in institutionalized environments may try to reflect a legitimate image of the environment in their internal structure. However, there may be a loosely coupled relationship between anticipated legitimacy and the performed actions in the schools. Thus, that lack of congruence between rhetoric and the behaviors constitutes…
Descriptors: Foreign Countries, Organizational Climate, Organizational Culture, Integrity
Mikeska, Jamie N.; Phelps, Geoffrey; Croft, Andrew J. – ETS Research Report Series, 2017
This report describes efforts by a group of science teachers, teacher educators, researchers, and content specialists to conceptualize, develop, and pilot practice-based assessment items designed to measure elementary science teachers' content knowledge for teaching (CKT). The report documents the framework used to specify the content-specific…
Descriptors: Elementary School Teachers, Science Teachers, Knowledge Base for Teaching, Test Items
Priyambodo, Erfan; Marfuatun – International Journal of Evaluation and Research in Education, 2016
Rasch model analysis is now widely used in social research, particularly in educational research. In this research, the Rasch model is used to determine the validity and reliability of systemic multiple-choice questions in chemistry teaching and learning. There were 30 multiple-choice questions with a systemic approach for high school students…
Descriptors: Test Validity, Reliability, Multiple Choice Tests, Science Tests
Drost, Ellen A. – Education Research and Perspectives, 2011
In this paper, the author aims to provide novice researchers with an understanding of the general problem of validity in social science research and to acquaint them with approaches to developing strong support for the validity of their research. She provides insight into these two important concepts, namely (1) validity; and (2) reliability, and…
Descriptors: Social Science Research, Validity, Reliability, Measurement Techniques
Fromm, Germán; Hallinger, Philip; Volante, Paulo; Wang, Wen Chung – Educational Management Administration & Leadership, 2017
The purposes of this study were to report on a systematic approach to validating a Spanish version of the Principal Instructional Management Rating Scale and then to apply the scale in a cross-national comparison of principal instructional leadership. The study yielded a validated Spanish language version of the PIMRS Teacher Form and offers a…
Descriptors: Test Validity, Rating Scales, Spanish, Principals