Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018
The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…
Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability
Hang Jo; Nayoung Kim – Measurement and Evaluation in Counseling and Development, 2024
The purpose of the current study is to validate the Counseling Competencies Scale Revised (CCS-R). We used convenience sampling to recruit counselors and counselors-in-training (N = 156). Specifically, we had 130 female (83.3%) and 26 male (16.7%) participants, and the mean age was 33.16 (SD = 11.18), ranging from 20 to 62 years old. We used…
Descriptors: Counselor Client Relationship, Counseling Effectiveness, Validity, Reliability
van der Scheer, Emmelien A.; Bijlsma, Hannah J. E.; Glas, Cees A. W. – School Effectiveness and School Improvement, 2019
A Bayesian IRT-model approach was used to investigate the validity and reliability of student perceptions of teaching quality. Furthermore, the student perceptions were compared with ratings of teaching quality by external observers. Grade 4 students (n = 675) filled out a questionnaire that was used to measure their opinions about the lessons of…
Descriptors: Student Attitudes, Validity, Interrater Reliability, Correlation
Irby, Sarah M.; Floyd, Randy G. – Psychology in the Schools, 2017
This study examined the exchangeability of total scores (i.e., intelligence quotients [IQs]) from three brief intelligence tests. Tests were administered to 36 children with intellectual giftedness, scored live by one set of primary examiners and later scored by a secondary examiner. For each student, six IQs were calculated, and all 216 values…
Descriptors: Intelligence Tests, Gifted, Error of Measurement, Scores
Harmston, Matt T.; Camara, Wayne J.; Phillips, Christine K. – ACT, Inc., 2019
Average score change: How big is big? This paper discusses school-level changes in average ACT scores and highlights an interactive tool designed to facilitate score change comparisons.
Descriptors: College Entrance Examinations, High School Students, Scores, Reliability
Purwanto; Hidayah, Niswatul; Wagistina, Satti – International Journal of Educational Methodology, 2023
Learning geography in Indonesia philosophically aims to develop spatial literacy. Students must improve spatial literacy to form reasoning skills and apply spatial concepts in real life. Applying Gersmehl's spatial learning can improve students' spatial literacy through syntax arranged based on spatial aspects. The use of Google Earth helps…
Descriptors: Spatial Ability, Natural Disasters, Geography Instruction, Teaching Methods
Lambie, Glenn W.; Tabet, Saundra M.; Stickl Haugen, Jaimie – Teacher Development, 2022
In response to the absence of an instrument to measure educator inspiration with evidence of validity and reliability, the authors developed the "Educator Inspire Scale" (EIS), an assessment designed to assess the construct of inspiration in educators. Therefore, their investigation examined the psychometric properties of the EIS scores…
Descriptors: Teacher Motivation, Faculty Development, Attitude Measures, Test Construction
Diez, Stephanie L.; Fava, Nicole M.; Fernandez, Sofia B.; Mendel, Whitney E. – Sex Education: Sexuality, Society and Learning, 2022
With the exponential growth of online information seeking by young people, it is imperative for health and sexual health educators to consider online information a resource young people will pursue. Access to accurate and comprehensive sexual health information is important, yet there is a scarcity of research evaluating the quality of this…
Descriptors: Sexuality, Sex Education, Usability, Reliability
Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022
Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…
Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores
Patrick, Helen; French, Brian F.; Mantzicopoulos, Panayota – Journal of Psychoeducational Assessment, 2020
We evaluated the score stability of the Framework for Teaching (FFT), a prominent observation instrument used for teacher evaluation. Three raters each scored 200 reading and mathematics lessons taught by 20 kindergarten teachers. Using Generalizability theory analyses, we decomposed the FFT's Classroom Environment, Instruction, and Total scores…
Descriptors: Teacher Evaluation, Observation, Scores, Test Reliability
Martinez, Robert R., Jr.; Foxx, Sejal Parikh; Olsen, Jacob; Kennedy, Stephen D. – Professional School Counseling, 2021
We examined data from a national sample of 917 school counselors to determine the factor structure of the School Counselor STEM Advocacy Survey. An exploratory and confirmatory factor analysis supported use of the two-factor model. Survey scores demonstrated good internal consistency and convergent validity. We discuss differences between key…
Descriptors: School Counselors, Counselor Attitudes, National Surveys, STEM Education
Emrah Higde; Ahmet Volkan Yüzüak; Zekiye Merve Öcal; Hilal Aktamis – Journal of Baltic Science Education, 2024
The Many-Facet Rasch model is frequently used to analyse and minimize disparities in rater (judge) severity in performance evaluations, in which raters assign scores to test-takers' performances. The aim of the present study was to analyse science teacher candidates' laboratory activities by using the Many-Facet Rasch model.…
Descriptors: Science Laboratories, Learning Activities, Science Process Skills, Student Attitudes
Kladouchou, Vasiliki; Papathanasiou, Ilias; Efstratiadou, Eva A.; Christaki, Vasiliki; Hilari, Katerina – International Journal of Language & Communication Disorders, 2017
Background & Aims: This study ran within the framework of the Thales Aphasia Project that investigated the efficacy of elaborated semantic feature analysis (ESFA). We evaluated the treatment integrity (TI) of ESFA, i.e., the degree to which therapists implemented treatment as intended by the treatment protocol, in two different formats:…
Descriptors: Aphasia, Semantics, Speech Therapy, Group Therapy
Goeman, J. J.; De Jong, N. H. – Educational Measurement: Issues and Practice, 2018
Many researchers use Cronbach's alpha to demonstrate internal consistency, even though it has been shown numerous times that Cronbach's alpha is not suitable for this. Because the intention of questionnaire and test constructors is to summarize the test by its overall sum score, we advocate summability, which we define as the proportion of total…
Descriptors: Tests, Scores, Questionnaires, Measurement
Stoevenbelt, Andrea H.; Wicherts, Jelte M.; Flore, Paulette C.; Phillips, Lorraine A. T.; Pietschnig, Jakob; Verschuere, Bruno; Voracek, Martin; Schwabe, Inga – Educational and Psychological Measurement, 2023
When cognitive and educational tests are administered under time limits, tests may become speeded and this may affect the reliability and validity of the resulting test scores. Prior research has shown that time limits may create or enlarge gender gaps in cognitive and academic testing. On average, women complete fewer items than men when a test…
Descriptors: Timed Tests, Gender Differences, Item Response Theory, Correlation