Publication Date
In 2025 | 67 |
Since 2024 | 901 |
Since 2021 (last 5 years) | 3415 |
Since 2016 (last 10 years) | 7595 |
Since 2006 (last 20 years) | 14761 |
Descriptor
Test Reliability | 14547 |
Test Validity | 9865 |
Reliability | 9544 |
Foreign Countries | 6751 |
Test Construction | 4608 |
Validity | 4120 |
Measures (Individuals) | 3750 |
Factor Analysis | 3720 |
Psychometrics | 3393 |
Interrater Reliability | 3054 |
Correlation | 3009 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 703 |
Practitioners | 447 |
Teachers | 204 |
Administrators | 121 |
Policymakers | 62 |
Counselors | 42 |
Students | 37 |
Parents | 11 |
Community | 7 |
Media Staff | 5 |
Support Staff | 5 |
More ▼ |
Location
Turkey | 1246 |
Australia | 428 |
Canada | 371 |
China | 329 |
United States | 264 |
United Kingdom | 246 |
Taiwan | 221 |
Netherlands | 217 |
Indonesia | 214 |
California | 208 |
Spain | 201 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Meets WWC Standards without Reservations | 8 |
Meets WWC Standards with or without Reservations | 9 |
Does not meet standards | 6 |
Dymond, Stacy K.; Chun, Eul Jung; Kim, Rah Kyung; Renzaglia, Adelle – Remedial and Special Education, 2013
A statewide survey of coordinators of inclusive high school service-learning programs was conducted to validate elements, methods, and barriers to including students with and without disabilities in service-learning. Surveys were mailed to 655 service-learning coordinators; 190 (29%) returned a completed survey. Findings support the validity of…
Descriptors: Service Learning, Surveys, Validity, Inclusion
Rogers, Katherine D.; Young, Alys; Lovell, Karina; Campbell, Malcolm; Scott, Paul R.; Kendal, Sarah – Journal of Deaf Studies and Deaf Education, 2013
The present study is aimed to translate 3 widely used clinical assessment measures into British Sign Language (BSL), to pilot the BSL versions, and to establish their validity and reliability. These were the Patient Health Questionnaire (PHQ-9), the Generalized Anxiety Disorder 7-item (GAD-7) scale, and the Work and Social Adjustment Scale (WSAS).…
Descriptors: Foreign Countries, Deafness, Sign Language, Mental Health
Hill, Tara M.; Laux, John M.; Stone, Gregory; Dupuy, Paula; Scott, Holly – Journal of Addictions & Offender Counseling, 2013
Rasch analysis of the Substance Abuse Subtle Screening Inventory-3 (SASSI-3; F. G. Miller & Lazowski, 1999) indicated that the SASSI-3 meets fundamental measurement properties; however, the authors of the current study recommend the elimination of nonfunctioning items and the improvement of response options for the face valid scales to…
Descriptors: Test Items, Substance Abuse, Usability, Test Validity
Coffee, Gina; Kratochwill, Thomas R. – Journal of Educational & Psychological Consultation, 2013
In this study we examined the extent to which teachers implement and generalize a praise intervention learned during behavioral consultation. Four elementary teachers and 15 of their students (3-4 per teacher) participated in the study. In each classroom, 1 student was randomly assigned as the consultation target student, 1 as the generalization…
Descriptors: Intervention, Generalization, Consultation Programs, Reliability
Partnership for Assessment of Readiness for College and Careers, 2018
The purpose of this technical report is to describe the third operational administration of the Partnership for Assessment of Readiness for College and Careers (PARCC) assessments in the 2016-2017 academic year. PARCC is a state-led consortium creating next-generation assessments that, compared to traditional K-12 assessments, more accurately…
Descriptors: College Readiness, Career Readiness, Common Core State Standards, Language Arts
Furlong, Michael J.; Dowdy, Erin; Nylund-Gibson, Karen – Grantee Submission, 2018
This manual reports on the development and validation of the original Social Emotional Health Survey-Secondary (carried out between 2012 and 2017). We shared the first version of the SEHS-S because it had sufficient validation evidence based on research completed by 2015; hence, the form reported on in this manual is called the SEHS-S (2015)…
Descriptors: Surveys, Psychometrics, Test Validity, Mental Health
Pearson, 2018
aimswebPlus® is an assessment, data management, and reporting system that provides national and local performance and growth norms for the screening and progress monitoring of math and reading skills for all students in kindergarten through 8th grade. aimswebPlus uses two types of measures: (1) "curriculum-based measures" (CBMs)--brief,…
Descriptors: Management Systems, Data Analysis, Standards, Response to Intervention
Krukowski, Rebecca A.; Philyaw Perez, Amanda G.; Bursac, Zoran; Goodell, Melanie; Raczynski, James M.; Smith West, Delia; Phillips, Martha M. – Journal of School Health, 2011
Background: Foods provided in schools represent a substantial portion of US children's dietary intake; however, the school food environment has proven difficult to describe due to the lack of comprehensive, standardized, and validated measures. Methods: As part of the Arkansas Act 1220 evaluation project, we developed the School Cafeteria…
Descriptors: Health Promotion, Nutrition, Public Health, Interrater Reliability
Brimi, Hunter M. – Practical Assessment, Research & Evaluation, 2011
This research replicates the work of Starch and Elliot (1912) by examining the reliability of the grading by English teachers in a single school district. Ninety high school teachers graded the same student paper following professional development sessions in which they were trained to use NWREL's "6+1 Traits of Writing." These participants had…
Descriptors: Grading, Reliability, Secondary School Teachers, English Teachers
Flynn Longmire, Crystal V.; Knight, Bob G. – Gerontologist, 2011
Purpose of the study: Although the Zarit Burden Interview (ZBI) is one of the most extensively used measures in research for caregiver burden, few researchers have examined its factor structure. Furthermore, though the ZBI has also been used in cross-group comparisons of burden, there have not been studies of whether or not it measures burden…
Descriptors: Factor Analysis, Interviews, Dementia, Caregivers
Kaya, Taciser; Goksel Karatepe, Altinay; Gunaydin, Rezzan; Koc, Aysegul; Altundal Ercan, Ulku – International Journal of Rehabilitation Research, 2011
The Modified Ashworth Scale (MAS) is commonly used in clinical practice for grading spasticity. However, it was modified recently by omitting grade "1+" of the MAS and redefining grade "2". The aim of this study was to investigate the inter-rater reliability of MAS and modified MAS (MMAS) for the assessment of poststroke elbow flexor spasticity.…
Descriptors: Interrater Reliability, Patients, Measures (Individuals), Neurological Impairments
d'Uva, Teresa Bago; Lindeboom, Maarten; O'Donnell, Owen; van Doorslaer, Eddy – Journal of Human Resources, 2011
We propose tests of the two assumptions under which anchoring vignettes identify heterogeneity in reporting of categorical evaluations. Systematic variation in the perceived difference between any two vignette states is sufficient to reject "vignette equivalence." "Response consistency"--the respondent uses the same response…
Descriptors: Vignettes, Reliability, Older Adults, Responses
Sahin, Semiha; Cek, Fatma; Zeytin, Nalan – Educational Sciences: Theory and Practice, 2011
The purpose of this study is to gather the educational supervisors' opinions regarding whether the supervision system and in-service training courses reaches its aim and to obtain their suggestions about the restructuring of the supervision system. The sample of the study is composed of 104 supervisors. The qualitative data were collected through…
Descriptors: Opinions, Supervision, Content Analysis, Supervisors
Weatherly, Jeffrey N.; Derenne, Adam; Terrell, Heather K. – Psychological Record, 2011
Several measures of delay discounting have been shown to be reliable over periods of up to 3 months. In the present study, 115 participants completed a fill-in-the-blank (FITB) delay-discounting task on sets of 5 different commodities, 12 weeks apart. Results showed that discounting rates were not well described by a hyperbolic function but were…
Descriptors: Delay of Gratification, Reliability, Test Format, Measures (Individuals)
Drevon, Daniel D. – Journal of Psychoeducational Assessment, 2011
This article presents a review of the "Preschool Behavioral and Emotional Rating Scale" (PreBERS), a 42-item family member--or school personnel--completed rating scale designed to measure the behavioral and emotional strengths of preschool children ages 3-0 to 5-11. According to the manual, results can be used to identify preschoolers with limited…
Descriptors: Behavior Rating Scales, Preschool Children, Child Behavior, Psychological Patterns