Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 14 |
Since 2006 (last 20 years) | 22 |
Descriptor
Test Reliability | 10 |
Interrater Reliability | 9 |
Test Validity | 9 |
Reliability | 6 |
Evaluation Methods | 5 |
Psychometrics | 5 |
Clinical Supervision (of… | 4 |
Cooperating Teachers | 4 |
Correlation | 4 |
High School Students | 4 |
Intervention | 4 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 19 |
Journal Articles | 16 |
Reports - Evaluative | 2 |
Numerical/Quantitative Data | 1 |
Reports - Descriptive | 1 |
Education Level
Higher Education | 6 |
Postsecondary Education | 6 |
High Schools | 5 |
Secondary Education | 5 |
Early Childhood Education | 3 |
Elementary Education | 3 |
Grade 10 | 1 |
Grade 5 | 1 |
Grade 9 | 1 |
Intermediate Grades | 1 |
Middle Schools | 1 |
More ▼ |
Audience
Location
Kansas | 22 |
Tennessee | 5 |
Kentucky | 4 |
Ohio | 4 |
Illinois | 3 |
Missouri | 3 |
Oklahoma | 3 |
Texas | 3 |
Virginia | 3 |
Nebraska | 2 |
Oregon | 2 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Praxis Series | 4 |
Battelle Developmental… | 1 |
Bayley Scales of Infant… | 1 |
MacArthur Communicative… | 1 |
What Works Clearinghouse Rating
Duvall, Steven F.; Fox, Ashley M.; Meeks, Courtney G. – American Journal of Distance Education, 2022
Following the pandemic-related school shutdowns in spring 2020, direct observations continued to be a necessary component of special education evaluations even when students were not present at school. As students began learning at home instead of in classrooms, the continued need for observational data likely compelled most educators to use video…
Descriptors: Interrater Reliability, Distance Education, Observation, COVID-19
Toland, Michael D.; Grisham, Jennifer; Waddell, Misti; Crawford, Rebecca; Dueber, David M. – Topics in Early Childhood Special Education, 2022
Rasch and classification analyses on a field-test version of the third edition of the Assessment, Evaluation, and Programming System (AEPS-3), a curriculum-based assessment used to assess young children birth to age 6 years, were conducted. First, an evaluation of the psychometric properties of data from each developmental area of an AEPS-3…
Descriptors: Curriculum Based Assessment, Field Tests, Young Children, Item Response Theory
Li, Zijia; Gooden, Caroline; Toland, Michael D. – Journal of Early Intervention, 2019
This study provides preliminary evidence for reliability and validity of the Hawaii Early Learning Profile Strands 0-3 (HELP Strands 0-3), an assessment instrument for young children. First, the degree of interobserver agreement for a sample of representative HELP items was examined; results indicated that HELP scoring was dependable and…
Descriptors: Measures (Individuals), Psychometrics, Early Childhood Education, Test Reliability
Tsai, Shu-Chen; Kern, Lee – Journal of Emotional and Behavioral Disorders, 2020
Student views of treatment acceptability of an intervention is important but still is neither regularly assessed nor studied beyond non-behavioral interventions. Furthermore, assessment of treatment acceptability across time is almost never considered. Using data from a longitudinal, randomized controlled trial, we examined variables that…
Descriptors: Intervention, High School Students, Behavior Disorders, Emotional Disturbances
Grisham, Jennifer; Waddell, Misti; Crawford, Rebecca; Toland, Michael – Journal of Early Intervention, 2021
The purpose of this article is to provide evidence of the technical adequacy of the Assessment, Evaluation, and Programming System--Third Edition (AEPS-3). The AEPS has long been identified as one of the most psychometrically sound early childhood curriculum-based assessments. In this article, results of three studies of technical adequacy are…
Descriptors: Infants, Young Children, Curriculum Based Assessment, Psychometrics
Joyce, Jeanette; Brodersen, R. Marc; Meyer, Stephen; Haines, Mckenzie; Weston-Sementelli, Jennifer – Regional Educational Laboratory Central, 2021
Although national assessments for evaluating teacher candidates are available, some state education agencies and education preparation programs have developed their own assessments. These locally developed assessments are based on observations of teaching and other artifacts such as lesson plans and student assignments. However, local assessment…
Descriptors: Test Validity, Test Reliability, Evaluation Methods, Preservice Teachers
Regional Educational Laboratory Central, 2021
This Study Snapshot highlights key findings from a larger study examining the validity and reliability of the Kansas Clinical Assessment Tool (K-CAT), a newly developed tool for assessing the performance of teacher candidates. The study team used interviews with cooperating teachers, content experts' ratings of the alignment of the K-CAT to…
Descriptors: Test Validity, Test Reliability, Evaluation Methods, Preservice Teachers
Joyce, J.; Brodersen, R.; M., Meyer; Haines, M.; Weston-Sementelli, J. – Regional Educational Laboratory Central, 2021
Although national assessments for evaluating teacher candidates are available, some state education agencies and education preparation programs have developed their own assessments. These locally developed assessments are based on observations of teaching and other artifacts such as lesson plans and student assignments. However, local assessment…
Descriptors: Test Validity, Test Reliability, Evaluation Methods, Preservice Teachers
Regional Educational Laboratory Central, 2021
The "Examination of the Validity and Reliability of the Kansas Clinical Assessment Tool" study explored the validity and reliability of the Kansas Clinical Assessment Tool (K-CAT), a newly developed tool for assessing the performance of teacher candidates. The study found that cooperating teachers reported that the K-CAT accurately…
Descriptors: Test Validity, Test Reliability, Evaluation Methods, Preservice Teachers
Gnacinski, Stacy L.; Janes, Mikaela K.; Newman, Nathan D. – Measurement in Physical Education and Exercise Science, 2019
The purposes of this study were to examine the factorial validity of the six-item Intragroup Conflict Scale (ICS) in athletic trainer (AT) and coach populations and to examine measurement invariance by gender and profession. A total of 195 ATs and 615 head or assistant coaches working at secondary schools or National Collegiate Athletic…
Descriptors: Measures (Individuals), Factor Structure, Conflict, Test Validity
LoCasale-Crouch, Jennifer; Jamil, Faiza; Pianta, Robert C.; Rudasill, Kathleen Moritz; DeCoster, Jamie – SAGE Open, 2018
This study examined how overall quality and within-day consistency in fifth graders' teacher-student interactions related to feelings about, engagement, and academic performance in school. Participants were 956 children in a national study. Students who experienced higher quality interactions reported more positive feelings about school, were more…
Descriptors: Teacher Student Relationship, Grade 5, Psychological Patterns, Academic Achievement
Nakamura, Christopher M.; Murphy, Sytil K.; Christel, Michael G.; Stevens, Scott M.; Zollman, Dean A. – Physical Review Physics Education Research, 2016
Computer-automated assessment of students' text responses to short-answer questions represents an important enabling technology for online learning environments. We have investigated the use of machine learning to train computer models capable of automatically classifying short-answer responses and assessed the results. Our investigations are part…
Descriptors: Physics, Introductory Courses, Science Instruction, Intelligent Tutoring Systems
Shogren, Karrie A.; Wehmeyer, Michael L.; Little, Todd D.; Forber-Pratt, Anjali J.; Palmer, Susan B.; Seo, Hyojeong – Career Development and Transition for Exceptional Individuals, 2017
The purpose of this article is to describe preliminary psychometric characteristics of a student self-report measure of self-determination, the "Self-Determination Inventory: Student Report" version (SDI-SR), designed for youth with and without disabilities. We administered the draft assessment to 311 youth and examined item functioning…
Descriptors: Construct Validity, Test Reliability, Scores, Psychometrics
Latimer, Marvin E., Jr.; Bergee, Martin J.; Cohen, Mary L. – Journal of Research in Music Education, 2010
The purpose of this study was to investigate the reliability and perceived pedagogical utility of a multidimensional weighted performance assessment rubric used in Kansas state high school large-group festivals. Data were adjudicator rubrics (N = 2,016) and adjudicator and director questionnaires (N = 515). Rubric internal consistency was…
Descriptors: Music Activities, State Programs, Performance Based Assessment, Weighted Scores
Sandbank, Micheal; Yoder, Paul – Topics in Early Childhood Special Education, 2014
Generalizability and decision studies provide a mathematical framework for quantifying the stability of a given number of measurements. This approach is especially relevant to the task of obtaining a representative measure of communicative behavior in young children and supports an alternative to the debate regarding which type of assessment…
Descriptors: Developmental Delays, Toddlers, Intervention, Vocabulary Development
Previous Page | Next Page ยป
Pages: 1 | 2