Publication Date
In 2025 | 1 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 16 |
Since 2016 (last 10 years) | 54 |
Since 2006 (last 20 years) | 90 |
Descriptor
Scores | 91 |
Test Reliability | 44 |
Reliability | 41 |
Psychometrics | 29 |
Grade 3 | 28 |
Elementary School Students | 26 |
Foreign Countries | 26 |
Test Validity | 25 |
Correlation | 24 |
Early Childhood Education | 23 |
Kindergarten | 23 |
More ▼ |
Source
Author
Publication Type
Reports - Research | 76 |
Journal Articles | 69 |
Numerical/Quantitative Data | 7 |
Reports - Evaluative | 6 |
Reports - Descriptive | 5 |
Dissertations/Theses -… | 4 |
Tests/Questionnaires | 4 |
Guides - Non-Classroom | 1 |
Education Level
Early Childhood Education | 91 |
Primary Education | 58 |
Elementary Education | 51 |
Grade 3 | 30 |
Kindergarten | 24 |
Preschool Education | 24 |
Intermediate Grades | 18 |
Grade 2 | 17 |
Grade 4 | 17 |
Middle Schools | 15 |
Grade 1 | 12 |
More ▼ |
Audience
Researchers | 1 |
Location
Pennsylvania | 5 |
Illinois | 4 |
Turkey | 4 |
United States | 4 |
Colorado | 3 |
California | 2 |
Canada | 2 |
Finland | 2 |
Florida | 2 |
Germany | 2 |
Indiana | 2 |
More ▼ |
Laws, Policies, & Programs
Education for All Handicapped… | 1 |
Education of the Handicapped… | 1 |
Head Start | 1 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
D'Agostino, Jerome V.; Rodgers, Emily; Winkler, Christa; Johnson, Tracy; Berenbon, Rebecca – Reading Psychology, 2021
Running Records provide a standardized method for recording and assessing students' oral reading behaviors and are excellent formative assessment tools to guide instructional decision-making. This study expands on prior Running Record reliability work by evaluating the extent to which external raters and teachers consistently assessed students'…
Descriptors: Accuracy, Oral Reading, Generalizability Theory, Error Correction
Parker, David C.; Stewart, Lisa H.; Thomson, Susan; Kaminski, Ruth A. – Assessment for Effective Intervention, 2021
Vocabulary skills are important for overall reading competence, but vocabulary assessment approaches that inform instructional decision-making and are sensitive to improvement are limited. This article describes a process for developing vocabulary measures designed to facilitate data-driven decision-making for kindergarten and first-grade students…
Descriptors: Vocabulary, Kindergarten, Grade 1, Elementary School Students
McLeod, Bryce D.; Sutherland, Kevin S.; Broda, Michael; Granger, Kristen L.; Martinez, Ruben G.; Conroy, Maureen A.; Snyder, Patricia A.; Southam-Gerow, Michael A. – Prevention Science, 2022
Though treatment integrity measurement is important for research intended to promote social and behavioral outcomes of children at risk for emotional and behavioral disorders (EBDs) in early childhood settings, measurement gaps exist in the field. This paper reports on the development and preliminary psychometric assessment of the treatment…
Descriptors: Psychometrics, Measures (Individuals), Fidelity, Integrity
Fritz, Ronda; Harn, Beth; Biancarosa, Gina; Lucero, Audrey; Flannery, K. Brigid – Assessment for Effective Intervention, 2019
This study investigated the use of brief observations to measure implementation of small group interventions using the Quality of Intervention Delivery and Receipt (QIDR) tool. Videos of 10-min segments representing the beginning, middle, and end of each 30-min intervention lesson were coded for implementation. Results indicated that (a)…
Descriptors: Intervention, Program Implementation, Efficiency, Observation
VanDerHeyden, Amanda M.; Codding, Robin; Solomon, Benjamin G. – Remedial and Special Education, 2023
Computer-based curriculum-based measurement (CBM) is a relatively common practice, but surprisingly few studies have examined the reliability of computer-based CBM. This study sought to examine the reliability of CBM administered via paper/pencil versus the computer. Twenty-one of 25 students in two third-grade classes (N = 21) participated in two…
Descriptors: Curriculum Based Assessment, Computer Assisted Testing, Test Format, Grade 3
Pakarinen, Eija; Malmberg, Lars-Erik; Poikkeus, Anna-Maija; Siekkinen, Martti; Lerkkanen, Marja-Kristiina – International Journal of Research & Method in Education, 2023
When classroom observations are increasingly used for accountability and evaluation purposes, a deeper understanding of the psychometric properties of such measurement tools is needed. The present study took a unique approach to examine the psychometric properties of a commonly used classroom observation measure by testing the reliability of…
Descriptors: Foreign Countries, Kindergarten, Grade 1, Elementary Schools
Pentimonti, Jill M.; Bowles, Ryan P.; Zucker, Tricia A.; Tambyraja, Sherine R.; Justice, Laura M. – Grantee Submission, 2021
Measuring the quality of classroom-based interactive shared book reading within the early childhood classroom represents a specific dimension of teacher-child interactions that is of great interest to researchers. This interest reflects decades of research demonstrating the benefit of reading to young children in both the home and the classroom.…
Descriptors: Standardized Tests, Test Construction, Construct Validity, Predictive Validity
Jiayi Wang; Michael T. Kalkbrenner; Riley Schaner – Psychology in the Schools, 2025
Teaching is a stressful profession with a high turnover rate. Schools and related institutions need to take more action to support teachers and keep teacher stress at a manageable level. The continued research and practical effort require measures to examine teachers' stress in a briefer and accurate manner. The Teacher Stress Scale is a recently…
Descriptors: Elementary School Teachers, Secondary School Teachers, Preschool Teachers, Stress Variables
McLeod, Bryce D.; Sutherland, Kevin S.; Broda, Michael; Granger, Kristen L.; Martinez, Ruben G.; Conroy, Maureen A.; Snyder, Patricia A.; Southam-Gerow, Michael A. – Grantee Submission, 2021
Though treatment integrity measurement is important for research intended to promote social and behavioral outcomes of children at risk for emotional and behavioral disorders (EBDs) in early childhood settings, measurement gaps exist in the field. This paper reports on the development and preliminary psychometric assessment of the treatment…
Descriptors: Psychometrics, Measures (Individuals), Fidelity, Integrity
Carbonneau, Kira J.; Van Orman, Dustin S. J.; Lemberger-Truelove, Matthew E.; Atencio, David J. – Early Education and Development, 2020
Research Findings: Given the variable nature of early childhood settings, practitioners and researchers need better guidance on what conditions influence observations conducted within early childhood settings (National Research Council, 2008). Using 230 observations from 23 three- and four-year-old children, we conducted a Generalizability study…
Descriptors: Classroom Environment, Observation, Preschool Children, Influences
Li, Zijia; Gooden, Caroline; Toland, Michael D. – Journal of Early Intervention, 2019
This study provides preliminary evidence for reliability and validity of the Hawaii Early Learning Profile Strands 0-3 (HELP Strands 0-3), an assessment instrument for young children. First, the degree of interobserver agreement for a sample of representative HELP items was examined; results indicated that HELP scoring was dependable and…
Descriptors: Measures (Individuals), Psychometrics, Early Childhood Education, Test Reliability
Merchant, Stefan; Rich, Jessica; Klinger, Don A. – Canadian Journal of Educational Administration and Policy, 2022
Both school and district administrators use the results of standardized, large-scale tests to inform decisions about the need for, or success of, educational programs and interventions. However, test results at the school level are subject to random fluctuations due to changes in cohort, test items, and other factors outside of the school's…
Descriptors: Standardized Tests, Foreign Countries, Generalizability Theory, Scores
Patrick, Helen; French, Brian F.; Mantzicopoulos, Panayota – Journal of Psychoeducational Assessment, 2020
We evaluated the score stability of the Framework for Teaching (FFT), a prominent observation instrument used for teacher evaluation. Three raters each scored 200 reading and mathematics lessons taught by 20 kindergarten teachers. Using Generalizability theory analyses, we decomposed the FFT's Classroom Environment, Instruction, and Total scores…
Descriptors: Teacher Evaluation, Observation, Scores, Test Reliability
Wells, Craig S.; Sireci, Stephen G. – Applied Measurement in Education, 2020
Student growth percentiles (SGPs) are currently used by several states and school districts to provide information about individual students as well as to evaluate teachers, schools, and school districts. For SGPs to be defensible for these purposes, they should be reliable. In this study, we examine the amount of systematic and random error in…
Descriptors: Growth Models, Reliability, Scores, Error Patterns
Harding, Jessica F.; Nguyen, Tutrang; Malone, Lizabeth; Atkins-Burnett, Sally; Tarullo, Louisa; Aikens, Nikki – Office of Planning, Research and Evaluation, 2022
In spring 2020, in response to the COVID-19 (for coronavirus disease 2019) pandemic, many early care and education centers, including Head Start centers, closed their physical buildings and changed their operations to virtual. Because of health and safety restrictions, OPRE was unable to directly assess children's skills in spring 2020 for the…
Descriptors: Federal Programs, Low Income Students, Social Services, Experience