Ji, Xuejun Ryan; Wu, Amery D. – Educational Measurement: Issues and Practice, 2023
Measurement specialists have demonstrated that the Cross-Classified Mixed Effects Model (CCMEM) is a flexible framework for evaluating reliability. Reliability can be estimated based on the variance components of the test scores. Building on that work, this study extends the CCMEM to be used for evaluating validity evidence.…
Descriptors: Measurement, Validity, Reliability, Models
Cristina Menescardi; Aida Carballo-Fazanes; Núria Ortega-Benavent; Isaac Estevan – Journal of Motor Learning and Development, 2024
The Canadian Agility and Movement Skill Assessment (CAMSA) is a valid and reliable circuit-based test of motor competence which can be used to assess children's skills in a live or recorded performance and then coded. We aimed to analyze the intrarater reliability of the CAMSA scores (total, time, and skill score) and time measured, by comparing…
Descriptors: Interrater Reliability, Evaluators, Scoring, Psychomotor Skills
Chen, Dandan; Hebert, Michael; Wilson, Joshua – American Educational Research Journal, 2022
We used multivariate generalizability theory to examine the reliability of hand-scoring and automated essay scoring (AES) and to identify how these scoring methods could be used in conjunction to optimize writing assessment. Students (n = 113) included subsamples of struggling writers and non-struggling writers in Grades 3-5 drawn from a larger…
Descriptors: Reliability, Scoring, Essays, Automation
Hautala, Jarkko; Heikkilä, Riikka; Nieminen, Lea; Rantanen, Vesa; Latvala, Juha-Matti; Richardson, Ulla – Journal of Educational Computing Research, 2020
A computerized game-based assessment (GBA) system for screening reading difficulties may provide substantial time and cost benefits over traditional paper-and-pencil assessment while also providing a means to individually adapt learning content in educational games. To study the reliability and validity of a GBA system to identify struggling readers…
Descriptors: Reading Difficulties, Ability Identification, Evaluation Methods, Reliability
Desstya, Anatri; Prasetyo, Zuhdan Kun; Suyanta; Susila, Ihwan; Irwanto – International Journal of Instruction, 2019
This study aims to report the development of an instrument that is standardized (reviewed for validity, reliability, and difficulty index) to detect science misconceptions in elementary school teachers. This study used a 4-D model: defining, designing, developing, and disseminating. First, it was prepared with 47 open-ended questions, and then it…
Descriptors: Elementary School Teachers, Misconceptions, Evaluation Methods, Teacher Evaluation
Bailes, Lauren P.; Nandakumar, Ratna – International Journal of Education Policy and Leadership, 2020
High-quality measurement tools are critical to school improvement efforts. Education researchers frequently employ surveys in order to assess a host of variables associated with school improvement. This article asserts that Rasch modeling techniques enhance the quality of a measurement tool because they comprise elements of both qualitative and…
Descriptors: Surveys, Evaluation Methods, Item Response Theory, Administrator Role
Kim, Young-Suk Grace; Schatschneider, Christopher; Wanzek, Jeanne; Gatlin, Brandy; Al Otaiba, Stephanie – Reading and Writing: An Interdisciplinary Journal, 2017
We examined how raters and tasks influence measurement error in writing evaluation and how many raters and tasks are needed to reach desirable reliability levels of 0.90 and 0.80 for children in Grades 3 and 4. A total of 211 children (102 boys) were administered three tasks in narrative and expository genres, respectively, and their written…
Descriptors: Writing Evaluation, Elementary School Students, Grade 3, Grade 4
Kim, Young-Suk Grace; Schatschneider, Christopher; Wanzek, Jeanne; Gatlin, Brandy; Al Otaiba, Stephanie – Grantee Submission, 2017
We examined how raters and tasks influence measurement error in writing evaluation and how many raters and tasks are needed to reach desirable reliability levels of 0.90 and 0.80 for children in Grades 3 and 4. A total of 211 children (102 boys) were administered three tasks in narrative and expository genres, respectively, and their written…
Descriptors: Writing Evaluation, Elementary School Students, Grade 3, Grade 4
Ruffini, Stephen J.; Miskell, Ryan; Lindsay, Jim; McInerney, Maurice; Waite, Winsome – Regional Educational Laboratory Midwest, 2016
Many schools identified by states as needing improvement through their Elementary and Secondary Education Act waivers have selected Response to Intervention (RTI), a three-tiered instruction program sometimes referred to as tiered levels of instruction, as one of their main strategies for improving school performance and closing achievement gaps.…
Descriptors: Program Implementation, Fidelity, Response to Intervention, Public Schools
Ruffini, Stephen J.; Lindsay, Jim; Miskell, Ryan; Proger, Amy – Regional Educational Laboratory Midwest, 2016
Regional Educational Laboratory Midwest assisted Milwaukee Public Schools in developing a fidelity monitoring system for measuring schools' progress in implementing Response to Intervention (RTI). The study examined the ratings produced by that system to determine the system's reliability, schools' progress in implementing RTI, and whether ratings…
Descriptors: Program Implementation, Fidelity, Response to Intervention, Public Schools
Macedo-Rouet, Monica; Braasch, Jason L. G.; Britt, M. Anne; Rouet, Jean-Francois – Cognition and Instruction, 2013
In two experiments, we examined fourth and fifth graders' comprehension of the source of information in texts presenting controversial issues. In Experiment 1, participants read short texts in which two people presented different arguments regarding an issue. Participants identified who said what and evaluated each source's knowledge of the issue.…
Descriptors: Elementary School Students, Grade 4, Grade 5, Reading Comprehension
Goffreda, Catherine T.; DiPerna, James Clyde – School Psychology Review, 2010
The Dynamic Indicators of Basic Early Literacy Skills (DIBELS) are brief measures of early literacy skills for students in Grades K-6 (University of Oregon, 2009; see Kaminski & Good, 1996). School psychologists and other educational professionals use DIBELS to identify students who are in need of early intervention. The purpose of this review was…
Descriptors: Early Intervention, Reading Fluency, School Psychologists, Validity
Surapiboonchai, Kampol – ProQuest LLC, 2010
There is a lack of valid and reliable low cost observational instruments to measure moderate to vigorous physical activity (MVPA) in school physical education (PE). The participants in this study were third to tenth grade boys and girls from a south Texas school district. The SAM (Simple Activity Measurement) activity levels were compared with…
Descriptors: Physical Education, Physical Activities, Test Construction, Psychometrics
Lau, Sing; Cheung, Ping Chung – Thinking Skills and Creativity, 2010
With a sample of Grade 4 Chinese students, the present study examined whether the electronic version was comparable to the paper-and-pencil version of the Wallach-Kogan Creativity Tests (WKCT). It was found that the two versions generated similar patterns of reliability coefficients and inter-correlation coefficients for the eight creativity…
Descriptors: Foreign Countries, Creativity, Grade 4, Test Reliability
Papay, John P. – American Educational Research Journal, 2011
Recently, educational researchers and practitioners have turned to value-added models to evaluate teacher performance. Although value-added estimates depend on the assessment used to measure student achievement, the importance of outcome selection has received scant attention in the literature. Using data from a large, urban school district, I…
Descriptors: Urban Schools, Teacher Effectiveness, Reading Achievement, Achievement Tests