Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 10 |
Since 2006 (last 20 years) | 24 |
Descriptor
Data Analysis | 36 |
Student Evaluation | 36 |
Reliability | 17 |
Test Reliability | 15 |
Evaluation Methods | 12 |
Test Validity | 12 |
Validity | 10 |
Scores | 9 |
Data Collection | 8 |
Interrater Reliability | 7 |
Evaluation Criteria | 6 |
More ▼ |
Source
Author
Aktas, Mehtap | 1 |
Andrews, David M. | 1 |
Asiret, Semih | 1 |
Baran, Evrim | 1 |
Battistone, William A., Jr. | 1 |
Bers, Trudy H. | 1 |
Blaker, Lisa | 1 |
Britton, J. N. | 1 |
Capuano, Nicola | 1 |
Chavez, Oscar | 1 |
Chen, Fu | 1 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 1 |
Researchers | 1 |
Teachers | 1 |
Location
United States | 3 |
Australia | 2 |
Canada | 2 |
Florida | 2 |
Hawaii | 2 |
New York | 2 |
Ohio | 2 |
United Kingdom (England) | 2 |
Asia | 1 |
Brazil | 1 |
California | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
Early Childhood Longitudinal… | 1 |
Measures of Academic Progress | 1 |
Student Teacher Relationship… | 1 |
What Works Clearinghouse Rating
Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018
The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…
Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability
Elturki, Eman – English Teaching Forum, 2020
Accrediting agencies for English language programs, such as the Commission on English Language Program Accreditation (CEA), require a plan in writing for monitoring and reviewing assessment practices. Nonetheless, web-search queries such as "assessing assessment," "how to assess assessment," "assessing assessment…
Descriptors: College Second Language Programs, English (Second Language), Student Evaluation, Test Reliability
Cui, Yang; Chu, Man-Wai; Chen, Fu – Journal of Educational Data Mining, 2019
Digital game-based assessments generate student process data that is much more difficult to analyze than traditional assessments. The formative nature of game-based assessments permits students, through applying and practicing the targeted knowledge and skills during gameplay, to gain experiences, receive immediate feedback, and as a result,…
Descriptors: Educational Games, Student Evaluation, Data Analysis, Bayesian Statistics
Capuano, Nicola; Loia, Vincenzo; Orciuoli, Francesco – IEEE Transactions on Learning Technologies, 2017
Massive Open Online Courses (MOOCs) are becoming an increasingly popular choice for education but, to reach their full extent, they require the resolution of new issues like assessing students at scale. A feasible approach to tackle this problem is peer assessment, in which students also play the role of assessor for assignments submitted by…
Descriptors: Participative Decision Making, Models, Peer Evaluation, Online Courses
Long, Avizia Y.; Shin, Sun-Young; Geeslin, Kimberly; Willis, Erik W. – Language Learning & Technology, 2018
In response to the need for examples of test validation from which everyday language programs can benefit, this paper reports on a study that used Bachman's (2005) assessment use argument (AUA) framework to examine evidence to support claims made about the intended interpretations and uses of scores based on a new web-based Spanish language…
Descriptors: Second Language Instruction, Second Language Learning, Spanish, Computer Assisted Testing
Goodwin, Adam; Chittle, Laura; Dixon, Jess C.; Andrews, David M. – Assessment & Evaluation in Higher Education, 2018
A multi-disciplinary academic unit at a Canadian university completed an evaluation of course syllabi used in its undergraduate programmes over the previous five years. This paper examines the reasons for the evaluation, the processes employed to collect and analyse the data, and how the results will be incorporated into the next steps of the…
Descriptors: Foreign Countries, College Curriculum, Curriculum Evaluation, Course Descriptions
Keller-Margulis, Milena A.; Mercer, Sterett H.; Thomas, Erin L. – School Psychology Quarterly, 2016
The purpose of this study was to examine the reliability of written expression curriculum-based measurement (WE-CBM) in the context of universal screening from a generalizability theory framework. Students in second through fifth grade (n = 145) participated in the study. The sample included 54% female students, 49% White students, 23% African…
Descriptors: Generalizability Theory, Reliability, Written Language, Curriculum Based Assessment
Khan, R. Nazim – International Journal of Mathematical Education in Science and Technology, 2015
Open book assessment is not a new idea, but it does not seem to have gained ground in higher education. In particular, not much literature is available on open book examinations in mathematics and statistics in higher education. The objective of this paper is to investigate the appropriateness of open book assessments in a first-year business…
Descriptors: Evaluation Methods, Higher Education, Mathematics Tests, Statistics
Battistone, William A., Jr. – ProQuest LLC, 2017
Problem: There is an existing cycle of questionable grading practices at the K-12 level. As a result, districts continue to search for innovative methods of evaluating and reporting student progress. One result of this effort has been the adoption of a standards-based grading approach. Research concerning standards-based grading implementation has…
Descriptors: Phenomenology, Beginning Teachers, Teaching Experience, Elementary School Teachers
Pearson, 2018
aimswebPlus® is an assessment, data management, and reporting system that provides national and local performance and growth norms for the screening and progress monitoring of math and reading skills for all students in kindergarten through 8th grade. aimswebPlus uses two types of measures: (1) "curriculum-based measures" (CBMs)--brief,…
Descriptors: Management Systems, Data Analysis, Standards, Response to Intervention
Riley-Ayers, Shannon – Center on Enhancing Early Learning Outcomes, 2014
This policy report provides a guide and framework to early childhood policymakers considering formative assessment. The report defines formative assessment and outlines its process and application in the context of early childhood. The substance of this document is the issues for consideration in the implementation of the formative assessment…
Descriptors: Formative Evaluation, Early Childhood Education, Educational Policy, Policy Formation
Corrigan, M. J.; Gurdineer, E. E. – Journal of Child & Adolescent Substance Abuse, 2012
Objective: This article reports on two separate studies of reliability of the Adolescent Domain Screening Inventory (ADSI), test-retest and internal consistency analyses. The ADSI has shown adequate validity, although reliability has not been established. Methods: Study 1: Students were recruited from two undergraduate courses (N = 29).…
Descriptors: Evidence, Student Evaluation, Data Analysis, Screening Tests
Titley, Jonathan E.; D'Amato, Rik Carl; Koehler-Hak, Kathrine M. – Contemporary School Psychology, 2014
The identification of children at-risk for reading problems can be costly and time-consuming. Previous research has indicated that teachers are relatively accurate in assessing children's overall reading ability. This study investigated the accuracy of kindergarten and first grade teacher rating scales in predicting children's reading…
Descriptors: Literacy, Student Evaluation, Achievement Rating, At Risk Students
Chavez, Oscar; Papick, Ira; Ross, Dan J.; Grouws, Douglas A. – Online Submission, 2010
The purpose of this paper was to describe the process of development of assessment instruments for the Comparing Options in Secondary Mathematics: Investigating Curriculum (COSMIC) project. The COSMIC project was a three-year longitudinal comparative study focusing on evaluating high school students' mathematics learning from two distinct…
Descriptors: Mathematics Education, Mathematics Achievement, Interrater Reliability, Scoring Rubrics
Erford, Bradley T.; Schein, Hallie; Duncan, Kelly – Assessment for Effective Intervention, 2011
The purpose of this study was to provide preliminary analysis of reliability and validity of scores on the "Self-Efficacy Self-Report Scale", which was designed to assess general self-efficacy in students aged 10 to 17 years. Confirmatory factor analysis on cross-validated samples was conducted revealing a marginal fit of the data to the…
Descriptors: Self Efficacy, Measures (Individuals), Factor Analysis, Self Disclosure (Individuals)