Publication Date
In 2025 | 3 |
Since 2024 | 58 |
Since 2021 (last 5 years) | 189 |
Since 2016 (last 10 years) | 438 |
Since 2006 (last 20 years) | 863 |
Descriptor
Student Evaluation | 1647 |
Test Reliability | 950 |
Test Validity | 711 |
Evaluation Methods | 551 |
Reliability | 478 |
Foreign Countries | 398 |
Interrater Reliability | 293 |
Test Construction | 292 |
Validity | 281 |
Higher Education | 264 |
Elementary Secondary Education | 216 |
More ▼ |
Source
Author
Greenan, James P. | 8 |
Tindal, Gerald | 7 |
Baker, Eva L. | 5 |
Deno, Stanley L. | 5 |
Ediger, Marlow | 5 |
Herman, Joan L. | 5 |
Shavelson, Richard J. | 5 |
Bastick, Tony | 4 |
Bracey, Gerald W. | 4 |
Cason, Gerald J. | 4 |
Eva, Kevin W. | 4 |
More ▼ |
Publication Type
Education Level
Audience
Practitioners | 97 |
Teachers | 61 |
Researchers | 60 |
Administrators | 35 |
Students | 11 |
Policymakers | 9 |
Parents | 4 |
Support Staff | 4 |
Community | 3 |
Counselors | 2 |
Location
Australia | 43 |
United Kingdom | 41 |
Turkey | 34 |
United Kingdom (England) | 31 |
Canada | 30 |
Indonesia | 17 |
United States | 16 |
New York | 15 |
China | 14 |
Florida | 14 |
Netherlands | 14 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Young, Karen; James, Kimberley; Noy, Sue – Asia-Pacific Journal of Cooperative Education, 2016
Work integrated learning (WIL) educators using reflective practice to facilitate student learning require a set of standards that works within the traditional assessment frame of Higher Education, to ascertain the level at which reflective practice has been demonstrated. However, there is a paucity of tested assessment instruments that provide…
Descriptors: Work Experience Programs, Reflection, Student Evaluation, Scoring Rubrics
Earle, Sarah – Primary Science, 2017
Moderation is put forward as they key strategy for improving the reliability of teacher assessment. However, for many teachers the word "moderation" conjures up ideas of uncomfortable situations in which marking is being checked by others and there are prolonged arguments about tiny features of individual work. In this article, the…
Descriptors: Grading, Interrater Reliability, Faculty Development, Professional Continuing Education
Smarter Balanced Assessment Consortium, 2020
The Smarter Balanced Assessment Consortium (Smarter Balanced) strives to provide every student with a positive and productive assessment experience, generating results that are a fair and accurate estimate of each student's achievement. Further, Smarter Balanced is building on a framework of accessibility for all students, including English…
Descriptors: Student Evaluation, Evaluation Methods, English Language Learners, Students with Disabilities
Otero-Saborido, Fernando M.; Sánchez-Oliver, Antonio J.; Grimaldi-Puyana, Moisés; Álvarez-García, José – Education & Training, 2018
Purpose: The purpose of this paper is to design and validate a continuous self-assessment tool that involves university students in reflection processes on their Flipped Learning model learning. Design/methodology/approach: For this, 66 students (18.77±1.36) of the first year of the Degree in Physical Activity and Sports Sciences participated for…
Descriptors: Blended Learning, Formative Evaluation, Higher Education, College Students
Scalise, Kathleen; Clarke-Midura, Jody – Journal of Research in Science Teaching, 2018
Science education frameworks in the United States have moved strongly in recent years to incorporate more dimensions of learning, including measuring student use of scientific practices employed during scientific inquiry. For instance, the Next Generation Science Standards and related multidimensional frameworks adopted or adapted recently by more…
Descriptors: Inquiry, Student Research, Science Process Skills, Scientific Concepts
Dumas, Denis G.; McNeish, Daniel M. – Educational Researcher, 2018
Dynamic measurement modeling (DMM) has been shown to improve the consequential validity of longitudinal mathematics assessment in the Early Childhood Longitudinal Study-Kindergarten (ECLS-K) database. Here, the authors demonstrate the capability of DMM to similarly improve the consequential validity of ECLS-K reading assessment through the…
Descriptors: Measurement Techniques, Student Evaluation, Alternative Assessment, Evaluation Methods
Ryan, Kelli A. – ProQuest LLC, 2018
The need to create assessment literate and assessment confident teachers is increasing (Popham, 2009; 2011). Research has revealed that teachers are not well trained to use assessment in the classroom and are poorly trained in standardized testing (Zhang & Burry-Stock, 1997; Zhang & Burry-Stock, 2003). The purpose of this study was to: (1)…
Descriptors: Preservice Teachers, Performance Based Assessment, Knowledge Level, Psychometrics
Conejo, Ricardo; Barros, Beatriz; Bertoa, Manuel F. – IEEE Transactions on Learning Technologies, 2019
This paper presents an innovative method to tackle the automatic evaluation of programming assignments with an approach based on well-founded assessment theories (Classical Test Theory (CTT) and Item Response Theory (IRT)) instead of heuristic assessment as in other systems. CTT and/or IRT are used to grade the results of different items of…
Descriptors: Computer Assisted Testing, Grading, Programming, Item Response Theory
Blackstone, Bethany; Oldmixon, Elizabeth – Journal of Political Science Education, 2019
This article explores the efficacy of specifications grading in undergraduate political science classes. Specifications grading organizes instruction around a set of learning objectives and evaluates student success based on the achievement of carefully articulated specifications for each assessment. Assessments are considered satisfactory or…
Descriptors: Grading, Undergraduate Students, Political Science, Best Practices
Schat, Esther; van der Knaap, Ewout; de Graaff, Rick – Intercultural Communication Education, 2021
Intercultural competence is a crucial element of foreign language education, yet the multifaceted nature of this construct makes it inherently difficult to assess. Although several tools for evaluating intercultural competence currently exist, research on their use in secondary school settings is scarce. This study reports on the development and…
Descriptors: Intercultural Communication, Communicative Competence (Languages), Second Language Learning, Second Language Instruction
Jayashankar, Shailaja; Sridaran, R. – Education and Information Technologies, 2017
Teachers are thrown open to abundance of free text answers which are very daunting to read and evaluate. Automatic assessments of open ended answers have been attempted in the past but none guarantees 100% accuracy. In order to deal with the overload involved in this manual evaluation, a new tool becomes necessary. The unique superlative model…
Descriptors: Word Frequency, Models, Electronic Learning, Student Evaluation
Knezek, Gerald; Christensen, Rhonda – Journal of Computers in Mathematics and Science Teaching, 2020
A conceptual framework for empirical research on the impact of NASA Space Science Education Consortium (NSSEC) activities is presented in this paper, along with a cross-referencing system between the NSF-based NSSEC evaluation framework and historical definitions of comparable psychometric constructs in the literature. A selected set of findings…
Descriptors: Space Sciences, Science Education, Space Exploration, Hands on Science
Bichi, Ado Abdu; Ibrahim, Fatima B.; Ibrahim, Rahinatu H. – Journal of Education and Learning (EduLearn), 2019
Science education is believed to be a vital tool for individual and societal development at large. The persistent low levels of students' achievement in sciences at the various public examinations in Nigeria have continued to draw the attention of major stakeholders in education. This study examined academic achievement of Senior Secondary School…
Descriptors: Foreign Countries, Secondary School Students, Student Evaluation, Secondary School Science
Cui, Yang; Chu, Man-Wai; Chen, Fu – Journal of Educational Data Mining, 2019
Digital game-based assessments generate student process data that is much more difficult to analyze than traditional assessments. The formative nature of game-based assessments permits students, through applying and practicing the targeted knowledge and skills during gameplay, to gain experiences, receive immediate feedback, and as a result,…
Descriptors: Educational Games, Student Evaluation, Data Analysis, Bayesian Statistics
Zumbo, Bruno D.; Hubley, Anita M. – Assessment in Education: Principles, Policy & Practice, 2016
Ultimately, measures in research, testing, assessment and evaluation are used, or have implications, for ranking, intervention, feedback, decision-making or policy purposes. Explicit recognition of this fact brings the often-ignored and sometimes maligned concept of consequences to the fore. Given that measures have personal and social…
Descriptors: Testing Programs, Testing Problems, Measurement Techniques, Student Evaluation