Publication Date
In 2025 | 2 |
Since 2024 | 15 |
Since 2021 (last 5 years) | 68 |
Since 2016 (last 10 years) | 171 |
Since 2006 (last 20 years) | 439 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 28 |
Practitioners | 2 |
Policymakers | 1 |
Students | 1 |
Location
Turkey | 14 |
Canada | 10 |
United States | 10 |
California | 9 |
Netherlands | 9 |
Australia | 6 |
Germany | 6 |
South Korea | 6 |
Iowa | 5 |
Norway | 5 |
Turkey (Ankara) | 5 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 2 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Till, Hettie; Ker, Jean; Myford, Carol; Stirling, Kevin; Mires, Gary – Advances in Health Sciences Education, 2015
The authors report final-year ward simulation data from the University of Dundee Medical School. Faculty who designed this assessment intend for the final score to represent an individual senior medical student's level of clinical performance. The results are included in each student's portfolio as one source of evidence of the student's…
Descriptors: Foreign Countries, Simulation, Clinical Experience, Medical Education
Zhang, Bo; Xiao, Yunnan; Luo, Juan – Language Testing in Asia, 2015
Previous studies comparing holistic scoring to analytic scoring of second language writing have given mixed results. Some of them suffer from methodological drawbacks, such as limited writing sample size, limited number of raters, and lack of direct comparison of the two methods. Based on 300 writing samples graded by 14 raters, this research…
Descriptors: Evaluators, Reliability, Scores, Holistic Approach
Smith, Martin M.; Saklofske, Donald H.; Yan, Gonggu; Sherry, Simon B. – Measurement and Evaluation in Counseling and Development, 2016
This study supports the generalizability of perfectionistic strivings and concerns across Canadian and Chinese university students (N = 1,006) and demonstrates the importance of establishing measurement invariance prior to hypothesis testing with different groups. No latent mean difference in perfectionistic concerns was observed, but Canadian…
Descriptors: Foreign Countries, Cultural Differences, Personality Traits, Hypothesis Testing
Kidzinsk, Lukasz; Sharma, Kshitij; Boroujeni, Mina Shirvani; Dillenbourg, Pierre – International Educational Data Mining Society, 2016
The big data imposes the key problem of generalizability of the results. In the present contribution, we discuss statistical tools which can help to select variables adequate for target level of abstraction. We show that a model considered as over-fitted in one context can be accurate in another. We illustrate this notion with an example analysis…
Descriptors: Generalizability Theory, Online Courses, Large Group Instruction, Models
Rupp, André A. – Applied Measurement in Education, 2018
This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…
Descriptors: Design, Automation, Scoring, Test Scoring Machines
Guler, Nese – Educational Research and Reviews, 2014
Nowadays, rapid changes in science and technology increase the demand of qualified individuals who have signs of disciplined mind which is hightlighted in Howard Gardner's (2006) five minds as one type of mind. So, it is important to measure whether individuals have disciplined mind or not. Based on this idea, it is aimed to evaluate the…
Descriptors: Answer Keys, Reliability, Grade 7, Generalizability Theory
Marsh, Herbert W. – Journal of Educational Psychology, 2016
Given that the Big-Fish-Little-Pond-Effect, the negative effect of school-average achievement on academic self-concept, is one of the most robust findings in educational psychology (Marsh, Seaton et al., 2007), this research extends the theoretical model, based on social comparison theory, to study relative year in school effects (e.g., being 1…
Descriptors: Cross Cultural Studies, Acceleration (Education), Grade Repetition, Self Concept
Charalambous, Charalambos Y.; Kyriakides, Ermis; Tsangaridou, Niki; Kyriakides, Leonidas – School Effectiveness and School Improvement, 2017
Heightened accountability pressures and an increased emphasis on teaching quality have directed scholarly attention to scrutinizing instruction, particularly with respect to issues of validity and reliability. However, these attempts have largely been directed toward "core" content areas and investigated generic or content-specific…
Descriptors: Physical Education, Instructional Effectiveness, Lesson Plans, Interrater Reliability
Cetin, Bayram; Guler, Nese; Sarica, Rabia – Eurasian Journal of Educational Research, 2016
Problem Statement: In addition to being teaching tools, concept maps can be used as effective assessment tools. The use of concept maps for assessment has raised the issue of scoring them. Concept maps generated and used in different ways can be scored via various methods. Holistic and relational scoring methods are two of them. Purpose of the…
Descriptors: Generalizability Theory, Concept Mapping, Scoring, Scoring Formulas
Kim, Sungyeun; Berebitsky, Dan – EURASIA Journal of Mathematics, Science & Technology Education, 2016
This study investigates error sources and the effects of each error source to determine optimal weights of the composite score of teacher recommendation letters and self-introduction letters using multivariate generalizability theory. Data were collected from the science education institute for the gifted attached to the university located within…
Descriptors: Academically Gifted, Foreign Countries, Mathematics, Mathematics Instruction
Oh, Yoonkyung; Osgood, D. Wayne; Smith, Emilie P. – Journal of Early Adolescence, 2015
The importance of afterschool hours for youth development is widely acknowledged, and afterschool settings have recently received increasing attention as an important venue for youth interventions, bringing a growing need for reliable and valid measures of afterschool quality. This study examined the extent to which the two observational tools,…
Descriptors: After School Programs, Program Effectiveness, Observation, Rating Scales
Han, Turgay – International Journal of Progressive Education, 2017
The aim of this study is to examine the variability in and reliability of scores assigned to different quality EFL compositions by EFL instructors and their rating behaviors. Using a mixed research design, quantitative data were collected from EFL instructors' ratings of 30 compositions of three different qualities using a holistic scoring rubric.…
Descriptors: English (Second Language), Writing Evaluation, Scores, Expertise
Swirski, Hani; Baram-Tsabari, Ayelet; Yarden, Anat – International Journal of Science Education, 2018
Context-based approaches can bridge the gap between abstract, difficult science concepts and the world students live in. However, the relevance of specific contexts to different groups of learners, and its stability over time, have not been extensively explored. This study used four datasets, collected in different formal and informal settings, to…
Descriptors: Elementary School Students, Secondary School Students, Student Interests, Learner Engagement
Chen, Dezhi; Hu, Bi Ying; Fan, Xitao; Li, Kejian – Journal of Psychoeducational Assessment, 2014
Adapted from the Early Childhood Environment Rating Scale-Revised, the Chinese Early Childhood Program Rating Scale (CECPRS) is a culturally comparable measure for assessing the quality of early childhood education and care programs in the Chinese cultural/social contexts. In this study, 176 kindergarten classrooms were rated with CECPRS on eight…
Descriptors: Foreign Countries, Rating Scales, Early Childhood Education, Educational Environment
Schweig, Jonathan David – Applied Measurement in Education, 2014
Developing indicators that reflect important aspects of school and classroom environments has become central in a nationwide effort to develop comprehensive programs that measure teacher quality and effectiveness. Formulating teacher evaluation policy necessitates accurate and reliable methods for measuring these environmental variables. This…
Descriptors: Error of Measurement, Educational Environment, Classroom Environment, Surveys