ERIC - Search Results

Publication Date

In 2025	2
Since 2024	15
Since 2021 (last 5 years)	68
Since 2016 (last 10 years)	171
Since 2006 (last 20 years)	439

Descriptor

Generalizability Theory	728
Reliability	168
Scores	146
Error of Measurement	133
Test Reliability	125
Interrater Reliability	120
Foreign Countries	102
Statistical Analysis	85
Evaluation Methods	82
Psychometrics	75
Research Methodology	67
Validity	66
Test Validity	64
Models	62
Comparative Analysis	59
Correlation	59
Higher Education	59
Scoring	59
Item Response Theory	57
Performance Based Assessment	57
Research Design	57
Test Items	54
Test Construction	49
Elementary School Students	48
Test Theory	47
More ▼

Education Level

Higher Education	115
Postsecondary Education	68
Elementary Education	59
Secondary Education	42
Middle Schools	33
Elementary Secondary Education	29
Early Childhood Education	24
Junior High Schools	22
Grade 8	17
Grade 3	15
Preschool Education	15
Grade 4	14
Grade 5	13
Primary Education	13
Grade 7	12
High Schools	12
Intermediate Grades	11
Adult Education	10
Grade 6	7
Kindergarten	7
Grade 10	6
Grade 9	6
Grade 1	4
Grade 2	4
Two Year Colleges	3
More ▼

Audience

Researchers	28
Practitioners	2
Policymakers	1
Students	1

Location

Turkey	14
Canada	10
United States	10
California	9
Netherlands	9
Australia	6
Germany	6
South Korea	6
Iowa	5
Norway	5
Turkey (Ankara)	5
United Kingdom	5
Florida	4
South Africa	4
Tennessee	4
China	3
Hong Kong	3
Indiana	3
Japan	3
North Carolina	3
Texas	3
Alabama	2
China (Beijing)	2
Colorado	2
Cyprus	2
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 151 to 165 of 728 results Save | Export

Constructing and Evaluating a Validity Argument for the Final-Year Ward Simulation Exercise

Peer reviewed

Direct link

Till, Hettie; Ker, Jean; Myford, Carol; Stirling, Kevin; Mires, Gary – Advances in Health Sciences Education, 2015

The authors report final-year ward simulation data from the University of Dundee Medical School. Faculty who designed this assessment intend for the final score to represent an individual senior medical student's level of clinical performance. The results are included in each student's portfolio as one source of evidence of the student's…

Descriptors: Foreign Countries, Simulation, Clinical Experience, Medical Education

Rater Reliability and Score Discrepancy under Holistic and Analytic Scoring of Second Language Writing

Peer reviewed

Direct link

Zhang, Bo; Xiao, Yunnan; Luo, Juan – Language Testing in Asia, 2015

Previous studies comparing holistic scoring to analytic scoring of second language writing have given mixed results. Some of them suffer from methodological drawbacks, such as limited writing sample size, limited number of raters, and lack of direct comparison of the two methods. Based on 300 writing samples graded by 14 raters, this research…

Descriptors: Evaluators, Reliability, Scores, Holistic Approach

Peer reviewed

Direct link

Smith, Martin M.; Saklofske, Donald H.; Yan, Gonggu; Sherry, Simon B. – Measurement and Evaluation in Counseling and Development, 2016

This study supports the generalizability of perfectionistic strivings and concerns across Canadian and Chinese university students (N = 1,006) and demonstrates the importance of establishing measurement invariance prior to hypothesis testing with different groups. No latent mean difference in perfectionistic concerns was observed, but Canadian…

Descriptors: Foreign Countries, Cultural Differences, Personality Traits, Hypothesis Testing

On Generalizability of MOOC Models

Peer reviewed
PDF on ERIC

Download full text

Kidzinsk, Lukasz; Sharma, Kshitij; Boroujeni, Mina Shirvani; Dillenbourg, Pierre – International Educational Data Mining Society, 2016

The big data imposes the key problem of generalizability of the results. In the present contribution, we discuss statistical tools which can help to select variables adequate for target level of abstraction. We show that a model considered as over-fitted in one context can be accurate in another. We illustrate this notion with an example analysis…

Descriptors: Generalizability Theory, Online Courses, Large Group Instruction, Models

Designing, Evaluating, and Deploying Automated Scoring Systems with Validity in Mind: Methodological Design Decisions

Peer reviewed

Direct link

Rupp, André A. – Applied Measurement in Education, 2018

This article discusses critical methodological design decisions for collecting, interpreting, and synthesizing empirical evidence during the design, deployment, and operational quality-control phases for automated scoring systems. The discussion is inspired by work on operational large-scale systems for automated essay scoring but many of the…

Descriptors: Design, Automation, Scoring, Test Scoring Machines

An Evaluation of the Answer Key Used in Determining the 7th Grade Students' Levels of Disciplined Mind in Terms of Generalizability Theory

Peer reviewed

Direct link

Guler, Nese – Educational Research and Reviews, 2014

Nowadays, rapid changes in science and technology increase the demand of qualified individuals who have signs of disciplined mind which is hightlighted in Howard Gardner's (2006) five minds as one type of mind. So, it is important to measure whether individuals have disciplined mind or not. Based on this idea, it is aimed to evaluate the…

Descriptors: Answer Keys, Reliability, Grade 7, Generalizability Theory

Cross-Cultural Generalizability of Year in School Effects: Negative Effects of Acceleration and Positive Effects of Retention on Academic Self-Concept

Peer reviewed

Direct link

Marsh, Herbert W. – Journal of Educational Psychology, 2016

Given that the Big-Fish-Little-Pond-Effect, the negative effect of school-average achievement on academic self-concept, is one of the most robust findings in educational psychology (Marsh, Seaton et al., 2007), this research extends the theoretical model, based on social comparison theory, to study relative year in school effects (e.g., being 1…

Descriptors: Cross Cultural Studies, Acceleration (Education), Grade Repetition, Self Concept

Exploring the Reliability of Generic and Content-Specific Instructional Aspects in Physical Education Lessons

Peer reviewed

Direct link

Charalambous, Charalambos Y.; Kyriakides, Ermis; Tsangaridou, Niki; Kyriakides, Leonidas – School Effectiveness and School Improvement, 2017

Heightened accountability pressures and an increased emphasis on teaching quality have directed scholarly attention to scrutinizing instruction, particularly with respect to issues of validity and reliability. However, these attempts have largely been directed toward "core" content areas and investigated generic or content-specific…

Descriptors: Physical Education, Instructional Effectiveness, Lesson Plans, Interrater Reliability

Using Generalizability Theory to Examine Different Concept Map Scoring Methods

Peer reviewed
PDF on ERIC

Download full text

Cetin, Bayram; Guler, Nese; Sarica, Rabia – Eurasian Journal of Educational Research, 2016

Problem Statement: In addition to being teaching tools, concept maps can be used as effective assessment tools. The use of concept maps for assessment has raised the issue of scoring them. Concept maps generated and used in different ways can be scored via various methods. Holistic and relational scoring methods are two of them. Purpose of the…

Descriptors: Generalizability Theory, Concept Mapping, Scoring, Scoring Formulas

An Application of Multivariate Generalizability in Selection of Mathematically Gifted Students

Peer reviewed

Direct link

Kim, Sungyeun; Berebitsky, Dan – EURASIA Journal of Mathematics, Science & Technology Education, 2016

This study investigates error sources and the effects of each error source to determine optimal weights of the composite score of teacher recommendation letters and self-introduction letters using multivariate generalizability theory. Data were collected from the science education institute for the gifted attached to the university located within…

Descriptors: Academically Gifted, Foreign Countries, Mathematics, Mathematics Instruction

Measuring Afterschool Program Quality Using Setting-Level Observational Approaches

Peer reviewed

Direct link

Oh, Yoonkyung; Osgood, D. Wayne; Smith, Emilie P. – Journal of Early Adolescence, 2015

The importance of afterschool hours for youth development is widely acknowledged, and afterschool settings have recently received increasing attention as an important venue for youth interventions, bringing a growing need for reliable and valid measures of afterschool quality. This study examined the extent to which the two observational tools,…

Descriptors: After School Programs, Program Effectiveness, Observation, Rating Scales

Scores Assigned by Inexpert EFL Raters to Different Quality EFL Compositions, and the Raters' Decision-Making Behaviors

Peer reviewed
PDF on ERIC

Download full text

Han, Turgay – International Journal of Progressive Education, 2017

The aim of this study is to examine the variability in and reliability of scores assigned to different quality EFL compositions by EFL instructors and their rating behaviors. Using a mixed research design, quantitative data were collected from EFL instructors' ratings of 30 compositions of three different qualities using a holistic scoring rubric.…

Descriptors: English (Second Language), Writing Evaluation, Scores, Expertise

Does Interest Have an Expiration Date? An Analysis of Students' Questions as Resources for Context-Based Learning

Peer reviewed

Direct link

Swirski, Hani; Baram-Tsabari, Ayelet; Yarden, Anat – International Journal of Science Education, 2018

Context-based approaches can bridge the gap between abstract, difficult science concepts and the world students live in. However, the relevance of specific contexts to different groups of learners, and its stability over time, have not been extensively explored. This study used four datasets, collected in different formal and informal settings, to…

Descriptors: Elementary School Students, Secondary School Students, Student Interests, Learner Engagement

Measurement Quality of the Chinese Early Childhood Program Rating Scale: An Investigation Using Multivariate Generalizability Theory

Peer reviewed

Direct link

Chen, Dezhi; Hu, Bi Ying; Fan, Xitao; Li, Kejian – Journal of Psychoeducational Assessment, 2014

Adapted from the Early Childhood Environment Rating Scale-Revised, the Chinese Early Childhood Program Rating Scale (CECPRS) is a culturally comparable measure for assessing the quality of early childhood education and care programs in the Chinese cultural/social contexts. In this study, 176 kindergarten classrooms were rated with CECPRS on eight…

Descriptors: Foreign Countries, Rating Scales, Early Childhood Education, Educational Environment

Quantifying Error in Survey Measures of School and Classroom Environments

Peer reviewed

Direct link

Schweig, Jonathan David – Applied Measurement in Education, 2014

Developing indicators that reflect important aspects of school and classroom environments has become central in a nationwide effort to develop comprehensive programs that measure teacher quality and effectiveness. Formulating teacher evaluation policy necessitates accurate and reliable methods for measuring these environmental variables. This…

Descriptors: Error of Measurement, Educational Environment, Classroom Environment, Surveys

« Previous Page | Next Page »

Pages: 1 | ... | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | ... | 49

Educational and Psychological…	44
Advances in Health Sciences…	27
Journal of Educational…	24
Applied Measurement in…	23
ProQuest LLC	18
Language Testing	13
Grantee Submission	11
Society for Research on…	11
Psychometrika	10
School Psychology Review	10
Applied Psychological…	9
Educational Measurement:…	9
Online Submission	8
International Journal of…	7
Journal of Educational…	7
Journal of Educational…	7
Multivariate Behavioral…	7
Behavioral Research and…	6
Educational Researcher	6
Educational Sciences: Theory…	6
Journal of Psychoeducational…	6
Measurement and Evaluation in…	6
Review of Educational Research	6
School Psychology Quarterly	6
Assessment for Effective…	5
More ▼

Brennan, Robert L.	18
Lee, Guemin	13
Briesch, Amy M.	11
Clauser, Brian E.	9
Chafouleas, Sandra M.	8
Riley-Tillman, T. Chris	8
Solano-Flores, Guillermo	8
Volpe, Robert J.	8
Christ, Theodore J.	7
Lee, Yong-Won	7
Marcoulides, George A.	7
Shavelson, Richard J.	7
Tindal, Gerald	7
Alonzo, Julie	6
Anderson, Daniel	6
Hagtvet, Knut A.	5
Harik, Polina	5
Miller, M. David	5
Raymond, Mark R.	5
Atilgan, Hakan	4
Chang, Lei	4
Fitzpatrick, Anne R.	4
French, Brian F.	4
Guler, Nese	4
More ▼

Journal Articles	534
Reports - Research	426
Reports - Evaluative	180
Speeches/Meeting Papers	115
Reports - Descriptive	57
Opinion Papers	27
Information Analyses	25
Dissertations/Theses -…	19
Numerical/Quantitative Data	19
Tests/Questionnaires	11
Guides - Non-Classroom	6
Books	5
Collected Works - General	3
Book/Product Reviews	2
Reference Materials -…	2
Reports - General	2
Collected Works - Serials	1
Dissertations/Theses -…	1
Guides - Classroom - Learner	1
Guides - General	1
Non-Print Media	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	6
Program for International…	4
Teacher Performance…	4
Dynamic Indicators of Basic…	3
Trends in International…	3
ACT Assessment	2
Childrens Depression Inventory	2
National Assessment of…	2
National Survey of Student…	2
Progress in International…	2
SAT (College Admission Test)	2
Students Evaluation of…	2
Test of English for…	2
United States Medical…	2
Wechsler Adult Intelligence…	2
Advanced Placement…	1
Battelle Developmental…	1
Behavior Assessment System…	1
Big Five Inventory	1
Classroom Assessment Scoring…	1
Cognitive Abilities Test	1
Conners Teacher Rating Scale	1
Early Childhood Environment…	1
Eating Disorder Inventory	1
Eysenck Personality Inventory	1
More ▼