ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	7
Since 2006 (last 20 years)	19

Descriptor

Generalizability Theory	57
Performance Based Assessment	57
Interrater Reliability	20
Test Reliability	16
Error of Measurement	15
Scores	14
Test Construction	14
Evaluation Methods	13
Reliability	13
Educational Assessment	12
Test Validity	12
Scoring	9
Student Evaluation	9
Data Analysis	6
Models	6
Psychometrics	6
Sampling	6
Validity	6
Estimation (Mathematics)	5
Foreign Countries	5
Higher Education	5
Standards	5
Writing Tests	5
Decision Making	4
Elementary School Students	4
More ▼

Publication Type

Journal Articles	33
Reports - Research	31
Reports - Evaluative	18
Speeches/Meeting Papers	16
Information Analyses	3
Reports - Descriptive	3
Book/Product Reviews	1
Dissertations/Theses -…	1
Numerical/Quantitative Data	1
Reference Materials -…	1
Reports - General	1
Tests/Questionnaires	1
More ▼

Education Level

Higher Education	8
Postsecondary Education	4
Adult Education	1
Grade 10	1
Grade 8	1
Junior High Schools	1
Middle Schools	1
Secondary Education	1

Audience

Practitioners

Location

United States	2
California (Los Angeles)	1
Canada	1
China (Beijing)	1
Colorado	1
Japan	1
Massachusetts	1
Oklahoma	1
South Korea	1
Turkey (Ankara)	1

Laws, Policies, & Programs

Assessments and Surveys

Teacher Performance…	1
Texas Assessment of Academic…	1
United States Medical…	1
edTPA (Teacher Performance…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 57 results Save | Export

Examining the Reliability of Scores from a Performance Assessment of Practice-Based Competencies

Peer reviewed

Direct link

Roduta Roberts, Mary; Alves, Cecilia Brito; Werther, Karin; Bahry, Louise M. – Journal of Psychoeducational Assessment, 2019

The purpose of this study was to examine the reliability and sources of score variation from a performance assessment of practice competencies within an occupational therapy program. Data from 99 students who participated in a practical exam were examined. A generalizability analysis of analytic, total, and overall holistic scores was completed…

Descriptors: Performance Based Assessment, Test Reliability, Scores, Occupational Therapy

Using Teaching Performance Assessments for Program Evaluation and Improvement in Teacher Education. Evaluating and Improving Teacher Preparation Programs

Download full text

Peck, Charles A.; Young, Maia Goodman; Zhang, Wenqi – National Academy of Education, 2021

In this paper the authors examine the uses of teaching performance assessments (TPAs) as resources for learning, program evaluation, and improvement in teacher education. The authors begin by outlining their conceptual framing and related research questions about the uses of TPAs as resources for program evaluation and improvement. They describe…

Descriptors: Performance Based Assessment, Preservice Teachers, Teacher Evaluation, Program Evaluation

Using Generalizability Theory to Assess the Score Reliability of Communication Skills of Dentistry Students

Peer reviewed
PDF on ERIC

Download full text

Uzun, N. Bilge; Aktas, Mehtap; Asiret, Semih; Yormaz, Seha – Asian Journal of Education and Training, 2018

The goal of this study is to determine the reliability of the performance points of dentistry students regarding communication skills and to examine the scoring reliability by generalizability theory in balanced random and fixed facet (mixed design) data, considering also the interactions of student, rater and duty. The study group of the research…

Descriptors: Foreign Countries, Generalizability Theory, Scores, Test Reliability

Developing Situated Measures of Science Instruction through an Innovative Electronic Portfolio App for Mobile Devices: Reliability, Validity, and Feasibility

Peer reviewed

Direct link

Martínez, José Felipe; Kloser, Matt; Srinivasan, Jayashri; Stecher, Brian; Edelman, Amanda – Educational and Psychological Measurement, 2022

Adoption of new instructional standards in science demands high-quality information about classroom practice. Teacher portfolios can be used to assess instructional practice and support teacher self-reflection anchored in authentic evidence from classrooms. This study investigated a new type of electronic portfolio tool that allows efficient…

Descriptors: Science Instruction, Academic Standards, Instructional Innovation, Electronic Publishing

Working with Sparse Data in Rated Language Tests: Generalizability Theory Applications

Peer reviewed

Direct link

Lin, Chih-Kai – Language Testing, 2017

Sparse-rated data are common in operational performance-based language tests, as an inevitable result of assigning examinee responses to a fraction of available raters. The current study investigates the precision of two generalizability-theory methods (i.e., the rating method and the subdividing method) specifically designed to accommodate the…

Descriptors: Data Analysis, Language Tests, Generalizability Theory, Accuracy

Evaluating Score and Decision Consistency across Claims in a Validation Argument

Peer reviewed

Direct link

Schmidgall, Jonathan – Applied Measurement in Education, 2017

This study utilizes an argument-based approach to validation to examine the implications of reliability in order to further differentiate the concepts of score and decision consistency. In a methodological example, the framework of generalizability theory was used to estimate appropriate indices of score consistency and evaluations of the…

Descriptors: Scores, Reliability, Validity, Generalizability Theory

Investigating Score Dependability in English/Chinese Interpreter Certification Performance Testing: A Generalizability Theory Approach

Peer reviewed

Direct link

Han, Chao – Language Assessment Quarterly, 2016

As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…

Descriptors: Foreign Countries, Scores, English, Chinese

Psychometric Analysis of the Thermochemistry Concept Inventory

Peer reviewed

Direct link

Wren, David; Barbera, Jack – Chemistry Education Research and Practice, 2014

Assessing conceptual understanding of foundational topics before instruction on higher-order concepts can provide chemical educators with information to aid instructional design. This study provides an instrument that can be used to identify students' alternative conceptions regarding thermochemistry concepts. The Thermochemistry Concept Inventory…

Descriptors: Psychometrics, Thermodynamics, Chemistry, Item Response Theory

Using Multivariate Generalizability Theory to Assess the Effect of Content Stratification on the Reliability of a Performance Assessment

Peer reviewed

Direct link

Keller, Lisa A.; Clauser, Brian E.; Swanson, David B. – Advances in Health Sciences Education, 2010

In recent years, demand for performance assessments has continued to grow. However, performance assessments are notorious for lower reliability, and in particular, low reliability resulting from task specificity. Since reliability analyses typically treat the performance tasks as randomly sampled from an infinite universe of tasks, these estimates…

Descriptors: Generalizability Theory, Test Reliability, Performance Based Assessment, Error of Measurement

What Different Kinds of Stratification Can Reveal about the Generalizability of Data-Mined Skill Assessment Models

Peer reviewed
PDF on ERIC

Download full text

Sao Pedro, Michael A.; Baker, Ryan S. J. d.; Gobert, Janice D. – Grantee Submission, 2013

When validating assessment models built with data mining, generalization is typically tested at the student-level, where models are tested on new students. This approach, though, may fail to find cases where model performance suffers if other aspects of those cases relevant to prediction are not well represented. We explore this here by testing if…

Descriptors: Educational Research, Data Collection, Data Analysis, Generalizability Theory

The Impact of Statistically Adjusting for Rater Effects on Conditional Standard Errors of Performance Ratings

Peer reviewed

Direct link

Raymond, Mark R.; Harik, Polina; Clauser, Brian E. – Applied Psychological Measurement, 2011

Prior research indicates that the overall reliability of performance ratings can be improved by using ordinary least squares (OLS) regression to adjust for rater effects. The present investigation extends previous work by evaluating the impact of OLS adjustment on standard errors of measurement ("SEM") at specific score levels. In…

Descriptors: Performance Based Assessment, Licensing Examinations (Professions), Least Squares Statistics, Item Response Theory

Confidence Bounds and Power for the Reliability of Observational Measures on the Quality of a Social Setting

Peer reviewed

Direct link

Shin, Yongyun; Raudenbush, Stephen W. – Psychometrika, 2012

Social scientists are frequently interested in assessing the qualities of social settings such as classrooms, schools, neighborhoods, or day care centers. The most common procedure requires observers to rate social interactions within these settings on multiple items and then to combine the item responses to obtain a summary measure of setting…

Descriptors: Generalizability Theory, Neighborhoods, Intervals, Child Care Centers

The Effect of Raters and Rating Conditions on the Reliability of the Missionary Teaching Assessment

Direct link

Ure, Abigail C. – ProQuest LLC, 2011

This study investigated how 2 different rating conditions, the controlled rating condition (CRC) and the uncontrolled rating condition (URC), effected rater behavior and the reliability of a performance assessment (PA) known as the Missionary Teaching Assessment (MTA). The CRC gives raters the capability to manipulate (pause, rewind, fast-forward)…

Descriptors: Teacher Evaluation, Performance Based Assessment, Performance Tests, Generalizability Theory

An Examination of Rater Drift within a Generalizability Theory Framework

Peer reviewed

Direct link

Harik, Polina; Clauser, Brian E.; Grabovsky, Irina; Nungester, Ronald J.; Swanson, Dave; Nandakumar, Ratna – Journal of Educational Measurement, 2009

The present study examined the long-term usefulness of estimated parameters used to adjust the scores from a performance assessment to account for differences in rater stringency. Ratings from four components of the USMLE[R] Step 2 Clinical Skills Examination data were analyzed. A generalizability-theory framework was used to examine the extent to…

Descriptors: Generalizability Theory, Performance Based Assessment, Performance Tests, Clinical Experience

Generalizability Theory Applied to Reading Assessments for Students with Significant Cognitive Disabilities

Peer reviewed

Direct link

Tindal, Gerald; Yovanoff, Paul; Geller, Josh P. – Journal of Special Education, 2010

Students with significant disabilities must participate in large-scale assessments, often using an alternate assessment judged against alternate achievement standards. The development and administration of this type of assessment must necessarily balance meaningful participation with accurate measurement. In this study, generalizability theory is…

Descriptors: Generalizability Theory, Alternative Assessment, Disabilities, Severe Mental Retardation

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4

Journal of Educational…	7
Applied Measurement in…	3
Applied Psychological…	3
Educational and Psychological…	3
Educational Measurement:…	2
Language Testing	2
Advances in Health Sciences…	1
Advances in Physiology…	1
Alberta Journal of…	1
Asian Journal of Education…	1
Chemistry Education Research…	1
Educational Researcher	1
Evaluation and Program…	1
Grantee Submission	1
Journal of Outcome Measurement	1
Journal of Psychoeducational…	1
Journal of Special Education	1
Language Assessment Quarterly	1
National Academy of Education	1
National Center for Research…	1
Pearson	1
ProQuest LLC	1
Psychometrika	1
Research & Practice in…	1
More ▼

Brennan, Robert L.	5
Clauser, Brian E.	5
Harik, Polina	4
Linn, Robert L.	3
Shavelson, Richard J.	3
Hambleton, Ronald K.	2
Jiang, Ying Hong	2
Ruiz-Primo, Maria Araceli	2
Abedi, Jamal	1
Aktas, Mehtap	1
Alves, Cecilia Brito	1
Asiret, Semih	1
Bahry, Louise M.	1
Baker, Eva L.	1
Baker, Ryan S. J. d.	1
Barbera, Jack	1
Betebenner, Damian W.	1
Burton, Elizabeth	1
Chen, Eva	1
Chiu, Chris W. T.	1
Clyman, Stephen G.	1
Crehan, Kevin D.	1
Cronbach, Lee J.	1
Edelman, Amanda	1
More ▼