ERIC - Search Results

Publication Date

In 2025	2
Since 2024	15
Since 2021 (last 5 years)	68
Since 2016 (last 10 years)	171
Since 2006 (last 20 years)	439

Descriptor

Generalizability Theory	728
Reliability	168
Scores	146
Error of Measurement	133
Test Reliability	125
Interrater Reliability	120
Foreign Countries	102
Statistical Analysis	85
Evaluation Methods	82
Psychometrics	75
Research Methodology	67
Validity	66
Test Validity	64
Models	62
Comparative Analysis	59
Correlation	59
Higher Education	59
Scoring	59
Item Response Theory	57
Performance Based Assessment	57
Research Design	57
Test Items	54
Test Construction	49
Elementary School Students	48
Test Theory	47
More ▼

Education Level

Higher Education	115
Postsecondary Education	68
Elementary Education	59
Secondary Education	42
Middle Schools	33
Elementary Secondary Education	29
Early Childhood Education	24
Junior High Schools	22
Grade 8	17
Grade 3	15
Preschool Education	15
Grade 4	14
Grade 5	13
Primary Education	13
Grade 7	12
High Schools	12
Intermediate Grades	11
Adult Education	10
Grade 6	7
Kindergarten	7
Grade 10	6
Grade 9	6
Grade 1	4
Grade 2	4
Two Year Colleges	3
More ▼

Audience

Researchers	28
Practitioners	2
Policymakers	1
Students	1

Location

Turkey	14
Canada	10
United States	10
California	9
Netherlands	9
Australia	6
Germany	6
South Korea	6
Iowa	5
Norway	5
Turkey (Ankara)	5
United Kingdom	5
Florida	4
South Africa	4
Tennessee	4
China	3
Hong Kong	3
Indiana	3
Japan	3
North Carolina	3
Texas	3
Alabama	2
China (Beijing)	2
Colorado	2
Cyprus	2
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 301 to 315 of 728 results Save | Export

Conceptualizing Group Dynamics from Our Clients' Perspective: Development of the Conceptualization of Group Dynamics Inventory

Peer reviewed

Direct link

Tate, Kevin A.; Rivera, Edil Torres; Conwill, William L.; Miller, M. David; Puig, Ana – Journal for Specialists in Group Work, 2013

There is a clear call in group counseling practice and training for evidence-based practice (ACA, 2005; ASGW, 2008; CACREP, 2009). At the same time, group counselors also are asked to keep clients' experience at the center of their work (ASGW, 2012). This article outlines the authors' effort to develop and study an instrument designed to measure…

Descriptors: Evidence, Group Dynamics, Construct Validity, Group Counseling

Effects of Multimedia Vocabulary Annotations on Vocabulary Learning and Text Comprehension in ESP Classrooms

Peer reviewed
PDF on ERIC

Download full text

Lin, Huifen – Research-publishing.net, 2012

For the past few decades, instructional materials enriched with multimedia elements have enjoyed increasing popularity. Multimedia-based instruction incorporating stimulating visuals, authentic audios, and interactive animated graphs of different kinds all provide additional and valuable opportunities for students to learn beyond what conventional…

Descriptors: Multimedia Materials, Multimedia Instruction, Vocabulary Development, Reading Comprehension

Development of a Valid and Reliable Student-Achievement and Process-Skills Instrument

Peer reviewed

Direct link

Bunce, Diane M.; VandenPlas, Jessica R.; Neiles, Kelly Y.; Flens, Elizabeth A. – Journal of College Science Teaching, 2010

Development of a research instrument to measure student achievement requires planning and reliability and validity testing before the instrument is used to collect data. These steps are often overlooked in research studies, but when the instrument is to be used across a wider population, the inclusion of these steps is vital to address the…

Descriptors: Academic Achievement, Measures (Individuals), Science Process Skills, Test Reliability

The Impact of Statistical Adjustment on Conditional Standard Errors of Measurement in the Assessment of Physician Communication Skills

Peer reviewed

Direct link

Raymond, Mark R.; Clauser, Brian E.; Furman, Gail E. – Advances in Health Sciences Education, 2010

The use of standardized patients to assess communication skills is now an essential part of assessing a physician's readiness for practice. To improve the reliability of communication scores, it has become increasingly common in recent years to use statistical models to adjust ratings provided by standardized patients. This study employed ordinary…

Descriptors: Generalizability Theory, Physicians, Patients, Least Squares Statistics

The Other Side of Method Bias: The Perils of Distinct Source Research Designs

Peer reviewed

Direct link

Kammeyer-Mueller, John; Steel, Piers D. G.; Rubenstein, Alex – Multivariate Behavioral Research, 2010

Common source bias has been the focus of much attention. To minimize the problem, researchers have sometimes been advised to take measurements of predictors from one observer and measurements of outcomes from another observer or to use separate occasions of measurement. We propose that these efforts to eliminate biases due to common source…

Descriptors: Statistical Bias, Predictor Variables, Measurement, Data Collection

Measuring Test Measurement Error: A General Approach

Peer reviewed

Direct link

Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013

Test-based accountability as well as value-added asessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…

Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement

Examining Interrater Agreement Analyses of a Pilot Special Education Observation Tool

Peer reviewed
PDF on ERIC

Download full text

Johnson, Evelyn S.; Semmelroth, Carrie L. – Journal of Special Education Apprenticeship, 2012

This paper reports the results of interrater agreement analyses on a pilot special education teacher evaluation instrument, the Recognizing Effective Special Education Teachers (RESET) Observation Tool (OT). Using evidence-based instructional practices as the basis for the evaluation, the RESET OT is designed for the spectrum of different…

Descriptors: Interrater Reliability, Pilot Projects, Special Education, Special Education Teachers

The Effect of Observation Length and Presentation Order on the Reliability and Validity of an Observational Measure of Teaching Quality

Peer reviewed

Direct link

Mashburn, Andrew J.; Meyer, J. Patrick; Allen, Joseph P.; Pianta, Robert C. – Educational and Psychological Measurement, 2014

Observational methods are increasingly being used in classrooms to evaluate the quality of teaching. Operational procedures for observing teachers are somewhat arbitrary in existing measures and vary across different instruments. To study the effect of different observation procedures on score reliability and validity, we conducted an experimental…

Descriptors: Observation, Teacher Evaluation, Reliability, Validity

Estimating Reliability of School-Level Scores Using Multilevel and Generalizability Theory Models

Peer reviewed

Direct link

Jeon, Min-Jeong; Lee, Guemin; Hwang, Jeong-Won; Kang, Sang-Jin – Asia Pacific Education Review, 2009

The purpose of this study was to investigate the methods of estimating the reliability of school-level scores using generalizability theory and multilevel models. Two approaches, "student within schools" and "students within schools and subject areas," were conceptualized and implemented in this study. Four methods resulting from the combination…

Descriptors: Generalizability Theory, Scores, Reliability, Statistical Analysis

The Effect of Raters and Rating Conditions on the Reliability of the Missionary Teaching Assessment

Direct link

Ure, Abigail C. – ProQuest LLC, 2011

This study investigated how 2 different rating conditions, the controlled rating condition (CRC) and the uncontrolled rating condition (URC), effected rater behavior and the reliability of a performance assessment (PA) known as the Missionary Teaching Assessment (MTA). The CRC gives raters the capability to manipulate (pause, rewind, fast-forward)…

Descriptors: Teacher Evaluation, Performance Based Assessment, Performance Tests, Generalizability Theory

Establishing Open-Ended Assessments: Investigating the Validity of Creative Exercises

Peer reviewed

Direct link

Lewis, Scott E.; Shaw, Janet L.; Freeman, Kathryn A. – Chemistry Education Research and Practice, 2011

Open-ended assessments, defined as assessments with a large set of possible correct answers, by nature lend themselves to concerns regarding accurate and consistent grading. This article describes one particular open-ended assessment, named Creative Exercises (CE), designed for promoting students' interconnection of concepts in a college general…

Descriptors: Evidence, Concept Mapping, Knowledge Level, Chemistry

Study of the Reliability of CCSS-Aligned Math Measures (2012 Research Version): Grades 6-8. Technical Report #1312

Download full text

Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…

Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards

Multigroup Generalizability Analysis of Verbal, Quantitative, and Nonverbal Ability Tests for Culturally and Linguistically Diverse Students

Peer reviewed

Direct link

Lakin, Joni M.; Lai, Emily R. – Educational and Psychological Measurement, 2012

For educators seeking to differentiate instruction, cognitive ability tests sampling multiple content domains, including verbal, quantitative, and nonverbal reasoning, provide superior information about student strengths and weaknesses compared with unidimensional reasoning measures. However, these ability tests have not been fully evaluated with…

Descriptors: Aptitude Tests, Nonverbal Ability, Cognitive Ability, Verbal Ability

Direct Behavior Rating (DBR): Generalizability and Dependability across Raters and Observations

Peer reviewed

Direct link

Christ, Theodore J.; Riley-Tillman, T. Chris; Chafouleas, Sandra M.; Boice, Christina H. – Educational and Psychological Measurement, 2010

Generalizability theory was used to examine the generalizability and dependability of outcomes from two single-item Direct Behavior Rating (DBR) scales: DBR of actively manipulating and DBR of visually distracted. DBR is a behavioral assessment tool with specific instrumentation and procedures that can be used by a variety of service delivery…

Descriptors: Generalizability Theory, Student Behavior, Data Collection, Student Evaluation

Generalizability Theory as Evidence of Concerns about Fairness in Large-Scale ESL Writing Assessments

Peer reviewed

Direct link

Huang, Jinyan – TESOL Journal, 2011

Using generalizability theory, this study examined both the rating variability and reliability of English as a second language (ESL) students' writing in two provincial examinations in Canada. This article discusses expected and unexpected similarities and differences related to rating variability and reliability between the two testing programs.…

Descriptors: Foreign Countries, Generalizability Theory, Test Reliability, Testing Programs

« Previous Page | Next Page »

Pages: 1 | ... | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | ... | 49

Educational and Psychological…	44
Advances in Health Sciences…	27
Journal of Educational…	24
Applied Measurement in…	23
ProQuest LLC	18
Language Testing	13
Grantee Submission	11
Society for Research on…	11
Psychometrika	10
School Psychology Review	10
Applied Psychological…	9
Educational Measurement:…	9
Online Submission	8
International Journal of…	7
Journal of Educational…	7
Journal of Educational…	7
Multivariate Behavioral…	7
Behavioral Research and…	6
Educational Researcher	6
Educational Sciences: Theory…	6
Journal of Psychoeducational…	6
Measurement and Evaluation in…	6
Review of Educational Research	6
School Psychology Quarterly	6
Assessment for Effective…	5
More ▼

Journal Articles	534
Reports - Research	426
Reports - Evaluative	180
Speeches/Meeting Papers	115
Reports - Descriptive	57
Opinion Papers	27
Information Analyses	25
Dissertations/Theses -…	19
Numerical/Quantitative Data	19
Tests/Questionnaires	11
Guides - Non-Classroom	6
Books	5
Collected Works - General	3
Book/Product Reviews	2
Reference Materials -…	2
Reports - General	2
Collected Works - Serials	1
Dissertations/Theses -…	1
Guides - Classroom - Learner	1
Guides - General	1
Non-Print Media	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	6
Program for International…	4
Teacher Performance…	4
Dynamic Indicators of Basic…	3
Trends in International…	3
ACT Assessment	2
Childrens Depression Inventory	2
National Assessment of…	2
National Survey of Student…	2
Progress in International…	2
SAT (College Admission Test)	2
Students Evaluation of…	2
Test of English for…	2
United States Medical…	2
Wechsler Adult Intelligence…	2
Advanced Placement…	1
Battelle Developmental…	1
Behavior Assessment System…	1
Big Five Inventory	1
Classroom Assessment Scoring…	1
Cognitive Abilities Test	1
Conners Teacher Rating Scale	1
Early Childhood Environment…	1
Eating Disorder Inventory	1
Eysenck Personality Inventory	1
More ▼

Brennan, Robert L.	18
Lee, Guemin	13
Briesch, Amy M.	11
Clauser, Brian E.	9
Chafouleas, Sandra M.	8
Riley-Tillman, T. Chris	8
Solano-Flores, Guillermo	8
Volpe, Robert J.	8
Christ, Theodore J.	7
Lee, Yong-Won	7
Marcoulides, George A.	7
Shavelson, Richard J.	7
Tindal, Gerald	7
Alonzo, Julie	6
Anderson, Daniel	6
Hagtvet, Knut A.	5
Harik, Polina	5
Miller, M. David	5
Raymond, Mark R.	5
Atilgan, Hakan	4
Chang, Lei	4
Fitzpatrick, Anne R.	4
French, Brian F.	4
Guler, Nese	4
More ▼