ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	11

Descriptor

Generalizability Theory	12
Grade 7	9
Reliability	5
Foreign Countries	4
Middle School Students	4
Statistical Analysis	4
Adolescents	3
Difficulty Level	3
Grade 8	3
Secondary School Students	3
Test Items	3
Test Reliability	3
Correlation	2
Error of Measurement	2
Factor Analysis	2
Grade 6	2
High School Students	2
Item Response Theory	2
Longitudinal Studies	2
Low Achievement	2
Mathematics Tests	2
Observation	2
Reading Tests	2
Scores	2
Validity	2
More ▼

Source

Educational and Psychological…	2
Behavioral Research and…	1
Educational Research and…	1
Eurasian Journal of…	1
International Journal of…	1
International Journal of…	1
Journal of Adolescence	1
Journal of Educational and…	1
Journal of Research on…	1
Language Testing	1
School Psychology Quarterly	1
More ▼

Publication Type

Journal Articles	11
Reports - Research	9
Reports - Evaluative	3
Numerical/Quantitative Data	1

Education Level

Grade 7	12
Middle Schools	8
Junior High Schools	7
Secondary Education	7
Elementary Education	5
Grade 8	5
Grade 6	3
Grade 3	2
Grade 5	2
Grade 9	2
High Schools	2
Elementary Secondary Education	1
Grade 10	1
Grade 11	1
Grade 4	1
Intermediate Grades	1
More ▼

Audience

Location

Turkey	2
Turkey (Ankara)	2
Indiana	1
Netherlands	1
New York	1
Texas	1
Turkey (Istanbul)	1

Laws, Policies, & Programs

Assessments and Surveys

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Comparison of G and Phi Coefficients Estimated in Generalizability Theory with Real Cases

Peer reviewed
PDF on ERIC

Download full text

Deniz, Kaan Zulfikar; Ilican, Emel – International Journal of Assessment Tools in Education, 2021

This study aims to compare the G and Phi coefficients as estimated by D studies for a measurement tool with the G and Phi coefficients obtained from real cases in which items of differing difficulty levels were added and also to determine the conditions under which the D studies estimated reliability coefficients closer to reality. The study group…

Descriptors: Generalizability Theory, Test Items, Difficulty Level, Test Reliability

The Implications of Population Changes on Generalization and Study Design

Peer reviewed

Direct link

Chan, Wendy; Oh, Jimin; Luo, Peihao – Journal of Research on Educational Effectiveness, 2021

Findings from experimental studies have increasingly been used to inform policy in school settings. Thus far, the populations in many of these studies are typically defined in a cross-sectional context; namely, the populations are defined in the same academic year in which the study took place or the population is defined at a fixed time point.…

Descriptors: Generalization, Research Design, Demography, Case Studies

An Evaluation of the Answer Key Used in Determining the 7th Grade Students' Levels of Disciplined Mind in Terms of Generalizability Theory

Peer reviewed

Direct link

Guler, Nese – Educational Research and Reviews, 2014

Nowadays, rapid changes in science and technology increase the demand of qualified individuals who have signs of disciplined mind which is hightlighted in Howard Gardner's (2006) five minds as one type of mind. So, it is important to measure whether individuals have disciplined mind or not. Based on this idea, it is aimed to evaluate the…

Descriptors: Answer Keys, Reliability, Grade 7, Generalizability Theory

Using Generalizability Theory to Examine Different Concept Map Scoring Methods

Peer reviewed
PDF on ERIC

Download full text

Cetin, Bayram; Guler, Nese; Sarica, Rabia – Eurasian Journal of Educational Research, 2016

Problem Statement: In addition to being teaching tools, concept maps can be used as effective assessment tools. The use of concept maps for assessment has raised the issue of scoring them. Concept maps generated and used in different ways can be scored via various methods. Holistic and relational scoring methods are two of them. Purpose of the…

Descriptors: Generalizability Theory, Concept Mapping, Scoring, Scoring Formulas

Assessing Reading Comprehension in Adolescent Low Achievers: Subskills Identification and Task Specificity

Peer reviewed

Direct link

van Steensel, Roel; Oostdam, Ron; van Gelderen, Amos – Language Testing, 2013

On the basis of a validation study of a new test for assessing low-achieving adolescents' reading comprehension skills--the SALT-reading--we analyzed two issues relevant to the field of reading test development. Using the test results of 200 seventh graders, we examined the possibility of identifying reading comprehension subskills and the effects…

Descriptors: Adolescents, Low Achievement, Reading Comprehension, Reading Tests

Applying Rasch Model and Generalizability Theory to Study Modified-Angoff Cut Scores

Peer reviewed

Direct link

Arce, Alvaro J.; Wang, Ze – International Journal of Testing, 2012

The traditional approach to scale modified-Angoff cut scores transfers the raw cuts to an existing raw-to-scale score conversion table. Under the traditional approach, cut scores and conversion table raw scores are not only seen as interchangeable but also as originating from a common scaling process. In this article, we propose an alternative…

Descriptors: Generalizability Theory, Item Response Theory, Cutting Scores, Scaling

The Influence of Observation Length on the Dependability of Data

Peer reviewed

Direct link

David Ferguson, Tyler; Briesch, Amy M.; Volpe, Robert J.; Daniels, Brian – School Psychology Quarterly, 2012

Although direct observation is one of the most frequently used assessment methods by school psychologists, studies have shown that the number of observations needed to obtain a dependable estimate of student behavior may be impractical. Because direct observation may be used to inform important decisions about students, it is crucial that data be…

Descriptors: School Psychologists, Observation, Time Perspective, Decision Making

Measuring Test Measurement Error: A General Approach

Peer reviewed

Direct link

Boyd, Donald; Lankford, Hamilton; Loeb, Susanna; Wyckoff, James – Journal of Educational and Behavioral Statistics, 2013

Test-based accountability as well as value-added asessments and much experimental and quasi-experimental research in education rely on achievement tests to measure student skills and knowledge. Yet, we know little regarding fundamental properties of these tests, an important example being the extent of measurement error and its implications for…

Descriptors: Accountability, Educational Research, Educational Testing, Error of Measurement

The Effect of Observation Length and Presentation Order on the Reliability and Validity of an Observational Measure of Teaching Quality

Peer reviewed

Direct link

Mashburn, Andrew J.; Meyer, J. Patrick; Allen, Joseph P.; Pianta, Robert C. – Educational and Psychological Measurement, 2014

Observational methods are increasingly being used in classrooms to evaluate the quality of teaching. Operational procedures for observing teachers are somewhat arbitrary in existing measures and vary across different instruments. To study the effect of different observation procedures on score reliability and validity, we conducted an experimental…

Descriptors: Observation, Teacher Evaluation, Reliability, Validity

Study of the Reliability of CCSS-Aligned Math Measures (2012 Research Version): Grades 6-8. Technical Report #1312

Download full text

Anderson, Daniel; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…

Descriptors: Curriculum Based Assessment, Mathematics Tests, Academic Standards, State Standards

How Many Heads Are Better than One? The Reliability and Validity of Teenagers' Self- and Peer Assessments

Peer reviewed

Direct link

Sung, Yao-Ting; Chang, Kuo-En; Chang, Tzyy-Hua; Yu, Wen-Cheng – Journal of Adolescence, 2010

Self- and peer assessments are becoming more popular in classrooms, but there are few data on the reliability and validity of such assessments performed by school children. Because these factors are greatly affected by the number of raters, we conducted two studies to determine the rating behaviours of teenagers in self- and peer assessments, and…

Descriptors: Generalizability Theory, Peer Evaluation, Validity, Reliability

Physical Self-Concept in Adolescence: Generalizability of a Multidimensional, Hierarchical Model Across Gender and Grade

Peer reviewed

Direct link

Hagger, Martin S.; Biddle, Stuart J. H.; John Wang, C. K. – Educational and Psychological Measurement, 2005

This study tests the generalizability of the factor pattern, structural parameters, and latent mean structure of a multidimensional, hierarchical model of physical self-concept in adolescents across gender and grade. A children's version of the Physical Self-Perception Profile (C-PSPP) was administered to seventh-, eighth- and ninth-grade high…

Descriptors: Self Concept Measures, Self Esteem, Adolescents, Generalizability Theory

Guler, Nese	2
Allen, Joseph P.	1
Alonzo, Julie	1
Anderson, Daniel	1
Arce, Alvaro J.	1
Biddle, Stuart J. H.	1
Boyd, Donald	1
Briesch, Amy M.	1
Cetin, Bayram	1
Chan, Wendy	1
Chang, Kuo-En	1
Chang, Tzyy-Hua	1
Daniels, Brian	1
David Ferguson, Tyler	1
Deniz, Kaan Zulfikar	1
Hagger, Martin S.	1
Ilican, Emel	1
John Wang, C. K.	1
Lankford, Hamilton	1
Loeb, Susanna	1
Luo, Peihao	1
Mashburn, Andrew J.	1
Meyer, J. Patrick	1
Oh, Jimin	1
Oostdam, Ron	1
More ▼