
Sun, Anji; Valiga, Michael J.; Gao, Xiaohong – Journal of Experimental Education, 1997
Results of a study of the reliability of student ratings of academic advising from 15 postsecondary institutions show that generalizability theory is an appropriate and accurate way to estimate reliability. Advantages and disadvantages of the generalizability theory approach in comparison with coefficient alpha are discussed. (SLD)
Descriptors: Academic Advising, College Faculty, College Students, Estimation (Mathematics)

Floden, Robert E. – Journal of Educational Statistics, 1991
This commentary focuses on the application of D. Rogosa and G. Ghandour's work to observational research on classroom processes. Rogosa and Ghandour have shown that the short length of an observation is typically the dominant source of error. Investigators should conduct observations for as long as possible. (SLD)
Descriptors: Behavior Patterns, Behavioral Science Research, Classroom Observation Techniques, Elementary Secondary Education

Hagger, Martin S.; Biddle, Stuart J. H.; John Wang, C. K. – Educational and Psychological Measurement, 2005
This study tests the generalizability of the factor pattern, structural parameters, and latent mean structure of a multidimensional, hierarchical model of physical self-concept in adolescents across gender and grade. A children's version of the Physical Self-Perception Profile (C-PSPP) was administered to seventh-, eighth- and ninth-grade high…
Descriptors: Self Concept Measures, Self Esteem, Adolescents, Generalizability Theory

Hafner, John C.; Hafner, Patti M. – International Journal of Science Education, 2003
Although the rubric has emerged as one of the most popular assessment tools in progressive educational programs, there is an unfortunate dearth of information in the literature quantifying the actual effectiveness of the rubric as an assessment tool "in the hands of the students." This study focuses on the validity and reliability of the rubric as…
Descriptors: Interrater Reliability, Generalizability Theory, Biology, Scoring Rubrics

Bunch, Michael B.; Littlefair, Wendy – 1988
A total of 2,000 essays written by 1,000 students was submitted to generalizability analyses for domain-referenced tests. Each student had written one essay on each of two prompts representing two models of discourse. Each essay was read by six readers and judged on a scale from 1 to 4. No reader read essays from both prompts. Reader agreement…
Descriptors: Cutting Scores, Essay Tests, Generalizability Theory, Interrater Reliability

Meskauskas, John A. – Evaluation and the Health Professions, 1986
Two new indices of stability of content-referenced standard-setting results are presented, relating variability of judges' decisions to the variability of candidate scores and to the reliability of the test. These indices are used to indicate whether scores resulting from a standard-setting study are of sufficient precision. (Author/LMO)
Descriptors: Certification, Credentials, Error of Measurement, Generalizability Theory

Conger, Anthony J.; And Others – Educational and Psychological Measurement, 1983
An investigation of the Conners' Teacher Rating Scale-Revised hyperactivity scale found that the referents for teacher ratings should be determined, teachers' ratings should be made more objective, standardization across teachers should be demonstrated before norms are preferred, and the rating scale should be validated via observations or other…
Descriptors: Behavior Rating Scales, Classroom Observation Techniques, Generalizability Theory, Hyperactivity

Hopkins, Kenneth D. – American Educational Research Journal, 1984
In behavior research using cognitive and affective measures, there is often incongruity between the statistical analysis employed and the intended inference. This paper argues that incorporating items as levels of a random facet via generalizability theory allows the statistical examination of the inferential question in the desired universe of…
Descriptors: Affective Measures, Analysis of Variance, Behavioral Science Research, Cognitive Measurement

Lee, Yong-Won; Kantor, Robert; Mollaun, Pam – 2002
This study examines the score dependability of writing and speaking assessments from the Test of English as a Foreign Language (TOEFL) from the perspectives of univariate and multivariate generalizability theory (G-theory) and presents the findings of three separate G-theory studies. For writing, the focus was on evaluating the impact on…
Descriptors: Ability, English (Second Language), Generalizability Theory, Item Bias

Temple, Linda; Lips, Hilary M. – Educational Research Quarterly, 1989
The Collis Attitudes toward Computers Survey, developed by B. Collis (1984) for a secondary school population, was tested with 305 college students attending the University of Winnipeg (Canada). Results support the reliability and generalizability of the survey and identify one dominant factor: personal interest and enjoyment of computers. (SLD)
Descriptors: Attitude Measures, College Students, Computer Literacy, Factor Analysis

Crowley, Susan L.; And Others – Educational and Psychological Measurement, 1994
Dependability of the Children's Depression Inventory (CDI) was studied using both generalizability and classical test score analyses with a sample of 164 elementary school students. Results suggest that sources of error variance interact to decrease dependability of CDI scores. Depression in children might be better assessed through multiple…
Descriptors: Children, Clinical Diagnosis, Comparative Analysis, Depression (Psychology)

Fehrmann, Melinda L.; And Others – Educational and Psychological Measurement, 1991
Two frame-of-reference rater training approaches were compared for effects on reliability and accuracy of cutoff scores generated by 21 raters using Angoff methods on tests taken by 155 undergraduates. Both approaches result in higher interrater reliability and more accuracy than does a non-frame-of-reference method. (SLD)
Descriptors: Cutting Scores, Evaluators, Generalizability Theory, Higher Education

Munthe, Elaine – Scandinavian Journal of Educational Research, 2001
Studied teacher certainty in 1,153 Norwegian teachers in elementary and junior high schools using confirmatory factor analysis and generalizability theory. Results indicate a good fit for a model that operationalizes teacher certainty as a second order latent variable with three first order latent variables: teacher's perceived didactic certainty,…
Descriptors: Elementary Education, Elementary School Teachers, Factor Structure, Foreign Countries

Using a Longitudinal Database to Assess the Validity of Preceptors' Ratings of Clerkship Performance
Ferguson, Kristi J.; Kreiter, Clarence D. – Advances in Health Sciences Education, 2004
Purpose: To examine the validity of using scores from a clinical evaluation form as an assessment of clinical competence. Method: Investigators collected a longitudinal clinical skills assessment database that included scores reflecting performance on standardized patient interactions, case-based learning performance, scores on multiple-choice…
Descriptors: Generalizability Theory, Medical Students, Validity, Program Effectiveness

Sudweeks, Richard R.; Reeve, Suzanne; Bradshaw, William S. – Assessing Writing, 2004
A pilot study was conducted to evaluate and improve the rating procedure proposed for use in a research effort designed to assess the essay writing ability of college sophomores. Generalizability theory and the Many-Facet Rasch Model were each used to (a) estimate potential sources of error in the rating, (b) obtain reliability estimates, and…
Descriptors: Generalizability Theory, College Students, Writing Ability, Writing Evaluation