Publication Date
In 2025 | 2 |
Since 2024 | 15 |
Since 2021 (last 5 years) | 68 |
Since 2016 (last 10 years) | 171 |
Since 2006 (last 20 years) | 439 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 28 |
Practitioners | 2 |
Policymakers | 1 |
Students | 1 |
Location
Turkey | 14 |
Canada | 10 |
United States | 10 |
California | 9 |
Netherlands | 9 |
Australia | 6 |
Germany | 6 |
South Korea | 6 |
Iowa | 5 |
Norway | 5 |
Turkey (Ankara) | 5 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 2 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Goetz, Thomas; Hall, Nathan C.; Frenzel, Anne C.; Pekrun, Reinhard – Learning and Instruction, 2006
The focus of the present study is on students' experiences of enjoyment, an emotion largely neglected in educational research. We present a model in which specific levels of generalization of the construct of enjoyment are differentiated. Based on their extent of generalization, these differentiated constructs of enjoyment are located in a…
Descriptors: Student Experience, Learning Strategies, Structural Equation Models, Correlation
Clauser, Brian E.; Harik, Polina; Margolis, Melissa J. – Journal of Educational Measurement, 2006
Although multivariate generalizability theory was developed more than 30 years ago, little published research utilizing this framework exists and most of what does exist examines tests built from tables of specifications. In this context, it is assumed that the universe scores from levels of the fixed multivariate facet will be correlated, but the…
Descriptors: Multivariate Analysis, Job Skills, Correlation, Test Items
Linacre, John M. – 1993
Generalizability theory (G-theory) and many-facet Rasch measurement (Rasch) manage the variability inherent when raters rate examinees on test items. The purpose of G-theory is to estimate test reliability in a raw score metric. Unadjusted examinee raw scores are reported as measures. A variance component is estimated for the examinee…
Descriptors: Comparative Analysis, Equations (Mathematics), Estimation (Mathematics), Evaluators
Lee, Yong-Won; Kantor, Robert – ETS Research Report Series, 2005
Possible integrated and independent tasks were pilot tested for the writing section of a new generation of TOEFL® (Test of English as a Foreign Language™) examination. This study examines the impact of various rating designs as well as the impact of the number of tasks and raters on the reliability of writing scores based on integrated and…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Writing Tests
Erwin, T. Dary – 1988
Rating scales are a typical method for evaluating a student's performance in outcomes assessment. The analysis of the quality of information from rating scales poses special measurement problems when researchers work with faculty in their development. Generalizability measurement theory offers a set of techniques for estimating errors or…
Descriptors: Educational Assessment, Generalizability Theory, Higher Education, Institutional Research

Bickman, Leonard – Evaluation Review, 1985
An evaluation system which would describe and assess statewide services for preschool children is described. Component theory conceptualizes the unit of analysis for evaluation as the component. This approach increases the generalizability and utilization of evaluations and enhances the ability to evaluate several programs at the state level.…
Descriptors: Early Childhood Education, Evaluation Methods, Evaluation Utilization, Formative Evaluation

Butler, Richard P.; McCauley, Clark – Journal of Educational Psychology, 1987
Compared with data from civilian institutions, data from two graduating classes at the United States Military Academy showed extrordinary stability of independently calculated grade point averages from freshman to senior years and no decline in the validity of Scholastic Aptitude Tests and high school class rank as predictors of these GPAs over…
Descriptors: Class Rank, Correlation, Generalizability Theory, Grade Point Average

McDonald, Roderick P. – Psychometrika, 1986
There is a unity underlying the diversity of models for the analysis of multivariate data. Essentially, they constitute a family of models, most generally nonlinear, for structural/functional relations between variables drawn from a behavior domain. (Author)
Descriptors: Factor Analysis, Generalizability Theory, Latent Trait Theory, Mathematical Models

Schaeffer, Gary A.; And Others – Evaluation Review, 1986
The reliability of criterion-referenced tests (CRTs) used in health program evaluation can be conceptualized in different ways. Formulas are presented for estimating appropriate standard error of measurement (SEM) for CRTs. The SEM can be used in computing confidence intervals for domain score estimates and for a cut-score. (Author/LMO)
Descriptors: Accountability, Criterion Referenced Tests, Cutting Scores, Error of Measurement

Gresham, Frank M. – School Psychology Review, 1984
The evidence for the psychometric adequacy of behavioral interviews in terms of traditional psychometric theory and generalizability theory are reviewed. The review resulted in the conclusion that behavioral interviews have some evidence for interrater reliability, content validity, and criterion-related validity. Additional research in several…
Descriptors: Behavior Patterns, Behavior Problems, Functional Behavioral Assessment, Generalizability Theory
Sun, Anji; Valiga, Michael J. – 1997
In this study, the reliability of the American College Testing (ACT) Program's "Survey of Academic Advising" (SAA) was examined using both univariate and multivariate generalizability theory approaches. The primary purpose of the study was to compare the results of three generalizability theory models (a random univariate model, a mixed…
Descriptors: Academic Advising, Colleges, Faculty Advisers, Generalizability Theory
Crehan, Kevin D. – 1997
Writing fits well within the realm of outcomes suitable for observation by performance assessments. Studies of the reliability of performance assessments have suggested that interrater reliability can be consistently high. Scoring consistency, however, is only one aspect of quality in decisions based on assessment results. Another is…
Descriptors: Evaluation Methods, Feedback, Generalizability Theory, Interrater Reliability

Cronbach, Lee J.; And Others – Educational and Psychological Measurement, 1997
Through the standard error, rather than a reliability coefficient, generalizability theory provides an indicator of the uncertainty attached to school and individual scores on performance assessments. Recommendations are made to apply generalizability theory to current performance assessments, emphasizing practices that differ from usual…
Descriptors: Academic Achievement, Error of Measurement, Generalizability Theory, Performance Based Assessment

Christensen, John O. – Journal of Library Administration, 1988
Description of common errors found in the statistical methodologies of research carried out by librarians, focuses on sampling and generalizability. The discussion covers the need to either adapt library research to the statistical abilities of librarians or to educate librarians in the proper use of statistics. (15 references) (CLB)
Descriptors: Educational Needs, Generalizability Theory, Higher Education, Library Education

Marcoulides, George A. – Journal of Educational Statistics, 1993
A methodology is presented for minimizing mean error variance in generalizability studies when resource constraints are imposed. The optimal number of observations and conditions of facets for random model, fully crossed one- and two-facet designs can be decided. Parallel closed form formulas can be determined for other designs. (SLD)
Descriptors: Budgeting, Equations (Mathematics), Error of Measurement, Generalizability Theory