Abedi, Jamal – Multivariate Behavioral Research, 1996
The Interrater/Test Reliability System (ITRS), a comprehensive computer tool for addressing questions of interrater reliability, is described. The ITRS computes several different indices of interrater reliability as well as the generalizability coefficient over raters and topics. The system is available in IBM-compatible or Macintosh format. (SLD)
Descriptors: Computer Software, Computer Software Evaluation, Evaluation Methods, Evaluators

Kane, Michael – Applied Measurement in Education, 1996
This overview of the role of error and tolerance for error in measurement asserts that the generic precision associated with a measurement procedure is defined as the root mean square error, or standard error, in some relevant population. This view of precision is explored in several applications of measurement. (SLD)
Descriptors: Error of Measurement, Error Patterns, Generalizability Theory, Measurement Techniques
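
A worked form of that definition, with notation assumed here rather than taken from the article: for an observed score X and the corresponding true score T in the relevant population,

    \sigma_E = \sqrt{E\left[(X - T)^{2}\right]}

that is, the standard error is the root mean square of the observed-minus-true discrepancies, and smaller \sigma_E means greater generic precision of the measurement procedure.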

Norcini, John J.; And Others – Evaluation and the Health Professions, 1990
Aggregate scoring was applied to a recertifying examination for medical professionals to generate an answer key and allow comparison of peer examinees. Results for 1,927 candidates for recertification indicate considerable agreement between the traditional answer key and the aggregate answer key. (TJH)
Descriptors: Answer Keys, Criterion Referenced Tests, Error of Measurement, Generalizability Theory
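
A minimal sketch of one common aggregate-scoring rule, keying each item to the modal response of the peer group; the rule, names, and toy data below are illustrative assumptions, not details reported in the article.

    from collections import Counter

    def aggregate_key(responses):
        # For each item, take the answer chosen most often by the peer examinees.
        n_items = len(responses[0])
        return [Counter(r[i] for r in responses).most_common(1)[0][0]
                for i in range(n_items)]

    # Toy data: four peer examinees answering three items
    peer_responses = [["A", "C", "B"],
                      ["A", "C", "D"],
                      ["B", "C", "B"],
                      ["A", "D", "B"]]
    traditional_key = ["A", "C", "B"]

    agg = aggregate_key(peer_responses)
    agreement = sum(a == t for a, t in zip(agg, traditional_key)) / len(agg)
    print(agg, agreement)  # ['A', 'C', 'B'] 1.0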

Marcoulides, George A. – Educational and Psychological Measurement, 1994
Effects of different weighting schemes on selecting the optimal number of observations in multivariate-multifacet generalizability designs are studied when cost constraints are imposed. Comparison of four schemes through simulation indicates that all four produce similar optimal values and yield similar reliability. (SLD)
Descriptors: Budgeting, Comparative Analysis, Costs, Factor Analysis

Goldstein, Zvi; Marcoulides, George A. – Educational and Psychological Measurement, 1991
An efficient search procedure is presented for determining the optimal number of observations for each facet in a design that maximizes generalizability when resource constraints are imposed. The procedure is illustrated for three-facet and four-facet designs, with extensions to other configurations. (Author/SLD)
Descriptors: Cost Effectiveness, Decision Making, Equations (Mathematics), Generalizability Theory
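
As a simplified illustration of the underlying optimization problem (a brute-force stand-in, not the article's efficient search procedure), the sketch below picks the numbers of tasks and raters in a two-facet persons x tasks x raters design that maximize the generalizability coefficient for relative decisions under a budget; the variance components and unit costs are invented for the example.

    # Hypothetical variance components from a persons x tasks x raters G study
    var_p, var_pt, var_pr, var_ptr_e = 0.50, 0.20, 0.10, 0.30
    cost_per_task, cost_per_rater = 5.0, 2.0  # assumed unit costs
    budget = 40.0

    def g_coefficient(n_t, n_r):
        # Relative-error variance for a D study with n_t tasks and n_r raters
        rel_error = var_pt / n_t + var_pr / n_r + var_ptr_e / (n_t * n_r)
        return var_p / (var_p + rel_error)

    feasible = [(n_t, n_r) for n_t in range(1, 9) for n_r in range(1, 9)
                if n_t * cost_per_task + n_r * cost_per_rater <= budget]
    n_t, n_r = max(feasible, key=lambda d: g_coefficient(*d))
    print(n_t, n_r, round(g_coefficient(n_t, n_r), 3))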

Smith, Philip L.; Luecht, Richard M. – Applied Psychological Measurement, 1992
The implications of serially correlated effects on the results of generalizability analyses are discussed. Simulated data are provided that demonstrate the biases that serially correlated effects introduce into the results. Serial correlation in measurement effects can have a marked influence on the impression of the dependability of measurement…
Descriptors: Computer Simulation, Correlation, Equations (Mathematics), Estimation (Mathematics)

Goodwin, Laura D.; And Others – Journal of Special Education, 1991
Using data from an individually administered interview schedule (the Consumer Satisfaction Inventory), reliability among nine interviewers was estimated with several statistical methods, including simple percentages of agreement, kappa and weighted kappa, Pearson correlations, t tests on interviewers' means, and generalizability theory techniques.…
Descriptors: Disabilities, Educational Research, Elementary Secondary Education, Estimation (Mathematics)
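
As a sketch of two of the simpler indices the study compares, the code below computes simple percent agreement and Cohen's kappa for one pair of raters; the ratings are made up, and the remaining methods (weighted kappa, Pearson correlations, t tests, generalizability analysis) are not shown.

    def percent_agreement(r1, r2):
        return sum(a == b for a, b in zip(r1, r2)) / len(r1)

    def cohens_kappa(r1, r2):
        categories = set(r1) | set(r2)
        n = len(r1)
        p_o = percent_agreement(r1, r2)
        # Chance agreement from each rater's marginal category proportions
        p_e = sum((r1.count(c) / n) * (r2.count(c) / n) for c in categories)
        return (p_o - p_e) / (1 - p_e)

    rater_1 = [1, 1, 2, 3, 2, 1, 3, 3]
    rater_2 = [1, 2, 2, 3, 2, 1, 3, 1]
    print(percent_agreement(rater_1, rater_2), round(cohens_kappa(rater_1, rater_2), 3))
    # 0.75 0.628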

Swartz, Carl W.; Hooper, Stephen R.; Montgomery, James W.; Wakely, Melissa B.; De Kruif, Renee E. L.; Reed, Martha; Brown, Timothy T.; Levine, Melvin D.; White, Kinnard P. – Educational and Psychological Measurement, 1999
Used generalizability theory to investigate the impact of the number of raters and the type of decision (relative versus absolute) on the reliability of writing scores. Results from 251 middle school students and 20 intermediate grade students show that reliability coefficients decline as the number of raters declines and when absolute decisions…
Descriptors: Estimation (Mathematics), Generalizability Theory, Holistic Evaluation, Intermediate Grades
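
The pattern described here follows from the standard one-facet persons x raters D-study coefficients of generalizability theory; the formulas below are textbook results rather than the article's specific design. With person variance \sigma^2_p, rater variance \sigma^2_r, residual \sigma^2_{pr,e}, and n_r raters,

    E\rho^{2} = \frac{\sigma^{2}_{p}}{\sigma^{2}_{p} + \sigma^{2}_{pr,e}/n_{r}}
    \qquad
    \Phi = \frac{\sigma^{2}_{p}}{\sigma^{2}_{p} + (\sigma^{2}_{r} + \sigma^{2}_{pr,e})/n_{r}}

Both coefficients fall as n_r decreases, and \Phi (absolute decisions) is never larger than E\rho^{2} (relative decisions) because its error term also includes the rater main effect.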

Hoyt, William T.; Melby, Janet N. – Counseling Psychologist, 1999
Addresses generalizability theory (GT), which offers a flexible framework for assessing dependability of measurement. GT allows for consideration of multiple sources of error, allowing investigators to assess the overall impact of measurement error. Illustrative analyses demonstrate the special advantages of GT for planning studies in which…
Descriptors: Counseling Psychology, Generalizability Theory, Measurement, Research Design

Norcini, John; Grosso, Lou – Applied Measurement in Education, 1998
Ratings of test item relevance were collected from 57 practitioners from a pretest of a medical certifying examination. Ratings were correlated with item difficulty, but the relationship between ratings and item discrimination was less clear. Application of generalizability theory shows that reasonable estimates of item, stem, and total test…
Descriptors: Certification, Difficulty Level, Estimation (Mathematics), Generalizability Theory

Heck, Ronald H.; Johnsrud, Linda K.; Rosser, Vicki J. – Research in Higher Education, 2000
Little research exists on the assessment of administrators' performance in higher education. The authors offer an evaluation model for assessing and monitoring the effectiveness of academic deans and directors, using generalizability theory as a basis for developing more accurate assessment procedures. (JM)
Descriptors: Academic Deans, Administrator Effectiveness, Administrator Evaluation, College Administration

Yin, Ping – Educational and Psychological Measurement, 2005
The main purpose of this study is to examine the content structure of the Multistate Bar Examination (MBE) using the "table of specifications" model from the perspective of multivariate generalizability theory. Specifically, using MBE data collected over different years (six administrations: three from the February test and three from the July test),…
Descriptors: Correlation, Generalizability Theory, Statistical Analysis, Multivariate Analysis

Johnson, Robert L.; Penny, James; Gordon, Belita; Shumate, Steven R.; Fisher, Steven P. – Language Assessment Quarterly, 2005
Many studies have indicated that at least 2 raters should score writing assessments to improve interrater reliability. However, even for assessments that characteristically demonstrate high levels of rater agreement, 2 raters of the same essay can occasionally report different, or discrepant, scores. If a single score, typically referred to as an…
Descriptors: Interrater Reliability, Scores, Evaluation, Reliability

Stern, Hal S. – Psychological Methods, 2005
I. Klugkist, O. Laudy, and H. Hoijtink (2005) presented a Bayesian approach to analysis of variance models with inequality constraints. Constraints may play 2 distinct roles in data analysis. They may represent prior information that allows more precise inferences regarding parameter values, or they may describe a theory to be judged against the…
Descriptors: Probability, Inferences, Bayesian Statistics, Data Analysis

Cheong, Yuk Fai – International Journal of Testing, 2006
This article considers and illustrates a strategy to study effects of school context on differential item functioning (DIF) in large-scale assessment. The approach employs a hierarchical generalized linear modeling framework to (a) detect DIF, and (b) identify school-level correlates of the between-group differences in item performance. To…
Descriptors: Context Effect, Test Bias, Causal Models, Educational Assessment