Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 6 |
Author
Brennan, Robert L. | 32 |
Kane, Michael T. | 4 |
Lee, Won-Chan | 4 |
Hanson, Bradley A. | 2 |
Yin, Ping | 2 |
Gao, Xiaohong | 1 |
Johnson, Eugene G. | 1 |
Kane, Michael F. | 1 |
Kim, Seonghoon | 1 |
Kim, Stella Y. | 1 |
Kreiter, Clarence D. | 1 |
Publication Type
Journal Articles | 18 |
Reports - Research | 13 |
Reports - Evaluative | 9 |
Speeches/Meeting Papers | 4 |
Reports - Descriptive | 3 |
Information Analyses | 2 |
Numerical/Quantitative Data | 2 |
Opinion Papers | 1 |
Education Level
Higher Education | 1 |
Postsecondary Education | 1 |
Assessments and Surveys
Iowa Tests of Basic Skills | 1 |
Work Keys (ACT) | 1 |
Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022
This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…
Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction
Brennan, Robert L. – Measurement: Interdisciplinary Research and Perspectives, 2015
Koretz, in his article published in this issue, provides compelling arguments that the high stakes currently associated with accountability testing lead to behavioral changes in students, teachers, and other stakeholders that often have negative consequences, such as inflated scores. Koretz goes on to argue that these negative consequences require…
Descriptors: Accountability, High Stakes Tests, Behavior Change, Student Behavior
Lee, Eunjung; Lee, Won-Chan; Brennan, Robert L. – College Board, 2012
In almost all high-stakes testing programs, test equating is necessary to ensure that test scores across multiple test administrations are equivalent and can be used interchangeably. Test equating becomes even more challenging in mixed-format tests, such as Advanced Placement Program® (AP®) Exams, that contain both multiple-choice and constructed…
Descriptors: Test Construction, Test Interpretation, Test Norms, Test Reliability
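As a minimal illustration of what "used interchangeably" requires, here is a sketch of linear equating, the simplest classical equating function; it is not the method studied for the mixed-format AP case, and the moments below are invented:

```python
import numpy as np

def linear_equate(x, mu_x, sd_x, mu_y, sd_y):
    """Map a score x on form X to the form-Y scale by matching the
    first two moments (the simplest classical equating function)."""
    return mu_y + sd_y * (x - mu_x) / sd_x

# Example: form X is harder (lower mean), so an X score of 30
# converts to a higher equivalent score on form Y.
print(linear_equate(30, mu_x=28.0, sd_x=6.0, mu_y=31.0, sd_y=6.5))
```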
Brennan, Robert L. – Applied Measurement in Education, 2011
Broadly conceived, reliability involves quantifying the consistencies and inconsistencies in observed scores. Generalizability theory, or G theory, is particularly well suited to addressing such matters in that it enables an investigator to quantify and distinguish the sources of inconsistencies in observed scores that arise, or could arise, over…
Descriptors: Generalizability Theory, Test Theory, Test Reliability, Item Response Theory
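To make the "sources of inconsistency" idea concrete, here is a minimal sketch, not drawn from the article, of a single-facet persons × items G study: variance components are estimated from the ANOVA expected mean squares, and two common reliability-like coefficients follow from them. The function name and data are illustrative assumptions.

```python
import numpy as np

def g_study_p_by_i(X, n_items_d=None):
    """Variance components for a fully crossed persons x items design,
    estimated from the ANOVA expected mean squares.
    X: 2-D array, rows = persons, columns = items."""
    n_p, n_i = X.shape
    grand = X.mean()
    ss_p = n_i * ((X.mean(axis=1) - grand) ** 2).sum()
    ss_i = n_p * ((X.mean(axis=0) - grand) ** 2).sum()
    ss_res = ((X - grand) ** 2).sum() - ss_p - ss_i
    ms_p = ss_p / (n_p - 1)
    ms_i = ss_i / (n_i - 1)
    ms_res = ss_res / ((n_p - 1) * (n_i - 1))
    var_pi = ms_res                           # interaction + residual error
    var_p = max((ms_p - ms_res) / n_i, 0.0)   # universe-score (person) variance
    var_i = max((ms_i - ms_res) / n_p, 0.0)   # item difficulty variance
    n_d = n_items_d or n_i                    # D-study test length
    e_rho2 = var_p / (var_p + var_pi / n_d)           # relative decisions
    phi = var_p / (var_p + (var_i + var_pi) / n_d)    # absolute decisions
    return {"p": var_p, "i": var_i, "pi,e": var_pi,
            "Erho2": e_rho2, "Phi": phi}

# Toy data: 50 persons x 10 items, a true person effect plus noise
rng = np.random.default_rng(1)
scores = rng.normal(0, 1, (50, 1)) + rng.normal(0, 0.7, (50, 10))
print(g_study_p_by_i(scores))
```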
Classification Consistency and Accuracy for Complex Assessments under the Compound Multinomial Model
Lee, Won-Chan; Brennan, Robert L.; Wan, Lei – Applied Psychological Measurement, 2009
For a test that consists of dichotomously scored items, several approaches have been reported in the literature for estimating classification consistency and accuracy indices based on a single administration of a test. Classification consistency and accuracy have not been studied much, however, for "complex" assessments--for example,…
Descriptors: Classification, Reliability, Test Items, Scoring
Yi, Hyun Sook; Kim, Seonghoon; Brennan, Robert L. – Applied Psychological Measurement, 2007
Large-scale testing programs involving classification decisions typically have multiple forms available and conduct equating to ensure cut-score comparability across forms. A test developer might be interested in the extent to which an examinee who happens to take a particular form would have a consistent classification decision if he or she had…
Descriptors: Classification, Reliability, Indexes, Computation

Brennan, Robert L. – Journal of Educational Measurement, 2001
Reviews important milestones in the history of reliability, current issues related to reliability, and likely prospects for reliability from the perspective of what constitutes a replication of a measurement procedure. Pays special attention to the fixed/random aspects of facets that characterize replications. (SLD)
Descriptors: Educational Testing, Measurement Techniques, Reliability
Lee, Won-Chan; Hanson, Bradley A.; Brennan, Robert L. – 2000
This paper describes procedures for estimating various indices of classification consistency and accuracy for multiple category classifications using data from a single test administration. The estimates of the classification consistency and accuracy indices are compared under three different psychometric models: the two-parameter beta binomial,…
Descriptors: Classification, Estimation (Mathematics), Item Response Theory, Reliability
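The single-administration idea can be sketched for the simplest of the three models compared, the two-parameter beta binomial: a Beta true-score distribution with binomial error, integrated numerically over the true-score scale. The cut scores and parameters below are invented; this illustrates the general approach, not the paper's own procedures.

```python
import numpy as np
from scipy import stats

def consistency_accuracy(n_items, cuts, a, b, grid=2001):
    """Classification consistency and accuracy from one administration
    under a two-parameter beta-binomial model: true proportion-correct
    theta ~ Beta(a, b), observed score X | theta ~ Binomial(n_items, theta).
    cuts: ascending raw-score cut points defining the categories."""
    theta = np.linspace(1e-6, 1 - 1e-6, grid)
    f = stats.beta.pdf(theta, a, b)
    edges = [0] + list(cuts) + [n_items + 1]
    cat_probs = []                      # P(X lands in category h | theta)
    for lo, hi in zip(edges[:-1], edges[1:]):
        xs = np.arange(lo, min(hi, n_items + 1))
        cat_probs.append(
            stats.binom.pmf(xs[:, None], n_items, theta[None, :]).sum(axis=0))
    cat_probs = np.array(cat_probs)     # shape: (n_categories, grid)
    # Consistency: two parallel administrations agree on the category
    phi = np.trapz((cat_probs ** 2).sum(axis=0) * f, theta)
    # Accuracy: observed category matches the true-score category
    true_cat = np.digitize(theta * n_items, cuts)
    gamma = np.trapz(cat_probs[true_cat, np.arange(grid)] * f, theta)
    return phi, gamma

# Example: 40 items, cuts at raw scores 24 and 32, Beta(8, 4) true scores
print(consistency_accuracy(40, [24, 32], a=8, b=4))
```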
Kane, Michael T.; Brennan, Robert L. – 1977
A large number of seemingly diverse coefficients have been proposed as indices of dependability, or reliability, for domain-referenced and/or mastery tests. In this paper, it is shown that most of these indices are special cases of two generalized indices of agreement: one that is corrected for chance, and one that is not. The special cases of…
Descriptors: Bayesian Statistics, Correlation, Criterion Referenced Tests, Cutting Scores
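In generic notation (a reconstruction, not necessarily the paper's own symbols), the two families have the familiar agreement-coefficient form, where p_o is the observed proportion of consistent classifications and p_c its expected value under chance:

```latex
% Uncorrected index: observed proportion of agreement.
% Chance-corrected index: agreement beyond chance, rescaled to [.,1].
\[
  \text{uncorrected: } \; p_o,
  \qquad
  \text{chance-corrected: } \; \frac{p_o - p_c}{1 - p_c}.
\]
```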

Brennan, Robert L.; Kane, Michael T. – Journal of Educational Measurement, 1977
An index for the dependability of mastery tests is described. Assumptions necessary for the index and the mathematical development of the index are provided. (Author/JKS)
Descriptors: Criterion Referenced Tests, Mastery Tests, Mathematical Models, Test Reliability
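The index in question is usually written as below (standard G-theory notation; a reconstruction rather than a quotation from the article), where λ is the cut score, μ and σ²(p) are the domain-score mean and person variance, and σ²(Δ) is absolute error variance:

```latex
% Dependability of mastery decisions at cut score \lambda
\[
  \Phi(\lambda)
  = \frac{\sigma^2(p) + (\mu - \lambda)^2}
         {\sigma^2(p) + (\mu - \lambda)^2 + \sigma^2(\Delta)}
\]
```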

Brennan, Robert L.; Prediger, Dale J. – Educational and Psychological Measurement, 1981
This paper considers some appropriate and inappropriate uses of coefficient kappa and alternative kappa-like statistics. Discussion is restricted to the descriptive characteristics of these statistics for measuring agreement with categorical data in studies of reliability and validity. (Author)
Descriptors: Classification, Error of Measurement, Mathematical Models, Test Reliability
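A sketch of the two descriptive statistics most often contrasted in this literature: Cohen's kappa, whose chance term comes from the observed margins, and the kappa-like variant with chance fixed at 1/k for k categories, often associated with this paper. The function name and data are illustrative:

```python
import numpy as np

def kappas(table):
    """Cohen's kappa and the 1/k-chance variant for a k x k agreement
    table (rows: rater A's categories, columns: rater B's)."""
    t = np.asarray(table, dtype=float)
    n = t.sum()
    p_o = np.trace(t) / n                              # observed agreement
    p_c = (t.sum(axis=1) / n) @ (t.sum(axis=0) / n)    # margin-based chance
    k = t.shape[0]
    cohen = (p_o - p_c) / (1 - p_c)
    kappa_n = (p_o - 1 / k) / (1 - 1 / k)              # chance fixed at 1/k
    return cohen, kappa_n

# Example: two raters classifying 100 examinees into 3 categories
print(kappas([[30, 5, 2], [4, 25, 6], [1, 7, 20]]))
```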

Brennan, Robert L.; Lockwood, Robert E. – Applied Psychological Measurement, 1980
Generalizability theory is used to characterize and quantify expected variance in cutting scores and to compare the Nedelsky and Angoff procedures for establishing a cutting score. Results suggest that the restricted nature of the Nedelsky (inferred) probability scale may limit its applicability in certain contexts. (Author/BW)
Descriptors: Cutting Scores, Generalization, Statistical Analysis, Test Reliability
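To make the contrast concrete: in an Angoff study each judge states, item by item, the probability that a minimally competent examinee answers correctly, while a Nedelsky judge instead crosses out distractors, so the implied probability can only take the values 1/m for m options remaining. A small sketch with invented ratings (not the study's data):

```python
import numpy as np

# Angoff: each judge directly rates P(correct) per item on [0, 1].
angoff = np.array([[0.6, 0.7, 0.5, 0.8],     # judge 1, items 1-4
                   [0.5, 0.8, 0.6, 0.7]])    # judge 2

# Nedelsky: each judge eliminates distractors on a 4-option item; the
# implied probability is 1 / (options left), restricted to 1/4, 1/3,
# 1/2, or 1 -- a much coarser scale than Angoff's.
options_left = np.array([[2, 2, 3, 1],
                         [2, 1, 2, 2]])
nedelsky = 1.0 / options_left

# Cutting score = expected raw score of the borderline examinee:
# item probabilities summed over items, then averaged over judges.
print("Angoff cut:  ", angoff.sum(axis=1).mean())
print("Nedelsky cut:", nedelsky.sum(axis=1).mean())
```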

Brennan, Robert L. – Applied Psychological Measurement, 2000
Reviews relevant aspects of generalizability theory related to performance assessments and discusses the role of various facets in assessing the generalizability of performance assessments. Also considers some popular estimates of reliability for performance assessments from the perspective of generalizability theory. (SLD)
Descriptors: Estimation (Mathematics), Evaluation Methods, Generalizability Theory, Performance Based Assessment

Brennan, Robert L. – Educational and Psychological Measurement, 1975
Variance components from a split-plot factorial (SPF) design were used to estimate reliability for schools and for persons within schools. Reliability estimates for persons within schools were compared under the SPF and randomized block (RB) designs, as were reliability estimates for schools under the two designs. (Author/BJG)
Descriptors: Analysis of Variance, Evaluation Methods, Schools, Statistical Analysis
Brennan, Robert L.; Light, Richard J. – 1973
Basic to many psychological investigations is the question of agreement between observers who independently categorize people. Several recent studies have proposed measures of agreement when a set of nominal scale categories has been pre-defined and imposed on both observers. This study, in contrast, develops a measure of agreement for settings…
Descriptors: Analysis of Variance, Correlation, Hypothesis Testing, Rating Scales