ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	1
Since 2006 (last 20 years)	4

Descriptor

Test Items	5
Test Construction	3
Test Reliability	3
Equated Scores	2
Psychometrics	2
Reliability	2
Scores	2
Scoring	2
Accountability	1
Behavior Change	1
Bias	1
Classification	1
Computation	1
Computer Programs	1
Confidence Testing	1
Criterion Referenced Tests	1
Error Patterns	1
Error of Measurement	1
Generalizability Theory	1
High Stakes Tests	1
Item Analysis	1
Item Response Theory	1
Literature Reviews	1
Mastery Tests	1
Measurement Techniques	1

Source

Applied Psychological…	1
College Board	1
Educational and Psychological…	1
Measurement:…	1

Author

Brennan, Robert L.	5
Lee, Won-Chan	3
Kim, Stella Y.	1
Lee, Eunjung	1
Wan, Lei	1

Publication Type

Journal Articles	3
Reports - Research	3
Numerical/Quantitative Data	1
Opinion Papers	1
Reports - Descriptive	1
Reports - Evaluative	1

Showing all 5 results

Extended Multivariate Generalizability Theory with Complex Design Structures

Peer reviewed

Brennan, Robert L.; Kim, Stella Y.; Lee, Won-Chan – Educational and Psychological Measurement, 2022

This article extends multivariate generalizability theory (MGT) to tests with different random-effects designs for each level of a fixed facet. There are numerous situations in which the design of a test and the resulting data structure are not definable by a single design. One example is mixed-format tests that are composed of multiple-choice and…

Descriptors: Multivariate Analysis, Generalizability Theory, Multiple Choice Tests, Test Construction

Testing for Accountability: A Balancing Act That Challenges Current Testing Practices and Theories

Peer reviewed

Brennan, Robert L. – Measurement: Interdisciplinary Research and Perspectives, 2015

Koretz, in his article published in this issue, provides compelling arguments that the high stakes currently associated with accountability testing lead to behavioral changes in students, teachers, and other stakeholders that often have negative consequences, such as inflated scores. Koretz goes on to argue that these negative consequences require…

Descriptors: Accountability, High Stakes Tests, Behavior Change, Student Behavior

Exploring Equity Properties in Equating Using AP® Examinations. Research Report No. 2012-4

Lee, Eunjung; Lee, Won-Chan; Brennan, Robert L. – College Board, 2012

In almost all high-stakes testing programs, test equating is necessary to ensure that test scores across multiple test administrations are equivalent and can be used interchangeably. Test equating becomes even more challenging in mixed-format tests, such as Advanced Placement Program® (AP®) Exams, that contain both multiple-choice and constructed…

Descriptors: Test Construction, Test Interpretation, Test Norms, Test Reliability

Classification Consistency and Accuracy for Complex Assessments under the Compound Multinomial Model

Peer reviewed

Lee, Won-Chan; Brennan, Robert L.; Wan, Lei – Applied Psychological Measurement, 2009

For a test that consists of dichotomously scored items, several approaches have been reported in the literature for estimating classification consistency and accuracy indices based on a single administration of a test. Classification consistency and accuracy have not been studied much, however, for "complex" assessments--for example,…

Descriptors: Classification, Reliability, Test Items, Scoring

The Evaluation of Mastery Test Items. Final Report.

Brennan, Robert L. – 1974

The first four chapters of this report primarily provide an extensive, critical review of the literature with regard to selected aspects of the criterion-referenced and mastery testing fields. Major topics treated include: (a) definitions, distinctions, and background, (b) the relevance of classical test theory, (c) validity and procedures for…

Descriptors: Computer Programs, Confidence Testing, Criterion Referenced Tests, Error of Measurement