ERIC - Search Results

Descriptor

Test Reliability	14
Mathematical Models	7
Mastery Tests	6
Criterion Referenced Tests	5
Multiple Choice Tests	5
Cutting Scores	4
Achievement Tests	3
Guessing (Tests)	3
Item Analysis	3
Measurement Techniques	3
Probability	3
Test Interpretation	3
Test Theory	3
Testing Problems	3
True Scores	3
Bayesian Statistics	2
Latent Trait Theory	2
Psychometrics	2
Sampling	2
Scoring	2
Scoring Formulas	2
Statistical Analysis	2
Test Construction	2
Test Items	2
Test Length	2

Source

Educational and Psychological…	4
Psychometrika	3
Journal of Educational…	2
Journal of Experimental…	1

Author

Wilcox, Rand R.

Publication Type

Reports - Research	11
Journal Articles	8
Collected Works - General	1
Guides - Non-Classroom	1
Reports - Descriptive	1
Reports - Evaluative	1

Showing all 14 results

The Single Administration Estimate of the Proportion of Agreement of a Proficiency Test Scored with a Latent Structure Model.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1981

This paper describes and compares procedures for estimating the reliability of proficiency tests that are scored with latent structure models. Results suggest that the predictive estimate is the most accurate of the procedures. (Author/BW)

Descriptors: Criterion Referenced Tests, Scoring, Test Reliability

Prediction Analysis and the Reliability of a Mastery Test.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1979

The classical estimate of a binomial probability function is to estimate its mean in the usual manner and to substitute the results in the appropriate expression. Two alternative estimation procedures are described and examined. Emphasis is given to the single administration estimate of the mastery test reliability. (Author/CTM)

Descriptors: Cutting Scores, Mastery Tests, Probability, Scores
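
The "classical estimate" mentioned in the abstract above can be illustrated with a minimal Python sketch. It assumes every examinee shares one success probability and that each number-correct score is binomial over the same number of items. The function name `plugin_binomial_pmf` and its arguments are hypothetical, chosen only for illustration. The two alternative procedures the paper examines are not reproduced here.

```python
from math import comb


def plugin_binomial_pmf(scores, n_items):
    """Plug-in estimate of the binomial probability function.

    scores  -- observed number-correct scores (hypothetical data)
    n_items -- number of items on the test
    """
    # Estimate the per-item success probability from the sample mean.
    p_hat = sum(scores) / (len(scores) * n_items)
    # Substitute the estimate into the binomial probability function
    # for every possible score x = 0, 1, ..., n_items.
    return [
        comb(n_items, x) * p_hat**x * (1 - p_hat) ** (n_items - x)
        for x in range(n_items + 1)
    ]
```

Calling `plugin_binomial_pmf([2, 2], 4)` gives an estimated success probability of 0.5, so the estimated probability of a score of 2 is 6/16.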

Using Results on k Out of n System Reliability to Study and Characterize Tests.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1982

Results in the engineering literature on "k out of n system reliability" can be used to characterize tests based on estimates of the probability of correctly determining whether the examinee knows the correct response. In particular, the minimum number of distractors required for multiple-choice tests can be empirically determined.…

Descriptors: Achievement Tests, Mathematical Models, Multiple Choice Tests, Test Format
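
As a rough illustration of the engineering notion of "k out of n system reliability" referred to above, the sketch below computes the probability that at least k of n independent components succeed. Read in testing terms, a "component" would be one item, and its probability the chance of correctly determining whether the examinee knows that item. Independence across items is an assumption of this sketch, and the function name `k_out_of_n_reliability` is hypothetical. It is not the estimation procedure from the paper.

```python
def k_out_of_n_reliability(probs, k):
    """Probability that at least k of the independent components succeed.

    probs -- per-component success probabilities (hypothetical inputs)
    k     -- minimum number of successes required
    """
    # dist[j] = probability that exactly j of the components seen so far succeed
    dist = [1.0]
    for p in probs:
        nxt = [0.0] * (len(dist) + 1)
        for j, mass in enumerate(dist):
            nxt[j] += mass * (1 - p)   # this component fails
            nxt[j + 1] += mass * p     # this component succeeds
        dist = nxt
    # Sum the probabilities of k, k + 1, ..., n successes.
    return sum(dist[k:])
```

For two components with probability 0.5 each, `k_out_of_n_reliability([0.5, 0.5], 1)` is 0.75, since only the outcome where both fail is excluded.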

A Review of the Beta-Binomial Model and Its Extensions.

Peer reviewed

Wilcox, Rand R. – Journal of Educational Statistics, 1981

Both the binomial and beta-binomial models are applied to various problems occurring in mental test theory. The paper reviews and critiques these models. The emphasis is on the extensions of the models that have been proposed in recent years, and that might not be familiar to many educators. (Author)

Descriptors: Error of Measurement, Item Analysis, Mathematical Models, Test Reliability

Estimating True Score in the Compound Binomial Error Model

Peer reviewed

Wilcox, Rand R. – Psychometrika, 1978

Several Bayesian approaches to the simultaneous estimation of the means of k binomial populations are discussed. This has particular applicability to criterion-referenced or mastery testing. (Author/JKS)

Descriptors: Bayesian Statistics, Criterion Referenced Tests, Mastery Tests, Probability

On False-Positive and False-Negative Decisions with a Mastery Test.

Wilcox, Rand R. – 1980

Wilcox (1977) examines two methods of estimating the probability of a false-positive or false-negative decision with a mastery test. Both procedures make assumptions about the form of the true score distribution which might not give good results in all situations. In this paper, upper and lower bounds on the two possible error types are described…

Descriptors: Cutting Scores, Mastery Tests, Mathematical Models, Student Placement

An Approximation of the K Out of N Reliability of a Test, and a Scoring Procedure for Determining which Items an Examinee Knows.

Peer reviewed

Wilcox, Rand R. – Psychometrika, 1983

A procedure for determining the reliability of an examinee knowing k out of n possible multiple choice items given his or her performance on those items is presented. Also, a scoring procedure for determining which items an examinee knows is presented. (Author/JKS)

Descriptors: Item Analysis, Latent Trait Theory, Measurement Techniques, Multiple Choice Tests

A Lower Bound to the Probability of Choosing the Optimal Passing Score for a Mastery Test When There Is an External Criterion.

Peer reviewed

Wilcox, Rand R. – Psychometrika, 1979

The problem of determining an optimal passing score for a mastery test is discussed, when the purpose of the test is to predict success on an external criterion. For the case of constant losses for the two possible error types, a method for determining passing scores is derived. (Author/JKS)

Descriptors: Criterion Referenced Tests, Cutting Scores, Mastery Tests, Mathematical Models

Estimating the Likelihood of False-Positive and False-Negative Decisions in Mastery Testing: An Empirical Bayes Approach

Peer reviewed

Wilcox, Rand R. – Journal of Educational Statistics, 1977

False-positive and false-negative decisions are the two possible errors committed with a mastery test; yet the estimation of the likelihood of committing these errors has not been investigated. Two methods of this type of estimation are presented and discussed. (Author/JKS)

Descriptors: Bayesian Statistics, Hypothesis Testing, Mastery Tests, Measurement Techniques

Estimating the Parameters of the Beta-Binomial Distribution.

Peer reviewed

Wilcox, Rand R. – Educational and Psychological Measurement, 1979

For some situations the beta-binomial distribution might be used to describe the marginal distribution of test scores for a particular population of examinees. Several different methods of approximating the maximum likelihood estimate were investigated, and it was found that the Newton-Raphson method should be used when it yields admissible…

Descriptors: Criterion Referenced Tests, Maximum Likelihood Statistics, Measurement, Monte Carlo Methods

Test Design Project: Studies in Test Adequacy. Annual Report.

Wilcox, Rand R. – 1981

These studies in test adequacy focus on two problems: procedures for estimating reliability, and techniques for identifying ineffective distractors. Fourteen papers are presented on recent advances in measuring achievement (a response to Molenaar); "an extension of the Dirichlet-multinomial model that allows true score and guessing to be…

Descriptors: Achievement Tests, Criterion Referenced Tests, Guessing (Tests), Mathematical Models

R. & D. in Psychometrics: Technical Reports on Latent Structure Models.

Wilcox, Rand R. – 1982

This document contains three papers from the Methodology Project of the Center for the Study of Evaluation. Methods for characterizing test accuracy are reported in the first two papers. "Bounds on the K Out of N Reliability of a Test, and an Exact Test for Hierarchically Related Items" describes and illustrates how an extension of a…

Descriptors: Educational Testing, Evaluation Methods, Guessing (Tests), Latent Trait Theory

A Closed Sequential Procedure for Answer-Until-Correct Tests.

Peer reviewed

Wilcox, Rand R. – Journal of Experimental Education, 1982

A closed sequential procedure for estimating true score is proposed for use with answer-until-correct tests. The accuracy of determining true score is the same as in conventional sequential solutions, but the possibility of using an unnecessarily large number of items is eliminated. (Author/CM)

Descriptors: Answer Sheets, Guessing (Tests), Item Banks, Measurement Techniques

An Approach to Measuring the Achievement or Proficiency of an Examinee.

Wilcox, Rand R. – 1979

Mastery tests are analyzed in terms of the number of skills to be mastered and the number of items per skill, in order that correct decisions of mastery or nonmastery will be made to a desired degree of probability. It is assumed that a random sample of skills will be selected for measurement, that each skill will be measured by the same number of…

Descriptors: Achievement Tests, Cutting Scores, Decision Making, Equivalency Tests