Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 1 |
Descriptor
Author
Publication Type
Speeches/Meeting Papers | 34 |
Reports - Research | 18 |
Reports - Evaluative | 16 |
Journal Articles | 2 |
Information Analyses | 1 |
Education Level
Audience
Researchers | 11 |
Location
Laws, Policies, & Programs
Assessments and Surveys
Law School Admission Test | 2 |
Medical College Admission Test | 1 |
Otis Lennon School Ability… | 1 |
Texas Assessment of Basic… | 1 |
Texas Educational Assessment… | 1 |
What Works Clearinghouse Rating
Smith, William Zachary; Dickenson, Tammiee S.; Rogers, Bradley David – AERA Online Paper Repository, 2017
Questionnaire refinement and a process for selecting items for elimination are important tools for survey developers. One of the major obstacles in questionnaire refinement and elimination in surveys lies in one's ability to adequately and appropriately reconstruct a survey. Often times, surveys can be long and strenuous on the respondent,…
Descriptors: Surveys, Psychometrics, Test Construction, Test Reliability
Flowers, Claudia P.; And Others – 1996
N. S. Raju, W. J. van der Linden, and P. F. Fleer (in press) have proposed an item response theory-based, parametric procedure for the detection of differential item functioning (DIF)/differential test functioning (DTF) known as differential functioning of item and test (DFIT). DFIT can be used with dichotomous, polytomous, or multidimensional…
Descriptors: Item Response Theory, Mathematical Models, Simulation, Test Bias
Haladyna, Tom; Roid, Gale – 1980
The problems associated with misclassifying students when pass-fail decisions are based on test scores are discussed. One protection against misclassification is to set a confidence interval around the cutting score. Those whose scores fall above the interval are passed; those whose scores fall below the interval are failed; and those whose scores…
Descriptors: Bayesian Statistics, Classification, Comparative Analysis, Criterion Referenced Tests

De Champlain, Andre F.; Gessaroli, Marc E.; Tang, K. Linda; De Champlain, Judy E. – 1998
The empirical Type I error rates of Poly-DIMTEST (H. Li and W. Stout, 1995) and the LISREL8 chi square fit statistic (K. Joreskog and D. Sorbom, 1993) were compared with polytomous unidimensional data sets simulated to vary as a function of test length and sample size. The rejection rates for both statistics were also studied with two-dimensional…
Descriptors: Chi Square, Goodness of Fit, Item Response Theory, Sample Size
Patsula, Liane N.; Gessaroli, Marc E. – 1995
Among the most popular techniques used to estimate item response theory (IRT) parameters are those used in the LOGIST and BILOG computer programs. Because of its accuracy with smaller sample sizes or differing test lengths, BILOG has become the standard to which new estimation programs are compared. However, BILOG is still complex and…
Descriptors: Comparative Analysis, Effect Size, Estimation (Mathematics), Item Response Theory
Schulz, E. Matthew; Wang, Lin – 2001
In this study, items were drawn from a full-length test of 30 items in order to construct shorter tests for the purpose of making accurate pass/fail classifications with regard to a specific criterion point on the latent ability metric. A three-item parameter Item Response Theory (IRT) framework was used. The criterion point on the latent ability…
Descriptors: Ability, Classification, Item Response Theory, Pass Fail Grading
De Champlain, Andre – 1996
The usefulness of a goodness-of-fit index proposed by R. P. McDonald (1989) was investigated with regard to assessing the dimensionality of item response matrices. The m subscript k index, which is based on an estimate of the noncentrality parameter of the noncentral chi-square distribution, possesses several advantages over traditional tests of…
Descriptors: Chi Square, Cutting Scores, Goodness of Fit, Item Response Theory
Ang, Cheng; Miller, M. David – 1993
The power of the procedure of W. Stout to detect deviations from essential unidimensionality in two-dimensional data was investigated for minor, moderate, and large deviations from unidimensionality using criteria for deviations from unidimensionality based on prior research. Test lengths of 20 and 40 items and sample sizes of 700 and 1,500 were…
Descriptors: Ability, Comparative Testing, Correlation, Item Response Theory
Lautenschlager, Gary J.; Park, Dong-Gun – 1987
The effects of variations in degree of range restriction and different subgroup sample sizes on the validity of several item bias detection procedures based on Item Response Theory (IRT) were investigated in a simulation study. The degree of range restriction for each of two subpopulations was varied by cutting the specified subpopulation ability…
Descriptors: Computer Simulation, Item Analysis, Latent Trait Theory, Mathematical Models
Chang, S. Tai; Bashaw, W. L. – 1984
The purpose of this study was twofold: to investigate to what extent characteristics of anchor tests may affect precision of item calibration, and to estimate to what extent precision of item calibration may be affected by removal of persons whose response patterns deviate from those normally expected from the Rasch one-parameter logistic model.…
Descriptors: Aptitude Tests, Difficulty Level, Equated Scores, Junior High Schools
De Champlain, Andre F.; Gessaroli, Marc E. – 1997
A study was conducted to compare, with simulated unidimensional and two-dimensional sets, the Type I error probabilities and rejection rates obtained with two versions of the LISREL computer program, the earlier version PRELIS/LISREL 7 and the later version PRELIS2/LISREL8, a version that corrects the asymptotic covariance matrix. Unidimensional…
Descriptors: Chi Square, Comparative Analysis, Goodness of Fit, Item Response Theory

Bush, M. Joan; Schumacker, Randall E. – 1993
The feasibility of quick norms derived by the procedure described by B. D. Wright and M. H. Stone (1979) was investigated. Norming differences between traditionally calculated means and Rasch "quick" means were examined for simulated data sets of varying sample size, test length, and type of distribution. A 5 by 5 by 2 design with a…
Descriptors: Computer Simulation, Item Response Theory, Norm Referenced Tests, Sample Size
De Ayala, R. J. – 1993
Previous work on the effects of dimensionality on parameter estimation was extended from dichotomous models to the polytomous graded response (GR) model. A multidimensional GR model was developed to generate data in one-, two-, and three-dimensions, with two- and three-dimensional conditions varying in their interdimensional associations. Test…
Descriptors: Computer Simulation, Correlation, Difficulty Level, Estimation (Mathematics)
Hwang, Chi-en; Cleary, T. Anne – 1986
The results obtained from two basic types of pre-equatings of tests were compared: the item response theory (IRT) pre-equating and section pre-equating (SPE). The simulated data were generated from a modified three-parameter logistic model with a constant guessing parameter. Responses of two replication samples of 3000 examinees on two 72-item…
Descriptors: Computer Simulation, Equated Scores, Latent Trait Theory, Mathematical Models
De Champlain, Andre; Gessaroli, Marc E. – 1996
The use of indices and statistics based on nonlinear factor analysis (NLFA) has become increasingly popular as a means of assessing the dimensionality of an item response matrix. Although the indices and statistics currently available to the practitioner have been shown to be useful and accurate in many testing situations, few studies have…
Descriptors: Adaptive Testing, Chi Square, Computer Assisted Testing, Factor Analysis