NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 16 to 30 of 33 results Save | Export
Peer reviewed Peer reviewed
Feldt, Leonard S. – Applied Measurement in Education, 2002
Considers the degree of bias in testlet-based alpha (internal consistency reliability) through hypothetical examples and real test data from four tests of the Iowa Tests of Basic Skills. Presents a simple formula for computing a testlet-based congeneric coefficient. (SLD)
Descriptors: Estimation (Mathematics), Reliability, Statistical Bias, Test Format
Peer reviewed Peer reviewed
Feldt, Leonard S.; Qualls, Audrey L. – Applied Measurement in Education, 1998
Two relatively simple methods for estimating the condition standard error of measurement (SEM) for nonlinearly derived score scales are proposed. Applications indicate that these two procedures produce fairly consistent estimates that tend to peak near the high end of the scale and reach a minimum in the middle of the raw score scale. (SLD)
Descriptors: Error of Measurement, Estimation (Mathematics), Raw Scores, Reliability
Peer reviewed Peer reviewed
Gilmer, Jerry S.; Feldt, Leonard S. – Psychometrika, 1983
Estimating the reliability of measures derived from separate questions on essay tests or individual judges on a rater panel is considered. Cronbach's alpha is shown to underestimate reliability in these cases. Some alternative coefficients are presented. (JKS)
Descriptors: Essay Tests, Item Analysis, Measurement Techniques, Rating Scales
Peer reviewed Peer reviewed
Feldt, Leonard S. – Psychometrika, 1980
Procedures are developed for testing the hypothesis that Cronbach's alpha reliability coefficient is equal for two tests given to the same subjects. (Author/JKS)
Descriptors: Error of Measurement, Hypothesis Testing, Measurement, Statistical Significance
Peer reviewed Peer reviewed
Feldt, Leonard S. – Applied Measurement in Education, 1997
It has often been asserted that the reliability of a measure places an upper limit on its validity. This article demonstrates in theory that validity can rise when reliability declines, even when validity evidence is a correlation with an acceptable criterion. Whether empirical examples can actually be found is an open question. (SLD)
Descriptors: Correlation, Criteria, Reliability, Test Construction
Peer reviewed Peer reviewed
Feldt, Leonard S. – Applied Measurement in Education, 2002
Considers the situation in which content or administrative considerations limit the way in which a test can be partitioned to estimate the internal consistency reliability of the total test score. Demonstrates that a single-valued estimate of the total score reliability is possible only if an assumption is made about the comparative size of the…
Descriptors: Error of Measurement, Reliability, Scores, Test Construction
Peer reviewed Peer reviewed
Feldt, Leonard S.; Qualls, Audrey L. – Applied Measurement in Education, 1999
Examined the stability of the standard error of measurement and the relationship between the reliability coefficient and the variance of both true scores and error scores for 170 school districts in a state. As expected, reliability coefficients varied as a function of group variability, but the variation in split-half coefficients from school to…
Descriptors: Elementary Secondary Education, Error of Measurement, Reliability, School Districts
Peer reviewed Peer reviewed
Charter, Richard A.; Feldt, Leonard S. – Measurement and Evaluation in Counseling and Development, 2002
Presented is a detailed description of two true score confidence interval approaches, their use, interpretation, and a philosophical conflict that arises in many applied instances. (Contains 27 references.) (Author)
Descriptors: Error of Measurement, Psychometrics, Research Methodology, Statistical Analysis
Peer reviewed Peer reviewed
Alsawalmeh, Yousef M.; Feldt, Leonard S. – Applied Psychological Measurement, 1992
An approximate statistical test is derived for the hypothesis that the intraclass reliability coefficients associated with two measurement procedures are equal. Control of Type 1 error is investigated by comparing empirical sampling distributions of the test statistic with its derived theoretical distribution. A numerical illustration is…
Descriptors: Equations (Mathematics), Hypothesis Testing, Mathematical Models, Measurement Techniques
Peer reviewed Peer reviewed
Direct linkDirect link
Feldt, Leonard S. – Educational and Psychological Measurement, 2005
To meet the requirements of the No Child Left Behind Act, school districts and states must compile summary reports of the levels of student achievement in reading and mathematics. The levels are to be described in broad categories: "basic and below," "proficient," or "advanced." Educational units are given considerable latitude in defining the…
Descriptors: Federal Legislation, Academic Achievement, Test Items, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Feldt, Leonard S.; Kim, Seonghoon – Educational and Psychological Measurement, 2006
Researchers sometimes need a statistical test of the hypothesis that two values of Cronbach's alpha reliability coefficient are equal. The situation may involve scores from two different measures administered to independent random samples or from the same measure administered to random samples from two different populations. Feldt derived a test…
Descriptors: Individual Testing, Test Items, Sample Size, Scores
Peer reviewed Peer reviewed
Feldt, Leonard S. – Educational and Psychological Measurement, 1984
The binomial error model includes form-to-form difficulty differences as error variance and leads to Ruder-Richardson formula 21 as an estimate of reliability. If the form-to-form component is removed from the estimate of error variance, the binomial model leads to KR 20 as the reliability estimate. (Author/BW)
Descriptors: Achievement Tests, Difficulty Level, Error of Measurement, Mathematical Formulas
Peer reviewed Peer reviewed
Feldt, Leonard S. – Applied Measurement in Education, 1993
The recommendation that the reliability of multiple-choice tests will be enhanced if the distribution of item difficulties is concentrated at approximately 0.50 is reinforced and extended in this article by viewing the 0/1 item scoring as a dichotomization of an underlying normally distributed ability score. (SLD)
Descriptors: Ability, Difficulty Level, Guessing (Tests), Mathematical Models
Peer reviewed Peer reviewed
Alsawalmeh, Yousef M.; Feldt, Leonard S. – Psychometrika, 1994
A modification of a test of the equality of nonindependent alpha reliability coefficients is proposed. It avoids the limitation that the product of the number of test parts times the number of subjects be quite large. Monte Carlo studies indicate that this test can be used in comparing interrater reliabilities. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Equations (Mathematics), Interrater Reliability
PDF pending restoration PDF pending restoration
Gilmer, Jerry S.; Feldt, Leonard S. – 1982
The Feldt-Gilmer congeneric reliability coefficients make it possible to estimate the reliability of a test composed of parts of unequal, unknown length. The approximate standard errors of the Feldt-Gilmer coefficients are derived via a method using the multivariate Taylor's expansion. Monte Carlo simulation is employed to corroborate the…
Descriptors: Educational Testing, Error of Measurement, Mathematical Formulas, Mathematical Models
Pages: 1  |  2  |  3