Warrens, Matthijs J. – Psychometrika, 2012
The quadratically weighted kappa is the most commonly used weighted kappa statistic for summarizing interrater agreement on an ordinal scale. The paper presents several properties of the quadratically weighted kappa that are paradoxical. For agreement tables with an odd number of categories "n" it is shown that if one of the raters uses the same…
Descriptors: Interrater Reliability, Statistics, Measurement
Kim, Seonghoon – Psychometrika, 2012
Assuming item parameters on a test are known constants, the reliability coefficient for item response theory (IRT) ability estimates is defined for a population of examinees in two different ways: as (a) the product-moment correlation between ability estimates on two parallel forms of a test and (b) the squared correlation between the true…
Descriptors: Reliability, Item Response Theory, Tests, Correlation
Bentler, Peter M. – Psychometrika, 2009
As pointed out by Sijtsma ("in press"), coefficient alpha is inappropriate as a single summary of the internal consistency of a composite score. Better estimators of internal consistency are available. In addition to those mentioned by Sijtsma, an old dimension-free coefficient and structural equation model based coefficients are…
Descriptors: Structural Equation Models, Reliability, Psychometrics
Revelle, William; Zinbarg, Richard E. – Psychometrika, 2009
There are three fundamental problems in Sijtsma ("Psychometrika," 2008): (1) contrary to the name, the glb is not the greatest lower bound of reliability but rather is systematically less than omega[subscript t] (McDonald, "Test theory: A unified treatment," Erlbaum, Hillsdale, 1999), (2) we agree with Sijtsma that when considering how well a test…
Descriptors: Test Theory, Computer Software, Reliability
Green, Samuel B.; Yang, Yanyun – Psychometrika, 2009
A method is presented for estimating reliability using structural equation modeling (SEM) that allows for nonlinearity between factors and item scores. Assuming the focus is on consistency of summed item scores, this method for estimating reliability is preferred to those based on linear SEM models and to the most commonly reported estimate of…
Descriptors: Structural Equation Models, Computation, Reliability
Sijtsma, Klaas – Psychometrika, 2009
This discussion paper argues that both the use of Cronbach's alpha as a reliability estimate and as a measure of internal consistency suffer from major problems. First, alpha always has a value, which cannot be equal to the test score's reliability given the inter-item covariance matrix and the usual assumptions about measurement error. Second, in…
Descriptors: Measurement, Error of Measurement, Scores, Computation
Sijtsma, Klaas – Psychometrika, 2009
The critical reactions of Bentler (2009, doi: 10.1007/s11336-008-9100-1), Green and Yang (2009a, doi: 10.1007/s11336-008-9098-4; 2009b, doi: 10.1007/s11336-008-9099-3), and Revelle and Zinbarg (2009, doi: 10.1007/s11336-008-9102-z) to Sijtsma's (2009, doi: 10.1007/s11336-008-9101-0) paper on Cronbach's alpha are addressed. The dissemination of…
Descriptors: Psychometrics, Reliability, Theory Practice Relationship, Structural Equation Models
Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert – Psychometrika, 2007
A new measure for reliability of a rating scale is introduced, based on the classical definition of reliability, as the ratio of the true score variance and the total variance. Clinical trial data can be employed to estimate the reliability of the scale in use, whenever repeated measurements are taken. The reliability is estimated from the…
Descriptors: Schizophrenia, Rating Scales, Likert Scales, True Scores

ten Berge, Jos M. F.; Hofstee, Willem K. B. – Psychometrika, 1999
H. Kaiser (1992) has shown that the sum of coefficients alpha of a set of principal components does not change when the components are transformed by an orthogonal rotation. In this paper, the rotational invariance and the successive alpha-optimality are integrated and generalized in a simultaneous approach. (SLD)
Descriptors: Factor Structure, Orthogonal Rotation, Reliability
Schuster, Christof; Smith, David A. – Psychometrika, 2005
The rater agreement literature is complicated by the fact that it must accommodate at least two different properties of rating data: the number of raters (two versus more than two) and the rating scale level (nominal versus metric). While kappa statistics are most widely used for nominal scales, intraclass correlation coefficients have been…
Descriptors: Psychometrics, Statistics, Rating Scales, Correlation

Shapiro, Alexander; ten Berge, Jos M. F. – Psychometrika, 2000
Discusses sampling bias problems in the use of the greatest lower bound (g.l.b.) to reliability and offers explicit expressions for the second-order derivatives. This yields a closed-form expression for the asymptotic bias of both the g.l.b. and its numerator. Illustrates the approach through a numeric example. (SLD)
Descriptors: Equations (Mathematics), Factor Analysis, Reliability, Sampling

van Zyl, J. M.; Neudecker, H.; Nel, D. G. – Psychometrika, 2000
Derives the asymptotic normal distribution of the maximum likelihood estimator of Cronbach's alpha (under normality) for the case when no assumptions are made about the covariances among items. Also considers the asymptotic distribution for the special case of compound symmetry and compares it with the exact distribution. (Author/SLD)
Descriptors: Equations (Mathematics), Maximum Likelihood Statistics, Reliability, Statistical Distributions
ten Berge, Jos M. F.; Socan, Gregor – Psychometrika, 2004
To assess the reliability of congeneric tests, specifically designed reliability measures have been proposed. This paper emphasizes that such measures rely on a unidimensionality hypothesis, which can neither be confirmed nor rejected when there are only three test parts, and will invariably be rejected when there are more than three test parts.…
Descriptors: Test Reliability, Sampling, Psychometrics, Test Bias
MacCann, Robert G. – Psychometrika, 2004
For (0, 1) scored multiple-choice tests, a formula giving test reliability as a function of the number of item options is derived, assuming the "knowledge or random guessing model," the parallelism of the new and old tests (apart from the guessing probability), and the assumptions of classical test theory. It is shown that the formula is a more…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Reliability, Test Theory
Galindo-Garre, Francisca; Vermunt, Jeroen K. – Psychometrika, 2004
This paper presents a row-column (RC) association model in which the estimated row and column scores are forced to be in agreement with a priori specified ordering. Two efficient algorithms for finding the order-restricted maximum likelihood (ML) estimates are proposed and their reliability under different degrees of association is investigated by…
Descriptors: Mathematics, Test Reliability, Computation, Testing