NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 120 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Warrens, Matthijs J. – Psychometrika, 2012
The quadratically weighted kappa is the most commonly used weighted kappa statistic for summarizing interrater agreement on an ordinal scale. The paper presents several properties of the quadratically weighted kappa that are paradoxical. For agreement tables with an odd number of categories "n" it is shown that if one of the raters uses the same…
Descriptors: Interrater Reliability, Statistics, Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Kim, Seonghoon – Psychometrika, 2012
Assuming item parameters on a test are known constants, the reliability coefficient for item response theory (IRT) ability estimates is defined for a population of examinees in two different ways: as (a) the product-moment correlation between ability estimates on two parallel forms of a test and (b) the squared correlation between the true…
Descriptors: Reliability, Item Response Theory, Tests, Correlation
Peer reviewed Peer reviewed
Direct linkDirect link
Zhang, Jinming – Psychometrika, 2013
In some popular test designs (including computerized adaptive testing and multistage testing), many item pairs are not administered to any test takers, which may result in some complications during dimensionality analyses. In this paper, a modified DETECT index is proposed in order to perform dimensionality analyses for response data from such…
Descriptors: Adaptive Testing, Simulation, Computer Assisted Testing, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Jung, Kwanghee; Takane, Yoshio; Hwang, Heungsun; Woodward, Todd S. – Psychometrika, 2012
We propose a new method of structural equation modeling (SEM) for longitudinal and time series data, named Dynamic GSCA (Generalized Structured Component Analysis). The proposed method extends the original GSCA by incorporating a multivariate autoregressive model to account for the dynamic nature of data taken over time. Dynamic GSCA also…
Descriptors: Structural Equation Models, Longitudinal Studies, Data Analysis, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Bentler, Peter M. – Psychometrika, 2009
As pointed out by Sijtsma ("in press"), coefficient alpha is inappropriate as a single summary of the internal consistency of a composite score. Better estimators of internal consistency are available. In addition to those mentioned by Sijtsma, an old dimension-free coefficient and structural equation model based coefficients are…
Descriptors: Structural Equation Models, Reliability, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Revelle, William; Zinbarg, Richard E. – Psychometrika, 2009
There are three fundamental problems in Sijtsma ("Psychometrika," 2008): (1) contrary to the name, the glb is not the greatest lower bound of reliability but rather is systematically less than omega[subscript t] (McDonald, "Test theory: A unified treatment," Erlbaum, Hillsdale, 1999), (2) we agree with Sijtsma that when considering how well a test…
Descriptors: Test Theory, Computer Software, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Green, Samuel B.; Yang, Yanyun – Psychometrika, 2009
The general use of coefficient alpha to assess reliability should be discouraged on a number of grounds. The assumptions underlying coefficient alpha are unlikely to hold in practice, and violation of these assumptions can result in nontrivial negative or positive bias. Structural equation modeling was discussed as an informative process both to…
Descriptors: Structural Equation Models, Reliability, Bias
Peer reviewed Peer reviewed
Direct linkDirect link
Green, Samuel B.; Yang, Yanyun – Psychometrika, 2009
A method is presented for estimating reliability using structural equation modeling (SEM) that allows for nonlinearity between factors and item scores. Assuming the focus is on consistency of summed item scores, this method for estimating reliability is preferred to those based on linear SEM models and to the most commonly reported estimate of…
Descriptors: Structural Equation Models, Computation, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Broomell, Stephen B.; Budescu, David V. – Psychometrika, 2009
We derive an analytic model of the inter-judge correlation as a function of five underlying parameters. Inter-cue correlation and the number of cues capture our assumptions about the environment, while differentiations between cues, the weights attached to the cues, and (un)reliability describe assumptions about the judges. We study the relative…
Descriptors: Cues, Models, Expertise, Correlation
Peer reviewed Peer reviewed
Direct linkDirect link
Shin, Yongyun; Raudenbush, Stephen W. – Psychometrika, 2012
Social scientists are frequently interested in assessing the qualities of social settings such as classrooms, schools, neighborhoods, or day care centers. The most common procedure requires observers to rate social interactions within these settings on multiple items and then to combine the item responses to obtain a summary measure of setting…
Descriptors: Generalizability Theory, Neighborhoods, Intervals, Child Care Centers
Peer reviewed Peer reviewed
Direct linkDirect link
Kreiner, Svend; Christensen, Karl Bang – Psychometrika, 2011
In behavioural sciences, local dependence and DIF are common, and purification procedures that eliminate items with these weaknesses often result in short scales with poor reliability. Graphical loglinear Rasch models (Kreiner & Christensen, in "Statistical Methods for Quality of Life Studies," ed. by M. Mesbah, F.C. Cole & M.T.…
Descriptors: Evidence, Markov Processes, Quality of Life, Item Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Sijtsma, Klaas – Psychometrika, 2009
This discussion paper argues that both the use of Cronbach's alpha as a reliability estimate and as a measure of internal consistency suffer from major problems. First, alpha always has a value, which cannot be equal to the test score's reliability given the inter-item covariance matrix and the usual assumptions about measurement error. Second, in…
Descriptors: Measurement, Error of Measurement, Scores, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Yao, Lihua – Psychometrika, 2012
Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Sijtsma, Klaas – Psychometrika, 2009
The critical reactions of Bentler (2009, doi: 10.1007/s11336-008-9100-1), Green and Yang (2009a, doi: 10.1007/s11336-008-9098-4 ; 2009b, doi: 10.1007/s11336-008-9099-3), and Revelle and Zinbarg (2009, doi: 10.1007/s11336-008-9102-z) to Sijtsma's (2009, doi: 10.1007/s11336-008-9101-0) paper on Cronbach's alpha are addressed. The dissemination of…
Descriptors: Psychometrics, Reliability, Theory Practice Relationship, Structural Equation Models
Peer reviewed Peer reviewed
Direct linkDirect link
Warrens, Matthijs J. – Psychometrika, 2008
This paper studies correction for chance in coefficients that are linear functions of the observed proportion of agreement. The paper unifies and extends various results on correction for chance in the literature. A specific class of coefficients is used to illustrate the results derived in this paper. Coefficients in this class, e.g. the simple…
Descriptors: Interrater Reliability, Statistical Analysis, Generalization, Mathematical Concepts
Previous Page | Next Page ยป
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8