ERIC - Search Results

Showing 1 to 15 of 120 results

Some Paradoxical Results for the Quadratically Weighted Kappa

Peer reviewed

Warrens, Matthijs J. – Psychometrika, 2012

The quadratically weighted kappa is the most commonly used weighted kappa statistic for summarizing interrater agreement on an ordinal scale. The paper presents several properties of the quadratically weighted kappa that are paradoxical. For agreement tables with an odd number of categories "n" it is shown that if one of the raters uses the same…

Descriptors: Interrater Reliability, Statistics, Measurement
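
For readers unfamiliar with the statistic named in the abstract above, the following is a minimal sketch of the standard textbook form of the quadratically weighted kappa. It is not taken from the paper: the function name `quadratic_weighted_kappa` and the input layout (a square table of counts, rows for one rater and columns for the other) are assumptions made for illustration.

```python
def quadratic_weighted_kappa(table):
    """Quadratically weighted kappa for a square agreement table of counts.

    table[i][j] is the number of subjects placed in category i by the
    first rater and in category j by the second rater (hypothetical layout).
    """
    n = len(table)
    total = sum(sum(row) for row in table)

    # Convert counts to proportions and compute the marginal distributions.
    p = [[cell / total for cell in row] for row in table]
    row_marg = [sum(p[i]) for i in range(n)]
    col_marg = [sum(p[i][j] for i in range(n)) for j in range(n)]

    # Quadratic agreement weights: 1 on the diagonal, 0 at maximal distance.
    def weight(i, j):
        return 1.0 - (i - j) ** 2 / (n - 1) ** 2

    # Weighted observed agreement and weighted chance-expected agreement.
    p_obs = sum(weight(i, j) * p[i][j] for i in range(n) for j in range(n))
    p_exp = sum(weight(i, j) * row_marg[i] * col_marg[j]
                for i in range(n) for j in range(n))

    return (p_obs - p_exp) / (1.0 - p_exp)
```

The sketch assumes at least two categories and a table in which chance-expected agreement is below 1; it does no input validation.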

A Note on the Reliability Coefficients for Item Response Model-Based Ability Estimates

Peer reviewed

Kim, Seonghoon – Psychometrika, 2012

Assuming item parameters on a test are known constants, the reliability coefficient for item response theory (IRT) ability estimates is defined for a population of examinees in two different ways: as (a) the product-moment correlation between ability estimates on two parallel forms of a test and (b) the squared correlation between the true…

Descriptors: Reliability, Item Response Theory, Tests, Correlation

A Procedure for Dimensionality Analyses of Response Data from Various Test Designs

Peer reviewed

Zhang, Jinming – Psychometrika, 2013

In some popular test designs (including computerized adaptive testing and multistage testing), many item pairs are not administered to any test takers, which may result in some complications during dimensionality analyses. In this paper, a modified DETECT index is proposed in order to perform dimensionality analyses for response data from such…

Descriptors: Adaptive Testing, Simulation, Computer Assisted Testing, Test Reliability

Dynamic GSCA (Generalized Structured Component Analysis) with Applications to the Analysis of Effective Connectivity in Functional Neuroimaging Data

Peer reviewed

Jung, Kwanghee; Takane, Yoshio; Hwang, Heungsun; Woodward, Todd S. – Psychometrika, 2012

We propose a new method of structural equation modeling (SEM) for longitudinal and time series data, named Dynamic GSCA (Generalized Structured Component Analysis). The proposed method extends the original GSCA by incorporating a multivariate autoregressive model to account for the dynamic nature of data taken over time. Dynamic GSCA also…

Descriptors: Structural Equation Models, Longitudinal Studies, Data Analysis, Reliability

Alpha, Dimension-Free, and Model-Based Internal Consistency Reliability

Peer reviewed

Bentler, Peter M. – Psychometrika, 2009

As pointed out by Sijtsma ("in press"), coefficient alpha is inappropriate as a single summary of the internal consistency of a composite score. Better estimators of internal consistency are available. In addition to those mentioned by Sijtsma, an old dimension-free coefficient and structural equation model based coefficients are…

Descriptors: Structural Equation Models, Reliability, Psychometrics

Coefficients Alpha, Beta, Omega, and the glb: Comments on Sijtsma

Peer reviewed

Revelle, William; Zinbarg, Richard E. – Psychometrika, 2009

There are three fundamental problems in Sijtsma ("Psychometrika," 2008): (1) contrary to the name, the glb is not the greatest lower bound of reliability but rather is systematically less than omega_t (McDonald, "Test theory: A unified treatment," Erlbaum, Hillsdale, 1999), (2) we agree with Sijtsma that when considering how well a test…

Descriptors: Test Theory, Computer Software, Reliability

Commentary on Coefficient Alpha: A Cautionary Tale

Peer reviewed

Green, Samuel B.; Yang, Yanyun – Psychometrika, 2009

The general use of coefficient alpha to assess reliability should be discouraged on a number of grounds. The assumptions underlying coefficient alpha are unlikely to hold in practice, and violation of these assumptions can result in nontrivial negative or positive bias. Structural equation modeling was discussed as an informative process both to…

Descriptors: Structural Equation Models, Reliability, Bias

Reliability of Summed Item Scores Using Structural Equation Modeling: An Alternative to Coefficient Alpha

Peer reviewed

Green, Samuel B.; Yang, Yanyun – Psychometrika, 2009

A method is presented for estimating reliability using structural equation modeling (SEM) that allows for nonlinearity between factors and item scores. Assuming the focus is on consistency of summed item scores, this method for estimating reliability is preferred to those based on linear SEM models and to the most commonly reported estimate of…

Descriptors: Structural Equation Models, Computation, Reliability

Why Are Experts Correlated? Decomposing Correlations between Judges

Peer reviewed

Broomell, Stephen B.; Budescu, David V. – Psychometrika, 2009

We derive an analytic model of the inter-judge correlation as a function of five underlying parameters. Inter-cue correlation and the number of cues capture our assumptions about the environment, while differentiations between cues, the weights attached to the cues, and (un)reliability describe assumptions about the judges. We study the relative…

Descriptors: Cues, Models, Expertise, Correlation

Confidence Bounds and Power for the Reliability of Observational Measures on the Quality of a Social Setting

Peer reviewed

Shin, Yongyun; Raudenbush, Stephen W. – Psychometrika, 2012

Social scientists are frequently interested in assessing the qualities of social settings such as classrooms, schools, neighborhoods, or day care centers. The most common procedure requires observers to rate social interactions within these settings on multiple items and then to combine the item responses to obtain a summary measure of setting…

Descriptors: Generalizability Theory, Neighborhoods, Intervals, Child Care Centers

Item Screening in Graphical Loglinear Rasch Models

Peer reviewed

Kreiner, Svend; Christensen, Karl Bang – Psychometrika, 2011

In behavioural sciences, local dependence and DIF are common, and purification procedures that eliminate items with these weaknesses often result in short scales with poor reliability. Graphical loglinear Rasch models (Kreiner & Christensen, in "Statistical Methods for Quality of Life Studies," ed. by M. Mesbah, F.C. Cole & M.T.…

Descriptors: Evidence, Markov Processes, Quality of Life, Item Analysis

On the Use, the Misuse, and the Very Limited Usefulness of Cronbach's Alpha

Peer reviewed

Sijtsma, Klaas – Psychometrika, 2009

This discussion paper argues that both the use of Cronbach's alpha as a reliability estimate and as a measure of internal consistency suffer from major problems. First, alpha always has a value, which cannot be equal to the test score's reliability given the inter-item covariance matrix and the usual assumptions about measurement error. Second, in…

Descriptors: Measurement, Error of Measurement, Scores, Computation
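
As background to the debate summarized in this and the neighboring entries, here is a minimal sketch of how coefficient alpha is conventionally computed from item scores. It illustrates the usual formula only and is not drawn from any of the listed papers; the function name `cronbach_alpha` and the respondents-by-items input layout are assumptions.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Coefficient alpha for a respondents-by-items score matrix.

    scores is a list of respondents, each a list of k item scores
    (hypothetical layout; k must be at least 2).
    """
    k = len(scores[0])

    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(k)]

    # Sample variance of the summed (total) score.
    total_var = variance([sum(row) for row in scores])

    # alpha = k / (k - 1) * (1 - sum of item variances / total variance)
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

The sketch assumes complete data and a total score with nonzero variance.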

Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores: Theory and Applications

Peer reviewed

Yao, Lihua – Psychometrika, 2012

Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…

Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing

Reliability beyond Theory and into Practice

Peer reviewed

Sijtsma, Klaas – Psychometrika, 2009

The critical reactions of Bentler (2009, doi: 10.1007/s11336-008-9100-1), Green and Yang (2009a, doi: 10.1007/s11336-008-9098-4; 2009b, doi: 10.1007/s11336-008-9099-3), and Revelle and Zinbarg (2009, doi: 10.1007/s11336-008-9102-z) to Sijtsma's (2009, doi: 10.1007/s11336-008-9101-0) paper on Cronbach's alpha are addressed. The dissemination of…

Descriptors: Psychometrics, Reliability, Theory Practice Relationship, Structural Equation Models

Peer reviewed

Warrens, Matthijs J. – Psychometrika, 2008

This paper studies correction for chance in coefficients that are linear functions of the observed proportion of agreement. The paper unifies and extends various results on correction for chance in the literature. A specific class of coefficients is used to illustrate the results derived in this paper. Coefficients in this class, e.g. the simple…

Descriptors: Interrater Reliability, Statistical Analysis, Generalization, Mathematical Concepts

