Showing all 12 results
Peer reviewed
Culpepper, Steven Andrew – Applied Psychological Measurement, 2013
A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…
Descriptors: Item Response Theory, Reliability, Scores, Error of Measurement
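As background for this entry, a minimal sketch of the classical test theory definition of reliability (true-score variance over observed-score variance) that the IRT comparison builds on; the data and names below are purely illustrative and are not taken from the article.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative only: reliability as true-score variance over observed-score variance.
n_persons = 10_000
true_scores = rng.normal(50, 10, n_persons)   # sigma_T = 10
errors = rng.normal(0, 5, n_persons)          # sigma_E = 5
observed = true_scores + errors               # X = T + E

# CTT reliability: rho = sigma_T^2 / (sigma_T^2 + sigma_E^2) = 100 / 125 = 0.80
rho = true_scores.var() / observed.var()
print(round(rho, 3))   # approximately 0.80
```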
Peer reviewed
Beauducel, Andre – Applied Psychological Measurement, 2013
The problem of factor score indeterminacy implies that the factor and the error scores cannot be completely disentangled in the factor model. It is therefore proposed to compute Harman's factor score predictor that contains an additive combination of factor and error variance. This additive combination is discussed in the framework of classical…
Descriptors: Factor Analysis, Predictor Variables, Reliability, Error of Measurement
Peer reviewed
Culpepper, Steven Andrew – Applied Psychological Measurement, 2012
Measurement error significantly biases interaction effects and distorts researchers' inferences regarding interactive hypotheses. This article focuses on the single-indicator case and shows how to accurately estimate group slope differences by disattenuating interaction effects with errors-in-variables (EIV) regression. New analytic findings were…
Descriptors: Evidence, Test Length, Interaction, Regression (Statistics)
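A minimal sketch, not the article's procedure, of the single-indicator disattenuation idea: with one fallible predictor, the observed regression slope is attenuated by the predictor's reliability, so dividing by that reliability (as errors-in-variables regression does in this simple case) recovers the true slope. The function name, data, and reliability value below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)

def disattenuated_slope(x_observed, y, reliability_x):
    """Correct a simple-regression slope for measurement error in x.

    With a single fallible predictor, the observed slope is attenuated by
    the predictor's reliability; dividing by it recovers the true slope.
    """
    b_observed = np.cov(x_observed, y)[0, 1] / np.var(x_observed, ddof=1)
    return b_observed / reliability_x

# Hypothetical data: true slope = 2, reliability of x about 0.8.
n = 50_000
true_x = rng.normal(0, 1, n)
x_obs = true_x + rng.normal(0, 0.5, n)   # error variance 0.25 -> reliability 1/1.25 = 0.8
y = 2.0 * true_x + rng.normal(0, 1, n)

print(round(disattenuated_slope(x_obs, y, reliability_x=0.8), 2))  # close to 2.0
```

For group slope differences, the same correction would be applied within each group before taking the difference between the corrected slopes.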
Peer reviewed
Wyse, Adam E.; Hao, Shiqi – Applied Psychological Measurement, 2012
This article introduces two new classification consistency indices that can be used when item response theory (IRT) models have been applied. The new indices are shown to be related to Rudner's classification accuracy index and Guo's classification accuracy index. The Rudner- and Guo-based classification accuracy and consistency indices are…
Descriptors: Item Response Theory, Classification, Accuracy, Reliability
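A rough sketch of the general idea behind Rudner-type accuracy indices (not the new indices proposed in this article): each examinee's ability estimate is treated as normally distributed around theta with its conditional standard error, and accuracy is the average probability of landing in the category actually assigned. The function name, cut scores, and values are hypothetical.

```python
import numpy as np
from scipy.stats import norm

def rudner_style_accuracy(theta_hat, se, cuts):
    """Rough sketch of a Rudner-type classification accuracy index.

    Assumes each estimate is normally distributed around theta with the
    reported standard error; averages, over examinees, the probability
    that the estimate falls in the category it was assigned to.
    """
    bounds = np.concatenate(([-np.inf], cuts, [np.inf]))
    assigned = np.searchsorted(cuts, theta_hat)   # observed category per examinee
    lower = bounds[assigned]
    upper = bounds[assigned + 1]
    p_correct = norm.cdf(upper, theta_hat, se) - norm.cdf(lower, theta_hat, se)
    return p_correct.mean()

# Hypothetical example: two cut scores defining three categories.
theta_hat = np.array([-1.2, -0.1, 0.4, 1.5])
se = np.array([0.30, 0.25, 0.25, 0.35])
print(round(rudner_style_accuracy(theta_hat, se, cuts=[-0.5, 1.0]), 3))
```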
Peer reviewed
Almehrizi, Rashid S. – Applied Psychological Measurement, 2013
The majority of large-scale assessments develop various score scales that are either linear or nonlinear transformations of raw scores for better interpretations and uses of assessment results. The current formula for coefficient alpha (α; the commonly used reliability coefficient) only provides internal consistency reliability estimates of raw…
Descriptors: Raw Scores, Scaling, Reliability, Computation
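For reference, the standard raw-score formula for coefficient alpha that this entry extends to scale scores; the extension itself is not reproduced here, and the example matrix is hypothetical.

```python
import numpy as np

def coefficient_alpha(item_scores):
    """Cronbach's coefficient alpha for a persons x items score matrix."""
    item_scores = np.asarray(item_scores, dtype=float)
    k = item_scores.shape[1]
    item_variances = item_scores.var(axis=0, ddof=1)
    total_variance = item_scores.sum(axis=1).var(ddof=1)
    return (k / (k - 1)) * (1 - item_variances.sum() / total_variance)

# Hypothetical 5-person, 4-item example.
scores = [[3, 4, 3, 4],
          [2, 2, 3, 2],
          [4, 5, 4, 4],
          [1, 2, 1, 2],
          [3, 3, 4, 3]]
print(round(coefficient_alpha(scores), 3))
```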
Peer reviewed
Lee, Won-Chan; Brennan, Robert L.; Wan, Lei – Applied Psychological Measurement, 2009
For a test that consists of dichotomously scored items, several approaches have been reported in the literature for estimating classification consistency and accuracy indices based on a single administration of a test. Classification consistency and accuracy have not been studied much, however, for "complex" assessments--for example,…
Descriptors: Classification, Reliability, Test Items, Scoring
Peer reviewed
Wise, Steven L.; DeMars, Christine E. – Applied Psychological Measurement, 2009
Attali (2005) recently demonstrated that Cronbach's coefficient alpha estimate of reliability for number-right multiple-choice tests will tend to be deflated by speededness, rather than inflated as is commonly believed and taught. Although the methods, findings, and conclusions of Attali (2005) are correct, his article may inadvertently invite a…
Descriptors: Guessing (Tests), Multiple Choice Tests, Test Reliability, Computation
Peer reviewed
Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert; Vangeneugden, Tony; Mallinckrodt, Craig H. – Applied Psychological Measurement, 2010
Longitudinal studies are permeating clinical trials in psychiatry. Therefore, it is of utmost importance to study the psychometric properties of rating scales, frequently used in these trials, within a longitudinal framework. However, intrasubject serial correlation and memory effects are problematic issues often encountered in longitudinal data.…
Descriptors: Psychiatry, Rating Scales, Memory, Psychometrics
Peer reviewed
Wang, Wen-Chung – Applied Psychological Measurement, 2008
Raju and Oshima (2005) proposed two prophecy formulas based on item response theory in order to predict the reliability of ability estimates for a test after change in its length. The first prophecy formula is equivalent to the classical Spearman-Brown prophecy formula. The second prophecy formula is misleading because of an underlying false…
Descriptors: Test Reliability, Item Response Theory, Computation, Evaluation Methods
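For reference, the classical Spearman-Brown prophecy formula that the first of Raju and Oshima's formulas reduces to; the example values are hypothetical.

```python
def spearman_brown(reliability, length_factor):
    """Classical Spearman-Brown prophecy: predicted reliability when a test's
    length is changed by `length_factor` (e.g., 2.0 doubles the test)."""
    return (length_factor * reliability) / (1 + (length_factor - 1) * reliability)

# If a 20-item test has reliability .70, doubling it to 40 items predicts:
print(round(spearman_brown(0.70, 2.0), 3))   # about 0.824
```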
Peer reviewed
Yi, Hyun Sook; Kim, Seonghoon; Brennan, Robert L. – Applied Psychological Measurement, 2007
Large-scale testing programs involving classification decisions typically have multiple forms available and conduct equating to ensure cut-score comparability across forms. A test developer might be interested in the extent to which an examinee who happens to take a particular form would have a consistent classification decision if he or she had…
Descriptors: Classification, Reliability, Indexes, Computation
Peer reviewed
Raju, Nambury S.; Lezotte, Daniel V.; Fearing, Benjamin K.; Oshima, T. C. – Applied Psychological Measurement, 2006
This note describes a procedure for estimating the range restriction component used in correcting correlations for unreliability and range restriction when an estimate of the reliability of a predictor is not readily available for the unrestricted sample. This procedure is illustrated with a few examples. (Contains 1 table.)
Descriptors: Correlation, Reliability, Predictor Variables, Error Correction
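For context, a sketch of the two standard corrections this note builds on: disattenuation for unreliability and Thorndike's Case II correction for direct range restriction. The note's procedure for estimating the range-restriction component itself is not reproduced; the values and the order of the corrections below are purely illustrative.

```python
import math

def correct_for_attenuation(r_xy, rel_x, rel_y):
    """Disattenuate an observed correlation for unreliability in x and y."""
    return r_xy / math.sqrt(rel_x * rel_y)

def correct_for_range_restriction(r_xy, sd_unrestricted, sd_restricted):
    """Thorndike Case II correction for direct range restriction on x."""
    u = sd_unrestricted / sd_restricted
    return (r_xy * u) / math.sqrt(1 + r_xy**2 * (u**2 - 1))

# Hypothetical values: observed r = .30 in a range-restricted sample.
r = correct_for_range_restriction(0.30, sd_unrestricted=10.0, sd_restricted=6.0)
r = correct_for_attenuation(r, rel_x=0.85, rel_y=0.80)
print(round(r, 3))
```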
Peer reviewed
Zinbarg, Richard E.; Yovel, Iftah; Revelle, William; McDonald, Roderick P. – Applied Psychological Measurement, 2006
The extent to which a scale score generalizes to a latent variable common to all of the scale's indicators is indexed by the scale's general factor saturation. Seven techniques for estimating this parameter--omega-hierarchical (ω_h)--are compared in a series of simulated data sets. Primary comparisons were based on 160 artificial…
Descriptors: Computation, Factor Analysis, Reliability, Correlation
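A minimal sketch of one way to compute omega-hierarchical from an orthogonal bifactor solution with standardized items (first loading column = general factor); the article compares seven estimation techniques, and this sketch is not any particular one of them. The loading pattern below is hypothetical.

```python
import numpy as np

def omega_hierarchical(loadings, uniquenesses):
    """General-factor saturation (omega_h) from an orthogonal bifactor solution.

    `loadings` is an items x factors matrix whose first column is the general
    factor; `uniquenesses` is the vector of unique (error) variances.
    """
    loadings = np.asarray(loadings, dtype=float)
    general = loadings[:, 0].sum() ** 2
    group = sum(loadings[:, j].sum() ** 2 for j in range(1, loadings.shape[1]))
    total_variance = general + group + np.sum(uniquenesses)
    return general / total_variance

# Hypothetical 6-item bifactor pattern: one general and two group factors.
L = [[0.6, 0.4, 0.0],
     [0.6, 0.4, 0.0],
     [0.6, 0.4, 0.0],
     [0.6, 0.0, 0.3],
     [0.6, 0.0, 0.3],
     [0.6, 0.0, 0.3]]
u = [1 - sum(l**2 for l in row) for row in L]   # standardized items
print(round(omega_hierarchical(L, u), 3))
```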