Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 1 |
Descriptor
Comparative Analysis | 9 |
Test Reliability | 7 |
Mathematical Models | 3 |
Measurement | 3 |
Equations (Mathematics) | 2 |
Error of Measurement | 2 |
Sampling | 2 |
Simulation | 2 |
Test Interpretation | 2 |
Testing Problems | 2 |
True Scores | 2 |
More ▼ |
Source
Psychometrika | 9 |
Author
Publication Type
Journal Articles | 3 |
Reports - Evaluative | 2 |
Reports - Research | 1 |
Education Level
Audience
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Yao, Lihua – Psychometrika, 2012
Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
Descriptors: Item Banks, Test Length, Simulation, Adaptive Testing

Pandey, Tej N.; Hubert, Lawrence – Psychometrika, 1975
Use of Tukey's Jackknife in establishing a confidence interval around the population coefficient alpha is explored and the robustness of Feldt's procedure along with ten variants of the Jackknife when the data do not conform to the necessary normality requirements are evaluated. Only two of the variants compared to Feldt's approach. (RC)
Descriptors: Comparative Analysis, Measurement Techniques, Sampling, Statistical Bias

Jackson, Paul H.; Agunwamba, Christian C. – Psychometrika, 1977
Finding and interpreting lower bounds for reliability coefficients for tests with nonhomogenous items has been a problem for psychometricians. This paper presents a mathematical formula for finding the greatest lower bound for such a coefficient. (Author/JKS)
Descriptors: Comparative Analysis, Mathematical Models, Measurement, Test Interpretation

Rubin, Donald B.; Thayer, Dorothy – Psychometrika, 1978
A procedure is developed for estimating correlations among new tests when non-overlapping sub-samples each are administered a different new test and all sub-samples are administered a set of standard tests. (JKS)
Descriptors: Comparative Analysis, Correlation, Measurement, Standardized Tests

Schroeder, Marsha L.; Hakstian, A. Ralph – Psychometrika, 1990
A 2-facet measurement model is identified, and its coefficient of generalizability (CG) is examined. Three other multifaceted measurement models and their CGs are identified. An empirical investigation of all four procedures is conducted using data from a study of the psychopathology of 71 prison inmates. (SLD)
Descriptors: Comparative Analysis, Equations (Mathematics), Generalizability Theory, Mathematical Models

Hakstian, A. Ralph; Whalen, Thomas E. – Psychometrika, 1976
Details of a reasonably precise normalization technique for coefficient alpha are outlined, along with methods for estimating the variance of the normalized statistic. These procedures lead to the K-sample significance test. (RC)
Descriptors: Analysis of Variance, Comparative Analysis, Error Patterns, Hypothesis Testing

Hunter, John E.; Cohen, Stanley H. – Psychometrika, 1974
Descriptors: Attitude Change, Attitudes, Comparative Analysis, Models

Brennan, Robert L.; Kane, Michael T. – Psychometrika, 1977
Using the assumption of randomly parallel tests and concepts from generalizability theory, three signal/noise ratios for domain-referenced tests are developed, discussed, and compared. The three ratios have the same noise but different signals depending upon the kind of decision to be made as a result of measurement. (Author/JKS)
Descriptors: Comparative Analysis, Criterion Referenced Tests, Error of Measurement, Mathematical Models

Alsawalmeh, Yousef M.; Feldt, Leonard S. – Psychometrika, 1994
A modification of a test of the equality of nonindependent alpha reliability coefficients is proposed. It avoids the limitation that the product of the number of test parts times the number of subjects be quite large. Monte Carlo studies indicate that this test can be used in comparing interrater reliabilities. (SLD)
Descriptors: Comparative Analysis, Computer Simulation, Equations (Mathematics), Interrater Reliability