Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 29 |
Descriptor
Source
Applied Psychological… | 149 |
Author
Publication Type
Education Level
Higher Education | 5 |
Postsecondary Education | 3 |
Early Childhood Education | 1 |
Elementary Education | 1 |
Grade 2 | 1 |
High Schools | 1 |
Primary Education | 1 |
Secondary Education | 1 |
Audience
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating

Kane, Michael; Moloney, James – Applied Psychological Measurement, 1978
The answer-until-correct (AUC) procedure requires that examinees respond to a multi-choice item until they answer it correctly. Using a modified version of Horst's model for examinee behavior, this paper compares the effect of guessing on item reliability for the AUC procedure and the zero-one scoring procedure. (Author/CTM)
Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests

Moreland, John R.; And Others – Applied Psychological Measurement, 1978
Four factor scores from the Bem Sex Role Inventory were derived from a factor analysis of college student responses and were compared with the original scales on a new sample of students. The factor scales were more internally consistent than those constructed by Bem. (Author/CTM)
Descriptors: Androgyny, Factor Analysis, Higher Education, Rating Scales

Cliff, Norman; And Others – Applied Psychological Measurement, 1979
Monte Carlo research with TAILOR, a program using implied orders as a basis for tailored testing, is reported. TAILOR typically required about half the available items to estimate, for each simulated examinee, the responses on the remainder. (Author/CTM)
Descriptors: Adaptive Testing, Computer Programs, Item Sampling, Nonparametric Statistics

Patterson, Henry O,; Milakofsky, Louis – Applied Psychological Measurement, 1980
Adapting curricula to the cognitive developmental level of students has been hindered by the difficulty of assessing those levels in students. The reliability and validity of a paper-and-pencil Piagetian assessment are discussed. (Author/ JKS)
Descriptors: Cognitive Development, Cognitive Measurement, Elementary Secondary Education, Grade 3

Laosa, Luis M. – Applied Psychological Measurement, 1980
A technique to measure maternal teaching strategies was developed for possible use in research and evaluation studies. Scores derived from the technique describe quality and quanitity of behaviors used by mothers to teach cognitive-perceptual tasks to their own young children. Reliability and validity data are presented. (Author/JKS)
Descriptors: Cultural Differences, Measurement Techniques, Mothers, Observation

Putnins, Aldis L. – Applied Psychological Measurement, 1980
A test-retest reliability study of the Jesness Inventory based on a group of 54 male adolescents (all probationers) and a study of recidivism among 145 probationers are reported. (CTM)
Descriptors: Adolescents, Delinquency, Followup Studies, Foreign Countries

Meijer, Rob R.; And Others – Applied Psychological Measurement, 1994
The power of the nonparametric person-fit statistic, U3, is investigated through simulations as a function of item characteristics, test characteristics, person characteristics, and the group to which examinees belong. Results suggest conditions under which relatively short tests can be used for person-fit analysis. (SLD)
Descriptors: Difficulty Level, Group Membership, Item Response Theory, Nonparametric Statistics

Zegers, Frits E. – Applied Psychological Measurement, 1991
The degree of agreement between two raters rating several objects for a single characteristic can be expressed through an association coefficient, such as the Pearson product-moment correlation. How to select an appropriate association coefficient, and the desirable properties and uses of a class of such coefficients--the Euclidean…
Descriptors: Classification, Correlation, Data Interpretation, Equations (Mathematics)

Zimmerman, Donald W.; And Others – Applied Psychological Measurement, 1993
Some of the methods originally used to find relationships between reliability and power associated with a single measurement are extended to difference scores. Results, based on explicit power calculations, show that augmenting the reliability of measurement by reducing error score variance can make significance tests of difference more powerful.…
Descriptors: Equations (Mathematics), Error of Measurement, Individual Differences, Mathematical Models

Humphreys, Lloyd G.; And Others – Applied Psychological Measurement, 1993
Two articles discuss the controversy about the relationship between reliability and the power of significance tests in response to the discussion of Donald W. Zimmerman, Richard H. Williams, and Bruno D. Zumbo. Lloyd G. Humphreys emphasizes the differences between what statisticians can do and constraints on researchers. Zimmerman, Williams, and…
Descriptors: Error of Measurement, Individual Differences, Power (Statistics), Research Methodology

Burisch, Matthias – Applied Psychological Measurement, 1978
Sets of inventory scales were constructed from a common item pool, using variants of what are here called the Inductive, Deductive, and External strategies. Peer ratings for 21 traits served as criteria. Very little variation in validity was attributable to construction strategies. (Author/CTM)
Descriptors: Deduction, Foreign Countries, Higher Education, Induction

Lumsden, James – Applied Psychological Measurement, 1977
Person changes can be of three kinds: developmental trends, swells, and tremors. Person unreliability in the tremor sense (momentary fluctuations) can be estimated from person characteristic curves. Average person reliability for groups can be compared from item characteristic curves. (Author)
Descriptors: Difficulty Level, Individual Characteristics, Individual Development, Individual Differences

Kleinke, David J. – Applied Psychological Measurement, 1979
Lord's, Millman's and Saupe's methods of approximating the standard error of measurement are reviewed. Through an empirical demonstration involving 200 university classroom tests, all three approximations are shown to be biased. (Author/JKS)
Descriptors: Error of Measurement, Error Patterns, Higher Education, Mathematical Formulas

Luecht, Richard M. – Applied Psychological Measurement, 1996
The example of a medical licensure test is used to demonstrate situations in which complex, integrated content must be balanced at the total test level for validity reasons, but items assigned to reportable subscore categories may be used under a multidimensional item response theory adaptive paradigm to improve subscore reliability. (SLD)
Descriptors: Adaptive Testing, Certification, Computer Assisted Testing, Licensing Examinations (Professions)

Stone, Clement A. – Applied Psychological Measurement, 1992
Monte Carlo methods are used to evaluate marginal maximum likelihood estimation of item parameters and maximum likelihood estimates of theta in the two-parameter logistic model for varying test lengths, sample sizes, and assumed theta distributions. Results with 100 datasets demonstrate the methods' general precision and stability. Exceptions are…
Descriptors: Computer Software Evaluation, Estimation (Mathematics), Mathematical Models, Maximum Likelihood Statistics