ERIC - Search Results

Showing 106 to 120 of 149 results

The Effect of Guessing on Item Reliability under Answer-Until-Correct Scoring

Peer reviewed

Kane, Michael; Moloney, James – Applied Psychological Measurement, 1978

The answer-until-correct (AUC) procedure requires that examinees respond to a multiple-choice item until they answer it correctly. Using a modified version of Horst's model for examinee behavior, this paper compares the effect of guessing on item reliability for the AUC procedure and the zero-one scoring procedure. (Author/CTM)

Descriptors: Guessing (Tests), Item Analysis, Mathematical Models, Multiple Choice Tests

Some Psychometric Properties of the Bem Sex-Role Inventory

Peer reviewed

Moreland, John R.; And Others – Applied Psychological Measurement, 1978

Four factor scores from the Bem Sex Role Inventory were derived from a factor analysis of college student responses and were compared with the original scales on a new sample of students. The factor scales were more internally consistent than those constructed by Bem. (Author/CTM)

Descriptors: Androgyny, Factor Analysis, Higher Education, Rating Scales

Evaluation of Implied Orders as a Basis for Tailored Testing with Simulation Data.

Peer reviewed

Cliff, Norman; And Others – Applied Psychological Measurement, 1979

Monte Carlo research with TAILOR, a program using implied orders as a basis for tailored testing, is reported. TAILOR typically required about half the available items to estimate, for each simulated examinee, the responses on the remainder. (Author/CTM)

Descriptors: Adaptive Testing, Computer Programs, Item Sampling, Nonparametric Statistics

A Paper-and-Pencil Inventory for the Assessment of Piaget's Tasks.

Peer reviewed

Patterson, Henry O.; Milakofsky, Louis – Applied Psychological Measurement, 1980

Adapting curricula to the cognitive developmental level of students has been hindered by the difficulty of assessing those levels in students. The reliability and validity of a paper-and-pencil Piagetian assessment are discussed. (Author/JKS)

Descriptors: Cognitive Development, Cognitive Measurement, Elementary Secondary Education, Grade 3

Measures for the Study of Maternal Teaching Strategies.

Peer reviewed

Laosa, Luis M. – Applied Psychological Measurement, 1980

A technique to measure maternal teaching strategies was developed for possible use in research and evaluation studies. Scores derived from the technique describe the quality and quantity of behaviors used by mothers to teach cognitive-perceptual tasks to their own young children. Reliability and validity data are presented. (Author/JKS)

Descriptors: Cultural Differences, Measurement Techniques, Mothers, Observation

Reliability of the Jesness Inventory.

Peer reviewed

Putnins, Aldis L. – Applied Psychological Measurement, 1980

A test-retest reliability study of the Jesness Inventory based on a group of 54 male adolescents (all probationers) and a study of recidivism among 145 probationers are reported. (CTM)

Descriptors: Adolescents, Delinquency, Followup Studies, Foreign Countries

Influence of Test and Person Characteristics on Nonparametric Appropriateness Measurement.

Peer reviewed

Meijer, Rob R.; And Others – Applied Psychological Measurement, 1994

The power of the nonparametric person-fit statistic, U3, is investigated through simulations as a function of item characteristics, test characteristics, person characteristics, and the group to which examinees belong. Results suggest conditions under which relatively short tests can be used for person-fit analysis. (SLD)

Descriptors: Difficulty Level, Group Membership, Item Response Theory, Nonparametric Statistics

Coefficients for Interrater Agreement.

Peer reviewed

Zegers, Frits E. – Applied Psychological Measurement, 1991

The degree of agreement between two raters rating several objects for a single characteristic can be expressed through an association coefficient, such as the Pearson product-moment correlation. How to select an appropriate association coefficient, and the desirable properties and uses of a class of such coefficients--the Euclidean…

Descriptors: Classification, Correlation, Data Interpretation, Equations (Mathematics)

Reliability of Measurement and Power of Significance Tests Based on Differences.

Peer reviewed

Zimmerman, Donald W.; And Others – Applied Psychological Measurement, 1993

Some of the methods originally used to find relationships between reliability and power associated with a single measurement are extended to difference scores. Results, based on explicit power calculations, show that augmenting the reliability of measurement by reducing error score variance can make significance tests of difference more powerful.…

Descriptors: Equations (Mathematics), Error of Measurement, Individual Differences, Mathematical Models

Further Comments on Reliability and Power of Significance Tests and Reliability, Power, Functions, and Relations: A Reply to Humphreys.

Peer reviewed

Humphreys, Lloyd G.; And Others – Applied Psychological Measurement, 1993

Two articles discuss the controversy about the relationship between reliability and the power of significance tests in response to the discussion of Donald W. Zimmerman, Richard H. Williams, and Bruno D. Zumbo. Lloyd G. Humphreys emphasizes the differences between what statisticians can do and constraints on researchers. Zimmerman, Williams, and…

Descriptors: Error of Measurement, Individual Differences, Power (Statistics), Research Methodology

Construction Strategies for Multiscale Personality Inventories

Peer reviewed

Burisch, Matthias – Applied Psychological Measurement, 1978

Sets of inventory scales were constructed from a common item pool, using variants of what are here called the Inductive, Deductive, and External strategies. Peer ratings for 21 traits served as criteria. Very little variation in validity was attributable to construction strategies. (Author/CTM)

Descriptors: Deduction, Foreign Countries, Higher Education, Induction

Person Reliability

Peer reviewed

Lumsden, James – Applied Psychological Measurement, 1977

Person changes can be of three kinds: developmental trends, swells, and tremors. Person unreliability in the tremor sense (momentary fluctuations) can be estimated from person characteristic curves. Average person reliability for groups can be compared from item characteristic curves. (Author)

Descriptors: Difficulty Level, Individual Characteristics, Individual Development, Individual Differences

Systematic Errors in Approximations to the Standard Error of Measurement and Reliability.

Peer reviewed

Kleinke, David J. – Applied Psychological Measurement, 1979

Lord's, Millman's and Saupe's methods of approximating the standard error of measurement are reviewed. Through an empirical demonstration involving 200 university classroom tests, all three approximations are shown to be biased. (Author/JKS)

Descriptors: Error of Measurement, Error Patterns, Higher Education, Mathematical Formulas

Multidimensional Computerized Adaptive Testing in a Certification or Licensure Context.

Peer reviewed

Luecht, Richard M. – Applied Psychological Measurement, 1996

The example of a medical licensure test is used to demonstrate situations in which complex, integrated content must be balanced at the total test level for validity reasons, but items assigned to reportable subscore categories may be used under a multidimensional item response theory adaptive paradigm to improve subscore reliability. (SLD)

Descriptors: Adaptive Testing, Certification, Computer Assisted Testing, Licensing Examinations (Professions)

Recovery of Marginal Maximum Likelihood Estimates in the Two-Parameter Logistic Response Model: An Evaluation of MULTILOG.

Peer reviewed

Stone, Clement A. – Applied Psychological Measurement, 1992

Monte Carlo methods are used to evaluate marginal maximum likelihood estimation of item parameters and maximum likelihood estimates of theta in the two-parameter logistic model for varying test lengths, sample sizes, and assumed theta distributions. Results with 100 datasets demonstrate the methods' general precision and stability. Exceptions are…

Descriptors: Computer Software Evaluation, Estimation (Mathematics), Mathematical Models, Maximum Likelihood Statistics
