Meijer, Rob R.; And Others – Applied Psychological Measurement, 1995
Three methods based on the nonparametric item response theory (IRT) of R. J. Mokken for the estimation of the reliability of single dichotomous test items are discussed. Analytical and Monte Carlo studies show that one method, designated "MS," is superior because of smaller bias and smaller sampling variance. (SLD)
Descriptors: Estimation (Mathematics), Item Response Theory, Monte Carlo Methods, Nonparametric Statistics

Schulz, E. Matthew; Kolen, Michael J.; Nicewander, W. Alan – Applied Psychological Measurement, 1999
Developed a procedure for defining achievement levels on continuous scales using aspects of Guttman scaling (L. Guttman, 1950) and Item Response Theory. Using data from high school mathematics tests for about 6,000 students, found the new procedure to have higher reliability, higher classification consistency, and lower classification error than…
Descriptors: Academic Achievement, Classification, Estimation (Mathematics), High School Students

Davison, Mark L.; Robbins, Stephen – Applied Psychological Measurement, 1978
Empirically weighted scores for Rest's Defining Issues Test were found to be more reliable than the simple sum of scores, the theoretically weighted sum, or Rest's p scores. They also had slightly higher correlations with Kohlberg's interview scores. Empirically weighted scores also showed more significant change in two longitudinal studies. (CTM)
Descriptors: Higher Education, Longitudinal Studies, Moral Development, Moral Values
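The empirical-weighting idea above can be sketched as ordinary least-squares item weights regressed on a criterion. This is a minimal illustration with hypothetical toy data, not the actual Defining Issues Test items or the authors' exact procedure; the two-item closed-form solve stands in for a general regression.

```python
def empirical_weights_2(x1, x2, y):
    """OLS weights for two (centered) item scores predicting a criterion,
    via the explicit 2x2 normal-equations solution."""
    def dot(a, b):
        return sum(p * q for p, q in zip(a, b))
    def center(v):
        m = sum(v) / len(v)
        return [p - m for p in v]
    x1, x2, y = center(x1), center(x2), center(y)
    s11, s22, s12 = dot(x1, x1), dot(x2, x2), dot(x1, x2)
    s1y, s2y = dot(x1, y), dot(x2, y)
    det = s11 * s22 - s12 * s12
    return ((s22 * s1y - s12 * s2y) / det,
            (s11 * s2y - s12 * s1y) / det)

# Toy data generated as y = 2*x1 + 0.5*x2, so OLS recovers those weights
# exactly; a unit-weighted sum (w1 = w2 = 1) would fit the criterion worse.
w1, w2 = empirical_weights_2([1, 2, 3, 4], [2, 1, 4, 3], [3.0, 4.5, 8.0, 9.5])
print(round(w1, 3), round(w2, 3))  # 2.0 0.5
```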

Poizner, Sharon B.; And Others – Applied Psychological Measurement, 1978
Binary, probability, and ordinal scoring procedures for multiple-choice items were examined. In two situations, it was found that both the probability and ordinal scoring systems were more reliable than the binary scoring method. (Author/CTM)
Descriptors: Confidence Testing, Guessing (Tests), Higher Education, Multiple Choice Tests

Lunneborg, Clifford E. – Applied Psychological Measurement, 1977
Three studies are described in which choice reaction time (RT) was related to such psychometric ability measures as verbal comprehension, numerical reasoning, hidden figures, and progressive matrices tests. Fairly consistent negative correlations were found between these tests and choice RT when high school samples were used. (Author/CTM)
Descriptors: Cognitive Ability, Cognitive Processes, High Schools, Higher Education

Hambleton, Ronald K., Ed. – Applied Psychological Measurement, 1980
This special issue covers recent technical developments in the field of criterion-referenced testing. An introduction, six papers, and two commentaries dealing with test development, test score uses, and evaluation of scores review relevant literature, offer new models and/or results, and suggest directions for additional research. (SLD)
Descriptors: Criterion Referenced Tests, Mastery Tests, Measurement Techniques, Standard Setting (Scoring)

Frederiksen, Norman; Ward, William C. – Applied Psychological Measurement, 1978
A set of Tests of Scientific Thinking was developed for possible use as criterion measures in research on creativity. Scores on the tests describe both quality and quantity of ideas produced in formulating hypotheses, evaluating proposals, solving methodological problems, and devising methods for measuring constructs. (Author/CTM)
Descriptors: Creativity Tests, Higher Education, Item Sampling, Predictive Validity

Schmeck, Ronald Ray; And Others – Applied Psychological Measurement, 1977
Five studies are presented describing the development of a self-report inventory for measuring individual differences in learning processes. Factor analysis of items yielded four scales: Synthesis-Analysis, Study Methods, Fact Retention, and Elaborative Processing. There were no sex differences, and the scales demonstrated acceptable reliabilities…
Descriptors: Factor Analysis, Higher Education, Learning Processes, Retention (Psychology)

Hsu, Louis M. – Applied Psychological Measurement, 1979
A comparison of the relative ordering power of separate and grouped-items true-false tests indicated that neither type of test was uniformly superior to the other across all levels of knowledge of examinees. Grouped-item tests were found superior for examinees with low levels of knowledge. (Author/CTM)
Descriptors: Academic Ability, Knowledge Level, Multiple Choice Tests, Scores

Woodruff, David J.; Sawyer, Richard L. – Applied Psychological Measurement, 1989
Two methods--non-distributional and normal--are derived for estimating measures of pass-fail reliability. Both are based on the Spearman-Brown formula and require only a single test administration. Results from a simulation (n=20,000 examinees) and a licensure examination (n=4,828 examinees) illustrate these methods. (SLD)
Descriptors: Equations (Mathematics), Estimation (Mathematics), Licensing Examinations (Professions), Measures (Individuals)
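The Spearman-Brown formula the abstract builds on can be sketched directly; this shows only the classical step-up from a split-half correlation, not the authors' non-distributional or normal pass-fail methods, whose details are not given here.

```python
def spearman_brown(rho: float, k: float) -> float:
    """Predicted reliability when test length is multiplied by k,
    given current reliability rho: k*rho / (1 + (k - 1)*rho)."""
    return k * rho / (1 + (k - 1) * rho)

# Split-half use: correlate two half-tests from a single administration,
# then step up to full length (k = 2).
half_test_correlation = 0.60
full_test_reliability = spearman_brown(half_test_correlation, k=2)
print(round(full_test_reliability, 3))  # 0.75
```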

Luecht, Richard M.; Hirsch, Thomas M. – Applied Psychological Measurement, 1992
Derivations of several item selection algorithms for use in fitting test items to target information functions (IFs) are described. These algorithms, which use an average growth approximation of target IFs, were tested by generating six test forms and were found to provide reliable fit. (SLD)
Descriptors: Algorithms, Computer Assisted Testing, Equations (Mathematics), Goodness of Fit
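Fitting items to a target information function can be sketched as greedy selection over 2PL item information. This is a generic illustration with made-up item parameters, not the average-growth-approximation algorithms the article derives; only the 2PL information formula, a^2 * P * (1 - P), is standard.

```python
import math

def info_2pl(a, b, theta):
    """Fisher information of a 2PL item at ability theta: a^2 * P * (1 - P)."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)

def greedy_select(items, target, thetas, n_items):
    """Greedily add the item that most reduces the squared shortfall
    from the target information function at the given theta points."""
    selected, current = [], [0.0] * len(thetas)
    remaining = list(items)
    for _ in range(n_items):
        def shortfall(it):
            return sum(max(t - (c + info_2pl(it[0], it[1], th)), 0.0) ** 2
                       for t, c, th in zip(target, current, thetas))
        best = min(remaining, key=shortfall)
        remaining.remove(best)
        selected.append(best)
        current = [c + info_2pl(best[0], best[1], th)
                   for c, th in zip(current, thetas)]
    return selected

# Toy pool of (a, b) item parameters and a flat target of 1.0 at three thetas.
pool = [(1.2, -1.0), (0.8, 0.0), (1.5, 0.5), (1.0, 1.2), (0.6, -0.5)]
picked = greedy_select(pool, target=[1.0, 1.0, 1.0],
                       thetas=[-1.0, 0.0, 1.0], n_items=3)
print(len(picked))  # 3
```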

Levin, Joel R.; Subkoviak, Michael J. – Applied Psychological Measurement, 1977
Textbook calculations of statistical power and sample size follow from formulas that assume the variables under consideration are measured without error. However, in the real world of behavioral research, errors of measurement cannot be neglected. The determination of sample size is discussed, and an example illustrates a blocking strategy.…
Descriptors: Analysis of Covariance, Analysis of Variance, Error of Measurement, Hypothesis Testing
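The point above can be made numerically with the standard attenuation argument: error in the outcome measure shrinks the observed standardized effect by the square root of the reliability, inflating the required sample size. This sketch uses the textbook two-sample z approximation, not the article's worked example or its blocking strategy.

```python
def attenuated_d(d_true: float, reliability: float) -> float:
    """Observed standardized mean difference when the outcome is measured
    with the given reliability (measurement error inflates the observed SD)."""
    return d_true * reliability ** 0.5

def n_per_group(d: float, z_alpha: float = 1.96, z_beta: float = 0.84) -> float:
    """Approximate n per group for a two-sided two-sample z test
    at alpha = .05 with power = .80."""
    return 2 * ((z_alpha + z_beta) / d) ** 2

# Required n grows as reliability falls, even though the true effect is fixed.
d_true = 0.5
for rho in (1.0, 0.8, 0.6):
    print(rho, round(n_per_group(attenuated_d(d_true, rho))))
```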

Rozeboom, William W. – Applied Psychological Measurement, 1989
Formulas are provided for estimating the reliability of a linear composite of non-equivalent subtests given the reliabilities of component subtests. The reliability of the composite is compared to that of its components. An empirical example uses data from 170 children aged 4 through 8 years performing 34 Piagetian tasks. (SLD)
Descriptors: Elementary School Students, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
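One standard formula for the reliability of a unit-weighted composite from component reliabilities can be sketched as follows; it is offered as an illustration of the general idea, not necessarily the exact formulas Rozeboom derives, and the two-subtest numbers are invented.

```python
def composite_reliability(cov, reliabilities):
    """Reliability of the unit-weighted sum of subtests, given the subtest
    covariance matrix and each subtest's reliability:
    rho_X = 1 - sum(var_i * (1 - rho_i)) / var(total)."""
    variances = [cov[i][i] for i in range(len(cov))]
    error_var = sum(v * (1 - r) for v, r in zip(variances, reliabilities))
    total_var = sum(sum(row) for row in cov)
    return 1 - error_var / total_var

# Toy example: two subtests with variances 4 and 9, covariance 3,
# and reliabilities .70 and .80. The composite exceeds both components.
cov = [[4.0, 3.0], [3.0, 9.0]]
rel = composite_reliability(cov, [0.7, 0.8])
print(round(rel, 3))  # 0.842
```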

Chang, Lei – Applied Psychological Measurement, 1994
Reliability and validity of 4-point and 6-point scales were assessed using a new model-based approach to fit empirical data from 165 graduate students completing an attitude measure. Results suggest that the issue of four- versus six-point scales may depend on the empirical setting. (SLD)
Descriptors: Attitude Measures, Goodness of Fit, Graduate Students, Graduate Study

Eiting, Mindert H. – Applied Psychological Measurement, 1991
A method is proposed for sequential evaluation of the reliability of psychometric instruments. Sample size is not fixed in advance; a test statistic is computed after each person is sampled, and a decision is made at each stage of the sampling process. Results from a series of Monte Carlo experiments establish the method's efficiency. (SLD)
Descriptors: Computer Simulation, Equations (Mathematics), Estimation (Mathematics), Mathematical Models
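The sequential idea above can be sketched generically: sample one person at a time, recompute a reliability statistic, and stop as soon as it crosses a decision bound. The abstract does not specify Eiting's test statistic or bounds, so this sketch substitutes Cronbach's alpha with simple accept/reject thresholds and simulated data; it illustrates the sampling scheme, not the proposed method itself.

```python
import random

def cronbach_alpha(data):
    """Cronbach's alpha for rows = persons, columns = items."""
    n_items = len(data[0])
    cols = list(zip(*data))
    def var(xs):
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)
    item_var = sum(var(c) for c in cols)
    total_var = var([sum(row) for row in data])
    return n_items / (n_items - 1) * (1 - item_var / total_var)

def sequential_check(stream, accept=0.8, reject=0.5, min_n=30, max_n=500):
    """Sample one person at a time; after each, recompute alpha and stop
    as soon as it clears the accept bound or falls below the reject bound."""
    sample = []
    for person in stream:
        sample.append(person)
        if len(sample) < min_n:
            continue
        alpha = cronbach_alpha(sample)
        if alpha >= accept:
            return "accept", len(sample), alpha
        if alpha < reject:
            return "reject", len(sample), alpha
        if len(sample) >= max_n:
            return "undecided", len(sample), alpha
    return "undecided", len(sample), cronbach_alpha(sample)

# Toy stream: five items driven by a common trait plus noise.
random.seed(1)
def person():
    t = random.gauss(0, 1)
    return [t + random.gauss(0, 0.7) for _ in range(5)]

decision, n, alpha = sequential_check(person() for _ in range(500))
print(decision, n)
```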