Cuttance, Peter F. – 1982
Covariance structure modelling is applied to the problem of estimating reliability and measurement error in survey data. To provide a basis for grouping certain question or variable types (data from questions), a simple typology based on the formal characteristics of the questions is outlined. From this classification, models for the different…
Descriptors: Analysis of Covariance, Correlation, Educational Research, Error of Measurement
Crehan, Kevin D.; And Others – 1977
Longitudinal studies of test wiseness (TW) were conducted to determine: (1) the relationship between TW and grade level, (2) the relationship between TW and sex, and (3) the stability of TW. Aspects of TW observed included stem cue and specific determiner identification and usage and the elimination of similar and absurd options. Subjects were…
Descriptors: Elementary Secondary Education, High School Students, Instructional Program Divisions, Longitudinal Studies
Donlon, Thomas F. – 1975
This study empirically determined the optimizing weight to be applied to the Wrongs Total Score in scoring rubrics of the general form S = R - kW, where S is the Score, R the Rights Total, k the weight and W the Wrongs Total, if reliability is to be maximized. As is well known, the traditional formula score rests on a theoretical framework which is…
Descriptors: Achievement Tests, Comparative Analysis, Guessing (Tests), Multiple Choice Tests
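Illustrative note: the scoring form in the Donlon (1975) abstract, S = R - kW, can be sketched in a few lines. This is a minimal sketch, not the study's procedure. The function names are hypothetical, and the weight k = 1/(number of options - 1) is the classical correction-for-guessing value, assumed here for illustration; the abstract does not report the weight the study found to maximize reliability.

```python
def formula_score(rights: int, wrongs: int, k: float) -> float:
    """Return the formula score S = R - k*W (hypothetical helper)."""
    return rights - k * wrongs


def traditional_weight(n_options: int) -> float:
    """Classical correction-for-guessing weight, k = 1 / (n_options - 1).

    Assumption: every item has the same number of answer options.
    """
    if n_options < 2:
        raise ValueError("an item needs at least two options")
    return 1.0 / (n_options - 1)


if __name__ == "__main__":
    # Five-option items: k = 0.25; 30 rights and 8 wrongs give 30 - 0.25 * 8.
    k = traditional_weight(5)
    print(formula_score(rights=30, wrongs=8, k=k))  # 28.0
```

Omitted items do not enter the score in this form; only the rights and wrongs totals do.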
Betz, Nancy E.; Weiss, David J. – 1975
A 40-item flexilevel test and a 40-item conventional test were compared using data obtained through (1) computer-administration of the two tests to three groups of college students, and (2) Monte Carlo simulation of test response patterns. Results indicated the flexilevel score distribution better reflected the underlying normal distribution of…
Descriptors: Ability, College Students, Comparative Analysis, Computer Oriented Programs
Sabers, Darrell; And Others – 1974
Likert-type inventories of children's attitude toward the world of work were administered to approximately 1,500 students in grades 2-8. The positively and negatively stated scores were analyzed separately. Agreement with positively stated items showed almost no difference among the various grades. However, the trend toward more disagreement among…
Descriptors: Age Differences, Attitude Measures, Elementary School Students, Junior High School Students

Wilcox, Rand R. – Journal of Experimental Education, 1982
A closed sequential procedure for estimating true score is proposed for use with answer-until-correct tests. The accuracy of determining true score is the same as in conventional sequential solutions, but the possibility of using an unnecessarily large number of items is eliminated. (Author/CM)
Descriptors: Answer Sheets, Guessing (Tests), Item Banks, Measurement Techniques

Mann, Irene T.; And Others – Applied Psychological Measurement, 1979
Several methodological problems (particularly the assumed bipolarity of scales, instructions regarding use of the midpoint, and concept-scale interaction) which may contribute to a lack of precision in the semantic differential technique were investigated. Results generally supported the use of the semantic differential. (Author/JKS)
Descriptors: Analysis of Variance, Computer Assisted Testing, Higher Education, Rating Scales

Harasym, P. H.; And Others – Evaluation and the Health Professions, 1980
Coded items, as opposed to free-response items, in a multiple-choice physiology test had a cueing effect that raised students' scores, especially for lower achievers. Reliability of coded items was also lower. Item format and scoring method had an effect on test results. (GDC)
Descriptors: Achievement Tests, Comparative Testing, Cues, Higher Education
Hall, John D.; Ashley, Donna M.; Bramlett, Ronald K.; Dielmann, Kim B.; Murphy, John J. – Journal of Applied School Psychology, 2005
This study examined effects of negative versus positive symptom formats on the assessment and subsequent classification of ADHD in children in public schools. Symptoms associated with the disorder based on the Diagnostic and Statistical Manual of Mental Disorders Fourth Edition (DSM-IV) were presented to parents and teachers of referred children…
Descriptors: Response Style (Tests), Attention Deficit Disorders, Classification, Hyperactivity
Chissom, Brad; Chukabarah, Prince C. O. – 1985
The comparative effects of various sequences of test items were examined for over 900 graduate students enrolled in an educational research course at The University of Alabama, Tuscaloosa. The experiment, which was conducted a total of four times using four separate tests, presented three different arrangements of 50 multiple-choice items: (1)…
Descriptors: Analysis of Variance, Comparative Testing, Difficulty Level, Graduate Students
Weber, Margaret B. – 1977
The effects of different choice formats on the reliability of teacher-made tests were examined for high and low achievers. The first study examined the effect of 3 and 5 choice items on the reliability of dichotomously scored teacher-made tests. The second study examined the effect of 3 and 4 choice items on the reliability of similarly designed…
Descriptors: Academic Achievement, Achievement Tests, Guessing (Tests), High Achievement
Schurr, K. Terry; Henriksen, L. W. – 1980
Five questionnaire forms containing 61 items specifying potential inservice topics for public school teachers were sent to a stratified random sample of Indiana public school administrators and curriculum supervisors. The five forms differed in that, for two forms, the items were ungrouped and appeared in different orders; and, for three forms,…
Descriptors: Administrator Attitudes, Correlation, Elementary Secondary Education, Factor Analysis

Poggio, John P.; Funk, Patricia E. – 1977
Effects of response mode, response format, perceived response style, sex, and anonymity on raw scale scores and extreme response tendencies for two distinct measures of affective behavior were investigated. The two measures were a self-acceptance scale developed by J.R. Phillips and the Machiavellianism IV scale. Where (response mode) and how…
Descriptors: Adults, Affective Behavior, Affective Measures, Analysis of Variance
Waters, Brian K. – 1975
This study empirically investigated the validity and utility of the stratified adaptive computerized testing model (stradaptive) developed by Weiss (1973). The model presents a tailored testing strategy based on Binet IQ measurement theory and Lord's (1972) modern test theory. Nationally normed School and College Ability Test Verbal analogy items…
Descriptors: Ability, Adaptive Testing, Branching, Comparative Analysis

Fiske, Donald W.; Kuncel, Ruth Boutin
After taking a personality test, subjects reported their reactions to being tested. Reactions were diverse, even in the same subject. Free responses to 10 questions were coded into 16 categories within five broad groups. Desire for information about the test and about self, and criticism of testing were very prevalent; criticisms of self and…
Descriptors: Affective Measures, Attitude Measures, Catalogs, Evaluation Methods