NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 91 to 105 of 171 results Save | Export
Peer reviewed Peer reviewed
Birenbaum, Menucha; Fatsuoka, Kikumi K. – Journal of Educational Measurement, 1983
The outcomes of two scoring methods (one based on an error analysis and the second on a conventional method) on free-response tests, compared in terms of reliability and dimensionality, indicates the conventional method is inferior in both aspects. (Author/PN)
Descriptors: Achievement Tests, Algorithms, Data, Junior High Schools
Peer reviewed Peer reviewed
Wen, Shih-Sung – Journal of Educational Measurement, 1975
The relationship between students' scores on a verbal meaning test and their degrees of confidence in item responses was investigated. Subjects were black undergraduate students and they were administered a verbal meaning test by following a confidence testing procedure. (Author/BJG)
Descriptors: Blacks, Confidence Testing, Higher Education, Language Skills
Edwards, Keith J.; And Others – 1973
Four selected scales from the Learning Environment Inventory (LEI) were rewritten to measure the students' individual perceptions of their classroom environment, rather than their estimates of the opinions of the class as a whole. Both scales were then administered to 10 7th grade math classes and 4 10th grade social studies classes. The rewritten…
Descriptors: Classroom Environment, Grade 10, Grade 7, Individual Differences
Peer reviewed Peer reviewed
Smith, Monte D.; Rogers, Carl M. – Educational and Psychological Measurement, 1977
Test-retest item instability indices for low, middle, and high scores on the Piers-Harris Children's Self-Concept Scale were calculated in order to test the hypothesis that low scores are invalid because of unreliability of responding. The hypothesis was not supported. (Author/JKS)
Descriptors: Elementary Education, Elementary School Students, Item Analysis, Response Style (Tests)
Peer reviewed Peer reviewed
Fiske, Donald W.; Barack, Leonard I. – Educational and Psychological Measurement, 1976
The diversity among interpretations of single items in personality questionnaires has been noted previously. Using adjectives from the Adjective Check List (ACL), the study sought evidence bearing on these questions: Does such diversity make the responses to an item not comparable across subjects? If so, what are the implications for scores based…
Descriptors: Adjectives, Check Lists, Individual Differences, Item Analysis
Peer reviewed Peer reviewed
Meehan, Kenneth A.; And Others – Journal of Research in Personality, 1979
Three studies investigating the psychometric and conceptual properties of the self-report Survey of Ethical Attitudes inventory indicated that the scale is clearly susceptible to response dissimulation through role playing and impression management and is also confounded with sources of stylistic variance in the form of social desirability.…
Descriptors: College Students, Moral Development, Moral Values, Personality Theories
Peer reviewed Peer reviewed
Leong, Frederick T. L.; Zachar, Peter – Journal of Counseling Psychology, 1991
Presents three studies on development of Scientist-Practitioner Inventory (SPI) designed to measure career specialty interests of psychology students. Reports factorial validity of scales, test-retest reliability, freedom from response-set biases, and construct validity; cross-validation evidence of second-order factor structure, internal…
Descriptors: Career Choice, College Students, Factor Structure, Higher Education
Peer reviewed Peer reviewed
Pugh, Richard C.; Brunza, J. Jay – Educational and Psychological Measurement, 1975
Descriptors: Analysis of Variance, Confidence Testing, Multiple Choice Tests, Personality
Cliff, Norman – 1975
Measures of consistency and completeness of order relations derived from test-type data are proposed. The measures are generalized to apply to incomplete data such as tailored testing. The measures are based on consideration of the items-plus-persons by items-plus-persons matrix as an adjacency matrix in which a 1 means that the row element…
Descriptors: Adaptive Testing, Career Development, Computer Oriented Programs, Individual Differences
Kocher, A. Thel – 1974
The purpose of the present study was to empirically investigate the stability and accuracy of one suggested method for matching test difficulty to examinee ability level. Students' answers to traditional classroom tests were rescored by computer as if the examinations had been flexilevel tests. The scores thus obtained were found to correlate…
Descriptors: Ability, Comparative Analysis, Computer Oriented Programs, Educational Testing
Peer reviewed Peer reviewed
Ebel, Robert L. – Educational and Psychological Measurement, 1978
A multiple true-false item is one where a testee has to identify statements as true or false within a cluster (of two or more) of such statements. Clusters are then scored as items. This study showed such a procedure to yield less reliable results than traditional true-false items. (JKS)
Descriptors: Guessing (Tests), Higher Education, Item Analysis, Multiple Choice Tests
Schuessler, Karl; And Others – Social Psychology, 1978
The feasibility of measuring responding desirably with attitude-opinion items is discussed, and an index based on 16 such items is presented. Estimates of reliability and validity for this index, and examples of its use as a covariate (control) in attitude research are presented. Similarities and differences from related scales are discussed.…
Descriptors: Adults, Attitude Measures, Measurement Techniques, Response Style (Tests)
Green, Kathy – 1978
Forty three-option multiple choice (MC) statements on a midterm examination were converted to 120 true-false (TF) statements, identical in content. Test forms (MC and TF) were randomly administered to 50 undergraduates, to investigate the validity and internal consistency reliability of the two forms. A Kuder-Richardson formula 20 reliability was…
Descriptors: Achievement Tests, Comparative Testing, Higher Education, Multiple Choice Tests
PDF pending restoration PDF pending restoration
Feitler, Fred C.; Graf, Stephen A. – 1978
Two forms of a teacher rating questionnaire, Student Reaction to Instruction, were administered to college students. The regular format used category scaling; the 631 responding students selected a number between one and five. Experimental "ratio production (multiply-divide)" evaluations were also completed by 26 subjects along with the…
Descriptors: College Faculty, Comparative Testing, Higher Education, Rating Scales
Cross, Lawrence H. – 1975
A novel scoring procedure was investigated in order to obtain scores from a conventional multiple-choice test that would be free of the guessing component or contain a known guessing component even though examinees were permitted to guess at will. Scores computed with the experimental procedure are based not only on the number of items answered…
Descriptors: Algebra, Comparative Analysis, Guessing (Tests), High Schools
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  12