ERIC - Search Results

Publication Date

In 2025	1
Since 2024	8
Since 2021 (last 5 years)	10
Since 2016 (last 10 years)	19
Since 2006 (last 20 years)	33

Descriptor

Response Style (Tests)	171
Test Reliability	171
Test Validity	81
Test Construction	40
Multiple Choice Tests	37
Higher Education	33
Testing Problems	30
Item Analysis	27
Test Items	25
Guessing (Tests)	22
Scoring Formulas	22
Measurement Techniques	21
Testing	21
Statistical Analysis	19
Scoring	18
Responses	17
Comparative Analysis	16
Test Bias	16
Test Interpretation	16
Achievement Tests	15
Correlation	15
Questionnaires	15
Factor Analysis	14
Rating Scales	14
College Students	13
More ▼

Publication Type

Reports - Research	97
Journal Articles	57
Speeches/Meeting Papers	19
Reports - Evaluative	6
Information Analyses	4
Tests/Questionnaires	4
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Numerical/Quantitative Data	1
Reference Materials -…	1
Reports - Descriptive	1
More ▼

Education Level

Higher Education	10
Postsecondary Education	7
Secondary Education	2
Elementary Education	1
Elementary Secondary Education	1
Grade 4	1
Grade 6	1
Intermediate Grades	1
Middle Schools	1

Audience

Practitioners	5
Researchers	5
Administrators	1
Counselors	1
Teachers	1

Location

Australia	1
Fiji	1
Germany	1
Greece	1
Israel	1
Poland	1
Sweden	1
Texas	1
United Kingdom	1

Laws, Policies, & Programs

What Works Clearinghouse Rating

Test Reliability X

Showing 91 to 105 of 171 results Save | Export

The Effect of a Scoring System Based on the Algorithm Underlying the Students' Response Patterns on the Dimensionality of Achievement Test Data of the Problem Solving Type.

Peer reviewed

Birenbaum, Menucha; Fatsuoka, Kikumi K. – Journal of Educational Measurement, 1983

The outcomes of two scoring methods (one based on an error analysis and the second on a conventional method) on free-response tests, compared in terms of reliability and dimensionality, indicates the conventional method is inferior in both aspects. (Author/PN)

Descriptors: Achievement Tests, Algorithms, Data, Junior High Schools

The Relationship Between Verbal-Meaning Test Scores and Degree of Confidence in Item Responses

Peer reviewed

Wen, Shih-Sung – Journal of Educational Measurement, 1975

The relationship between students' scores on a verbal meaning test and their degrees of confidence in item responses was investigated. Subjects were black undergraduate students and they were administered a verbal meaning test by following a confidence testing procedure. (Author/BJG)

Descriptors: Blacks, Confidence Testing, Higher Education, Language Skills

Changing the Focus of Response in Assessing Classroom Learning Environments.

Download full text

Edwards, Keith J.; And Others – 1973

Four selected scales from the Learning Environment Inventory (LEI) were rewritten to measure the students' individual perceptions of their classroom environment, rather than their estimates of the opinions of the class as a whole. Both scales were then administered to 10 7th grade math classes and 4 10th grade social studies classes. The rewritten…

Descriptors: Classroom Environment, Grade 10, Grade 7, Individual Differences

Item Instability on the Piers-Harris Children's Self-Concept Scale for Academic Underachievers with High, Middle and Low Self-Concepts: Implications for Construct Validity

Peer reviewed

Smith, Monte D.; Rogers, Carl M. – Educational and Psychological Measurement, 1977

Test-retest item instability indices for low, middle, and high scores on the Piers-Harris Children's Self-Concept Scale were calculated in order to test the hypothesis that low scores are invalid because of unreliability of responding. The hypothesis was not supported. (Author/JKS)

Descriptors: Elementary Education, Elementary School Students, Item Analysis, Response Style (Tests)

Individuality of Item Interpretation in Interchangeable ACL Scales

Peer reviewed

Fiske, Donald W.; Barack, Leonard I. – Educational and Psychological Measurement, 1976

The diversity among interpretations of single items in personality questionnaires has been noted previously. Using adjectives from the Adjective Check List (ACL), the study sought evidence bearing on these questions: Does such diversity make the responses to an item not comparable across subjects? If so, what are the implications for scores based…

Descriptors: Adjectives, Check Lists, Individual Differences, Item Analysis

The Role of Dissimulation and Social Desirability in the Measurement of Moral Reasoning.

Peer reviewed

Meehan, Kenneth A.; And Others – Journal of Research in Personality, 1979

Three studies investigating the psychometric and conceptual properties of the self-report Survey of Ethical Attitudes inventory indicated that the scale is clearly susceptible to response dissimulation through role playing and impression management and is also confounded with sources of stylistic variance in the form of social desirability.…

Descriptors: College Students, Moral Development, Moral Values, Personality Theories

Development and Validation of the Scientist-Practitioner Inventory for Psychology.

Peer reviewed

Leong, Frederick T. L.; Zachar, Peter – Journal of Counseling Psychology, 1991

Presents three studies on development of Scientist-Practitioner Inventory (SPI) designed to measure career specialty interests of psychology students. Reports factorial validity of scales, test-retest reliability, freedom from response-set biases, and construct validity; cross-validation evidence of second-order factor structure, internal…

Descriptors: Career Choice, College Students, Factor Structure, Higher Education

Effects of a Confidence Weighted Scoring System on Measures of Test Reliability and Validity

Peer reviewed

Pugh, Richard C.; Brunza, J. Jay – Educational and Psychological Measurement, 1975

Descriptors: Analysis of Variance, Confidence Testing, Multiple Choice Tests, Personality

A Basic Test Theory Generalizable to Tailored Testing. Technical Report No. 1.

Download full text

Cliff, Norman – 1975

Measures of consistency and completeness of order relations derived from test-type data are proposed. The measures are generalized to apply to incomplete data such as tailored testing. The measures are based on consideration of the items-plus-persons by items-plus-persons matrix as an adjacency matrix in which a 1 means that the row element…

Descriptors: Adaptive Testing, Career Development, Computer Oriented Programs, Individual Differences

An Empirical Investigation of the Stability and Accuracy of Flexilevel Tests.

Download full text

Kocher, A. Thel – 1974

The purpose of the present study was to empirically investigate the stability and accuracy of one suggested method for matching test difficulty to examinee ability level. Students' answers to traditional classroom tests were rescored by computer as if the examinations had been flexilevel tests. The scores thus obtained were found to correlate…

Descriptors: Ability, Comparative Analysis, Computer Oriented Programs, Educational Testing

The Ineffectiveness of Multiple True-False Test Items

Peer reviewed

Ebel, Robert L. – Educational and Psychological Measurement, 1978

A multiple true-false item is one where a testee has to identify statements as true or false within a cluster (of two or more) of such statements. Clusters are then scored as items. This study showed such a procedure to yield less reliable results than traditional true-false items. (JKS)

Descriptors: Guessing (Tests), Higher Education, Item Analysis, Multiple Choice Tests

Measuring Responding Desirably with Attitude-Opinion Items

Schuessler, Karl; And Others – Social Psychology, 1978

The feasibility of measuring responding desirably with attitude-opinion items is discussed, and an index based on 16 such items is presented. Estimates of reliability and validity for this index, and examples of its use as a covariate (control) in attitude research are presented. Similarities and differences from related scales are discussed.…

Descriptors: Adults, Attitude Measures, Measurement Techniques, Response Style (Tests)

Multiple Choice Converted to True-False: Comparative Reliabilities and Validities.

Download full text

Green, Kathy – 1978

Forty three-option multiple choice (MC) statements on a midterm examination were converted to 120 true-false (TF) statements, identical in content. Test forms (MC and TF) were randomly administered to 50 undergraduates, to investigate the validity and internal consistency reliability of the two forms. A Kuder-Richardson formula 20 reliability was…

Descriptors: Achievement Tests, Comparative Testing, Higher Education, Multiple Choice Tests

The Use of Ratio Production Scales to Assess Quality of Teaching Performance.

PDF pending restoration

Feitler, Fred C.; Graf, Stephen A. – 1978

Two forms of a teacher rating questionnaire, Student Reaction to Instruction, were administered to college students. The regular format used category scaling; the 631 responding students selected a number between one and five. Experimental "ratio production (multiply-divide)" evaluations were also completed by 26 subjects along with the…

Descriptors: College Faculty, Comparative Testing, Higher Education, Rating Scales

An Investigation of a Scoring Procedure Designed to Eliminate Score Variance Due to Guessing in Multiple-Choice Tests.

Download full text

Cross, Lawrence H. – 1975

A novel scoring procedure was investigated in order to obtain scores from a conventional multiple-choice test that would be free of the guessing component or contain a known guessing component even though examinees were permitted to guess at will. Scores computed with the experimental procedure are based not only on the number of items answered…

Descriptors: Algebra, Comparative Analysis, Guessing (Tests), High Schools

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12

Educational and Psychological…	23
Journal of Educational…	13
Applied Psychological…	6
International Journal of…	3
Assessment	2
Evaluation and the Health…	2
Grantee Submission	2
Journal of Educational and…	2
Measurement and Evaluation in…	2
Psychology: A Quarterly…	2
AERA Online Paper Repository	1
American Educational Research…	1
American Journal of Family…	1
Assessment in Education:…	1
British Journal of…	1
British Journal of Psychology	1
CBE - Life Sciences Education	1
Child Abuse and Neglect: The…	1
Child Development	1
Decision Sciences Journal of…	1
Educational Psychology	1
Educational Research Quarterly	1
Educational Researcher	1
Evaluation Review	1
Florida Vocational Journal	1
More ▼

Adkins, Dorothy C.	3
Crocker, Linda	3
Cross, Lawrence H.	3
Fiske, Donald W.	3
Albanese, Mark A.	2
Ballif, Bonnie L.	2
Benson, Jeri	2
Betz, Nancy E.	2
Garvin, Alfred D.	2
Hakstian, A. Ralph	2
Hanna, Gerald S.	2
Kane, Michael T.	2
Kansup, Wanlop	2
Kuncel, Ruth Boutin	2
Moloney, James M.	2
Tracy, D. B.	2
Waters, Brian K.	2
Weiss, David J.	2
Adrian Adams	1
Aleksandra Gajda	1
Allen, Patricia J.	1
Allison, Howard K., II	1
Alweis, Richard L.	1
Arnold Ewing, Theresa D.	1
More ▼

California Psychological…	2
Minnesota Multiphasic…	2
SAT (College Admission Test)	2
ACT Assessment	1
Adaptive Behavior Scale	1
Adjective Check List	1
Boehm Test of Basic Concepts	1
Canfield Learning Styles…	1
Child Abuse Potential…	1
Edwards Personal Preference…	1
Eysenck Personality Inventory	1
Graduate Management Admission…	1
Gregorc Style Delineator	1
Holland Vocational Preference…	1
Law School Admission Test	1
Learning Environment Inventory	1
Marlowe Crowne Social…	1
Matching Familiar Figures Test	1
Metropolitan Achievement Tests	1
Minnesota Importance…	1
Myers Briggs Type Indicator	1
Personal Orientation Inventory	1
Personality Research Form	1
Piers Harris Childrens Self…	1
Progress in International…	1
More ▼