ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Higher Education	44
Test Bias	44
Test Reliability	38
Test Validity	24
Test Construction	14
Evaluation Methods	11
College Entrance Examinations	10
Scores	9
Standardized Tests	9
Testing Problems	9
Test Interpretation	7
Psychometrics	6
Student Evaluation	6
Test Items	6
Educational Assessment	5
Foreign Countries	5
Achievement Tests	4
Admission Criteria	4
Blacks	4
College Students	4
Comparative Analysis	4
Elementary Secondary Education	4
Grading	4
Interrater Reliability	4
Item Analysis	4
More ▼

Publication Type

Reports - Research	20
Journal Articles	19
Opinion Papers	7
Speeches/Meeting Papers	7
Reports - Descriptive	6
Information Analyses	4
Books	2
Collected Works - Proceedings	2
Collected Works - Serials	2
Reports - Evaluative	2
Collected Works - General	1
Guides - General	1
Guides - Non-Classroom	1
Reference Materials -…	1
More ▼

Education Level

Higher Education	6
Postsecondary Education	2
Secondary Education	1
Two Year Colleges	1

Audience

Researchers	3
Practitioners	2
Administrators	1
Policymakers	1
Students	1
Teachers	1

Location

Israel	2
California	1
China	1

Laws, Policies, & Programs

Bakke v Regents of University…

Assessments and Surveys

SAT (College Admission Test)	3
Beck Depression Inventory	1
Defining Issues Test	1
General Aptitude Test Battery	1
Graduate Management Admission…	1
Graduate Record Examinations	1
Students Evaluation of…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 44 results Save | Export

Excellence Bias Related to Rating Scales with Summative Jury Assessment

Peer reviewed

Direct link

Corradi, David – Assessment & Evaluation in Higher Education, 2023

Juries are a high-stake practice in higher education to assess complex competencies. However common, research remains behind in detailing the psychometric qualities of juries, especially when using rubrics or rating scales as an assessment tool. In this study, I analyze a case of a jury assessment (N = 191) of product development where both…

Descriptors: Court Litigation, Educational Practices, Higher Education, Rating Scales

Rethinking SETs: Retuning Student Evaluations of Teaching for Student Agency

Peer reviewed

Direct link

Ray, Brian; Babb, Jacob; Wooten, Courtney Adams – Composition Studies, 2018

Student evaluations of teaching (SETs) are frequently used to assess college teachers. However, education research has shown that there is potential for bias in SETs, especially based on instructor variables. Aside from Amy Dayton's 2015 work on assessment that advises using SETs only in concert with other measures, English studies scholars have…

Descriptors: Student Evaluation of Teacher Performance, Teacher Evaluation, Educational History, Test Bias

Pilot Testing the Chinese Version of the ETS® Proficiency Profile Critical Thinking Test. Research Report. ETS RR-16-37

Peer reviewed
PDF on ERIC

Download full text

Liu, Ou Lydia; Mao, Liyang; Zhao, Tingting; Yang, Yi; Xu, Jun; Wang, Zhen – ETS Research Report Series, 2016

Chinese higher education is experiencing rapid development and growth. With tremendous resources invested in higher education, policy makers have requested more direct evidence of student learning. However, assessment tools that can be used to measure college-level learning are scarce in China. To mitigate this situation, we translated the…

Descriptors: Foreign Countries, Higher Education, Critical Thinking, College Students

Person Heterogeneity of the BDI-II-C and Its Effects on Dimensionality and Construct Validity: Using Mixture Item Response Models

Peer reviewed

Direct link

Wu, Pei-Chen; Huang, Tsai-Wei – Measurement and Evaluation in Counseling and Development, 2010

This study was to apply the mixed Rasch model to investigate person heterogeneity of Beck Depression Inventory-II-Chinese version (BDI-II-C) and its effects on dimensionality and construct validity. Person heterogeneity was reflected by two latent classes that differ qualitatively. Additionally, person heterogeneity adversely affected the…

Descriptors: Construct Validity, Validity, Depression (Psychology), Item Response Theory

University Student Anonymity in the Summative Assessment of Written Work

Peer reviewed

Direct link

Brennan, David J. – Higher Education Research and Development, 2008

This paper provides an overview of the issue of student anonymity in the summative assessment of student work in higher education. It considers both theoretical literature pertaining to bias in the evaluation of the work of others and the limited empirical work undertaken on this issue in higher education. It then describes the experience of three…

Descriptors: Higher Education, Student Evaluation, Interrater Reliability, Test Bias

Blind Marking and Sex Bias in Student Assessment.

Peer reviewed

Newstead, Stephen E.; Dennis, Ian – Assessment and Evaluation in Higher Education, 1990

Three studies investigating the existence of sex bias in the grading of undergraduate students, by examining interrater reliability for blind and non-blind grading, are reported. Negative evidence found in the results and the confusing picture presented by previous research indicate little firm evidence of sex bias in grading. (Author/MSE)

Descriptors: Evaluation Methods, Grading, Higher Education, Interrater Reliability

Effects of Knowledge of Cognitive-Moral Development and Request to Fake on Defining Issues Test P-Scores.

Peer reviewed

Napier, John D. – Journal of Psychology, 1979

Support claims that the "Defining Issues Test" of cognitive-moral development cannot be faked higher. Finds that instruction about cognitive-moral development affected the scores of the teacher trainees who were tested. (RL)

Descriptors: Cognitive Development, Higher Education, Moral Development, Test Bias

The Hybird TUCE: Origin, Data, and Limitations

Peer reviewed

Saunders, Phillip; Welsh, Arthur L. – Journal of Economic Education, 1975

Compared the "hybrid" Test of Understanding in College Economics (TUCE) with the four original TUCE versions and found that the hybrid version is 1) "broader,""thinner," and less technical in terms of content coverage; 2) more reliable with a generally superior item analysis structure; and 3) slightly "easier" for students who have taken economics…

Descriptors: Economics, Economics Education, Educational Testing, Evaluation

Note on Reliability of Fixed-Response Formats.

Peer reviewed

Bardo, John W.; Yeager, Samuel J. – Perceptual and Motor Skills, 1982

Responses to various fixed test-response formats were examined for "reliability" due to systematic error; Cronbach's alphas up to .67 were obtained. Of formats tested, four-point Likert Scales were least affected while forms of lines and faces were most problematic. Possible modification in alpha to account for systematic bias is…

Descriptors: Higher Education, Measures (Individuals), Psychometrics, Response Style (Tests)

Reducing Administration Time While Improving Reliability and Validity of Fitness Tests.

Peer reviewed

Nelson, Jack K.; Dorociak, Jeff J. – Journal of Physical Education, Recreation & Dance, 1982

Test measurement, reliability, and validity are discussed in relation to methods of physical fitness testing. A successful testing method which involved students testing their peers is described, showing the administration of various test items and the use of test practice procedures. (JN)

Descriptors: Higher Education, Physical Education, Physical Fitness, Student Participation

Unreliability of Marking: Further Evidence and a Possible Explanation.

Peer reviewed

Branthwaite, Alan; And Others – Educational Review, 1981

In this naturalistic study of essay marking, 15 university lecturers graded an examination paper and completed the Eysenck Personality Questionnaire. A significant positive correlation was found between the marks given and the grader's lie score, indicating possible effects of staff-student interactions or social desirability on biases in grading.…

Descriptors: Essay Tests, Experimenter Characteristics, Higher Education, Personality Traits

The Identification of Biased Items.

Download full text

Sinnott, Loraine T. – 1982

A standard method for exploring item bias is the intergroup comparison of item difficulties. This paper describes a refinement and generalization of this technique. In contrast to prior approaches, the proposed method deletes outlying items from the formulation of a criterion for identifying items as deviant. It also extends the mathematical…

Descriptors: College Entrance Examinations, Difficulty Level, Higher Education, Item Analysis

The Relationship Between Verbal-Meaning Test Scores and Degree of Confidence in Item Responses

Peer reviewed

Wen, Shih-Sung – Journal of Educational Measurement, 1975

The relationship between students' scores on a verbal meaning test and their degrees of confidence in item responses was investigated. Subjects were black undergraduate students and they were administered a verbal meaning test by following a confidence testing procedure. (Author/BJG)

Descriptors: Blacks, Confidence Testing, Higher Education, Language Skills

Assessing Listening in the Basic Course: The University of Wisconsin-Oshkosh Listening Test.

Download full text

Willmington, S. Clay; Steinbrecher, Milda M. – 1993

A "Fundamentals of Speech Communication" course is required of all college students, and upon completion of such a course students should possess those basic speaking and listening skills necessary to complete successfully their college educations. With a view toward developing a new, more effective listening test, a study examined…

Descriptors: Communication Research, Higher Education, Introductory Courses, Listening Comprehension

Rater Stringency and Consistency in Performance Assessment.

Webb, Lynn C.; And Others – 1990

Two aspects of rater accuracy in performance assessment were analyzed: rater stringency/leniency, and rater consistency. Data were obtained from three administrations of an oral certification examination in a health profession. The examination consists of clinical cases in four content areas or subspecialities. A total of 364 candidates were…

Descriptors: Allied Health Occupations, Evaluation Methods, Evaluators, Higher Education

Previous Page | Next Page »

Pages: 1 | 2 | 3

Western Journal of Speech…	2
American Psychologist	1
Assessment & Evaluation in…	1
Assessment and Evaluation in…	1
Composition Studies	1
ETS Research Report Series	1
Educational Measurement:…	1
Educational Review	1
Educational and Psychological…	1
Higher Education	1
Higher Education Research and…	1
History of Education Quarterly	1
International Journal of…	1
Journal of Economic Education	1
Journal of Educational…	1
Journal of Physical…	1
Journal of Psychology	1
Journal of Teaching in…	1
Measurement and Evaluation in…	1
NCME Measurement in Education	1
Perceptual and Motor Skills	1
Routledge, Taylor & Francis…	1
More ▼

Ackerman, Michael	1
Avila, Dolores R.	1
Babb, Jacob	1
Banville, Dominique	1
Bardo, John W.	1
Beller, Michal	1
Bennett, Randy Elliot	1
Branthwaite, Alan	1
Brennan, David J.	1
Corradi, David	1
Craig, Robert	1
Denison, D. Brian, Ed.	1
Dennis, Ian	1
Desrosiers, Pauline	1
Dorociak, Jeff J.	1
Fruen, Mary	1
Galambos, Eva C.	1
Genet-Volet, Yvette	1
Gliessman, David	1
Hisama, Kay K.	1
Huang, Tsai-Wei	1
Ironson, Gail H.	1
Johnson, Sylvia T.	1
Kaplan, Robert M.	1
More ▼