ERIC - Search Results

Publication Date

In 2025	1
Since 2024	8
Since 2021 (last 5 years)	8
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	21

Descriptor

Comparative Testing	146
Test Construction	146
Test Reliability	47
Test Validity	44
Test Items	38
Higher Education	35
Test Format	35
Multiple Choice Tests	23
Computer Assisted Testing	18
Foreign Countries	18
Difficulty Level	15
Item Analysis	15
Achievement Tests	14
Adults	13
Item Response Theory	13
Scores	13
Aptitude Tests	12
Elementary Secondary Education	12
Factor Analysis	12
High School Students	12
College Students	11
Questionnaires	11
Reading Tests	11
Test Interpretation	11
Testing Problems	11
More ▼

Publication Type

Reports - Research	91
Journal Articles	55
Speeches/Meeting Papers	39
Reports - Evaluative	28
Tests/Questionnaires	12
Numerical/Quantitative Data	3
Information Analyses	2
Opinion Papers	2
Reports - General	2
Books	1
Dissertations/Theses -…	1
Dissertations/Theses -…	1
Reports - Descriptive	1
More ▼

Education Level

Higher Education	4
Postsecondary Education	4
Elementary Education	2
Elementary Secondary Education	2
Grade 4	2
Grade 5	2
High Schools	2
Middle Schools	2
Grade 2	1
Grade 6	1
Intermediate Grades	1
Kindergarten	1
More ▼

Audience

Researchers	11
Practitioners	3
Teachers	2
Counselors	1

Location

United States	6
Canada	3
United Kingdom	3
Australia	2
Germany	2
Indonesia	2
Israel	2
Poland	2
Alabama	1
Austria	1
California	1
China	1
Finland	1
Idaho	1
India	1
Ireland	1
Japan	1
Michigan (Detroit)	1
New Zealand	1
Pennsylvania	1
Peru	1
Singapore	1
South Korea	1
Switzerland	1
United Kingdom (England)	1
More ▼

Laws, Policies, & Programs

What Works Clearinghouse Rating

Test Construction X

Showing 61 to 75 of 146 results Save | Export

The Effect of Altering the Position of Options in a Multiple-Choice Examination.

Download full text

Cizek, Gregory J. – 1991

A commonly accepted rule for developing equated examinations using the common-items non-equivalent groups (CINEG) design is that items common to the two examinations being equated should be identical. The CINEG design calls for two groups of examinees to respond to a set of common items that is included in two examinations. In practice, this rule…

Descriptors: Certification, Comparative Testing, Difficulty Level, Higher Education

Multiple Choice Converted to True-False: Comparative Reliabilities and Validities.

Download full text

Green, Kathy – 1978

Forty three-option multiple choice (MC) statements on a midterm examination were converted to 120 true-false (TF) statements, identical in content. Test forms (MC and TF) were randomly administered to 50 undergraduates, to investigate the validity and internal consistency reliability of the two forms. A Kuder-Richardson formula 20 reliability was…

Descriptors: Achievement Tests, Comparative Testing, Higher Education, Multiple Choice Tests

The Use of Ratio Production Scales to Assess Quality of Teaching Performance.

PDF pending restoration

Feitler, Fred C.; Graf, Stephen A. – 1978

Two forms of a teacher rating questionnaire, Student Reaction to Instruction, were administered to college students. The regular format used category scaling; the 631 responding students selected a number between one and five. Experimental "ratio production (multiply-divide)" evaluations were also completed by 26 subjects along with the…

Descriptors: College Faculty, Comparative Testing, Higher Education, Rating Scales

Scaling Behavioral Anchors.

Peer reviewed

Barnes, Janet L.; Landy, Frank J. – Applied Psychological Measurement, 1979

Although behaviorally anchored rating scales have both intuitive and empirical appeal, they have not always yielded superior results in contrast with graphic rating scales. Results indicate that the choice of an anchoring procedure will depend on the nature of the actual rating process. (Author/JKS)

Descriptors: Behavior Rating Scales, Comparative Testing, Higher Education, Rating Scales

The Effect of Negation and Polar Opposite Item Reversals on Questionnaire Reliability and Validity: An Experimental Investigation.

Peer reviewed

Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1991

Effects of item wording on questionnaire reliability and validity were studied, using 280 undergraduate business students who completed a questionnaire comprising 4 item types: (1) regular; (2) polar opposite; (3) negated polar opposite; and (4) negated regular. Implications of results favoring regular and negated regular items are discussed. (SLD)

Descriptors: Business Education, Comparative Testing, Higher Education, Negative Forms (Language)

Building Algebra Testlets: A Comparison of Hierarchical and Linear Structures.

Peer reviewed

Wainer, Howard; And Others – Journal of Educational Measurement, 1991

Hierarchical (adaptive) and linear methods of testlet construction were compared. The performance of 2,080 ninth and tenth graders on a 4-item testlet was used to predict performance on the entire test. The adaptive test was slightly superior as a predictor, but the cost of obtaining that superiority was considerable. (SLD)

Descriptors: Adaptive Testing, Algebra, Comparative Testing, High School Students

The Effects of Multiple-Step and Single-Step Directions on Fourth and Fifth Grade Students' Grammar Assessment Performance

Download full text

Mazerik, Matthew B. – Online Submission, 2006

The mean scores of English Language Learners (ELL) and English Only (EO) students in 4th and 5th grade (N = 110), across the teacher-administered Grammar Skills Test, were examined for differences in participants' scores on assessments containing single-step directions and assessments containing multiple-step directions. The results indicated no…

Descriptors: Second Language Learning, Grade 5, Language Proficiency, Educational Testing

Assessing the Effects of Computer Administration on Scores and Parameter Estimates Using IRT Models.

Download full text

Sykes, Robert C.; And Others – 1991

To investigate the psychometric feasibility of replacing a paper-and-pencil licensing examination with a computer-administered test, a validity study was conducted. The computer-administered test (Cadm) was a common set of items for all test takers, distinct from computerized adaptive testing, in which test takers receive items appropriate to…

Descriptors: Adults, Certification, Comparative Testing, Computer Assisted Testing

Validity Study of the U.T. Austin Test for Credit in Chinese: Spring and Fall 1985 and Spring 1986. RB-86-5.

Appenzellar, Anne B.; Kelley, H. Paul – 1990

A test to be used for placement of students at the University of Texas (UT), Austin, in lower-level courses in Chinese and to award credit-by-examination in some courses was developed, and its validity was tested. Faculty members from the Department of Oriental and African Studies and the Measurement and Evaluation Center of the UT constructed a…

Descriptors: Advanced Placement, Chinese, College Students, Comparative Testing

Artistic Judgment Project I: Internal-Structure Analyses. Technical Report 1989-2.

Bezruczko, Nikolaus; Schroeder, David H. – 1989

An experimental test battery consisting of several tests that measure aspects of artistic judgment was administered to over 1,600 clients of the Johnson O'Connor Research Foundation. The battery consisted of the Visual Aesthetic Sensitivity Test (VAST) of K. O. Gotz (1981); the Design Judgment Test (DJT) of M. Graves (1948); and two tests…

Descriptors: Adults, Aesthetic Values, Aptitude Tests, Art Appreciation

Questionnaire Response Scales: Design Factors That Influence Respondent Satisfaction.

Download full text

Heller, Eric S.; Rife, Frank N. – 1987

The goal of this study was to assess the relative merit of various ranges and types of response scales in terms of respondent satisfaction and comfort and the nature of the elicited information in a population of seventh grade students. Three versions of an attitudinal questionnaire, each containing the same items but employing a different…

Descriptors: Attitude Measures, Comparative Testing, Grade 7, Junior High Schools

Head Start Evaluation and Research Center. Progress Report of Research Studies 1966 to 1967. Document 3, an Experimental Approach to Studying Non-Verbal Representation in Young Children.

Download full text

Franklin, Margery; Cobb, Judith – 1967

A current exploratory research project is directed toward developing means for gathering systematic data on nonverbal representation in young children. Tasks involving nonverbal representational functioning have been developed, evaluated in preliminary work with fifteen 4-year-old subjects, and revised. The revised series of tasks consists of four…

Descriptors: Cognitive Tests, Comparative Testing, Data Analysis, Disadvantaged

Reliability and Validity of the Speed of Thinking Test.

Peer reviewed

Carver, Ronald P. – Educational and Psychological Measurement, 1992

Reliability and validity of a new measure of cognitive speed, the Speed of Thinking Test (SST), were investigated with 129 college students, who also completed a vocabulary test, a test of reading speed, and a test of reading comprehension. The SST appears to be a reliable and valid measure. (SLD)

Descriptors: Cognitive Ability, Cognitive Tests, College Students, Comparative Testing

The Influence of Reading on Listening Test Scores.

Peer reviewed

Friedman, Stephen J.; Ansley, Timothy N. – Journal of Experimental Education, 1990

To investigate the relationship between reading and listening test scores, 3 different sets of listening items accompanied by answer sheets requiring varying amounts of reading were administered to 1,200 students in grades 3 through 8. Listening scores increased as more printed information was added to the answer sheet. (SLD)

Descriptors: Answer Sheets, Comparative Testing, Elementary Education, Elementary School Students

None of the Above.

Wiggins, Grant – Executive Educator, 1994

Instead of relying on standardized test scores and interdistrict comparisons, school systems must develop a more powerful, timely, and local approach to accountability that is truly client-centered and focused on results. Accountability requires giving successful teachers the freedom and opportunity to take effective ideas beyond their own…

Descriptors: Accountability, Comparative Testing, Elementary Secondary Education, Feedback

« Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10

Educational and Psychological…	6
Journal of Educational…	5
Applied Measurement in…	4
Psychological Reports	4
Applied Psychological…	3
Educational Measurement:…	2
Evaluation and the Health…	2
Int Rev Educ	2
Intelligence	2
Online Submission	2
Psychological Assessment	2
Studies in Educational…	2
Advances in Health Sciences…	1
American Annals of the Deaf	1
American Educational Research…	1
American Journal of Business…	1
Assessment in Education:…	1
California Journal of…	1
Educational Research Quarterly	1
Executive Educator	1
International Journal of…	1
International Journal of…	1
J Appl Psychol	1
J Psychol	1
Journal of Biological…	1
More ▼

Trevisan, Michael S.	3
Breland, Hunter M.	2
Karma, Kai	2
Schroeder, David H.	2
Steele, D. Joyce	2
Wainer, Howard	2
A. E. Ades	1
Aberman, Hugh M.	1
Agus Santoso	1
Alderton, David L.	1
Allison, Howard K., II	1
Anderson, Paul S.	1
Anderson, Stephen A.	1
Andrada, Gilbert N.	1
Ang, Cheng	1
Annabel L. Davies	1
Ansley, Timothy N.	1
Appenzellar, Anne B.	1
Armstrong, Anne-Marie	1
Asch, Harvey	1
Babcock, Judith L.	1
Baker, Eva L.	1
Barnes, Janet L.	1
More ▼

Alabama High School…	2
Graduate Record Examinations	2
SAT (College Admission Test)	2
Test of Standard Written…	2
ACT Assessment	1
Adaptive Behavior Scale	1
Armed Services Vocational…	1
Bayley Scales of Infant…	1
College Level Examination…	1
Computer Anxiety Scale	1
Differential Aptitude Test	1
Iowa Tests of Basic Skills	1
Myers Briggs Type Indicator	1
Peabody Individual…	1
Program for International…	1
Raven Progressive Matrices	1
Student Descriptive…	1
Test of Economic Literacy	1
Test of English as a Foreign…	1
Vineland Adaptive Behavior…	1
Wechsler Adult Intelligence…	1
More ▼