NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing 61 to 75 of 146 results Save | Export
Cizek, Gregory J. – 1991
A commonly accepted rule for developing equated examinations using the common-items non-equivalent groups (CINEG) design is that items common to the two examinations being equated should be identical. The CINEG design calls for two groups of examinees to respond to a set of common items that is included in two examinations. In practice, this rule…
Descriptors: Certification, Comparative Testing, Difficulty Level, Higher Education
Green, Kathy – 1978
Forty three-option multiple choice (MC) statements on a midterm examination were converted to 120 true-false (TF) statements, identical in content. Test forms (MC and TF) were randomly administered to 50 undergraduates, to investigate the validity and internal consistency reliability of the two forms. A Kuder-Richardson formula 20 reliability was…
Descriptors: Achievement Tests, Comparative Testing, Higher Education, Multiple Choice Tests
PDF pending restoration PDF pending restoration
Feitler, Fred C.; Graf, Stephen A. – 1978
Two forms of a teacher rating questionnaire, Student Reaction to Instruction, were administered to college students. The regular format used category scaling; the 631 responding students selected a number between one and five. Experimental "ratio production (multiply-divide)" evaluations were also completed by 26 subjects along with the…
Descriptors: College Faculty, Comparative Testing, Higher Education, Rating Scales
Peer reviewed Peer reviewed
Barnes, Janet L.; Landy, Frank J. – Applied Psychological Measurement, 1979
Although behaviorally anchored rating scales have both intuitive and empirical appeal, they have not always yielded superior results in contrast with graphic rating scales. Results indicate that the choice of an anchoring procedure will depend on the nature of the actual rating process. (Author/JKS)
Descriptors: Behavior Rating Scales, Comparative Testing, Higher Education, Rating Scales
Peer reviewed Peer reviewed
Schriesheim, Chester A.; And Others – Educational and Psychological Measurement, 1991
Effects of item wording on questionnaire reliability and validity were studied, using 280 undergraduate business students who completed a questionnaire comprising 4 item types: (1) regular; (2) polar opposite; (3) negated polar opposite; and (4) negated regular. Implications of results favoring regular and negated regular items are discussed. (SLD)
Descriptors: Business Education, Comparative Testing, Higher Education, Negative Forms (Language)
Peer reviewed Peer reviewed
Wainer, Howard; And Others – Journal of Educational Measurement, 1991
Hierarchical (adaptive) and linear methods of testlet construction were compared. The performance of 2,080 ninth and tenth graders on a 4-item testlet was used to predict performance on the entire test. The adaptive test was slightly superior as a predictor, but the cost of obtaining that superiority was considerable. (SLD)
Descriptors: Adaptive Testing, Algebra, Comparative Testing, High School Students
Mazerik, Matthew B. – Online Submission, 2006
The mean scores of English Language Learners (ELL) and English Only (EO) students in 4th and 5th grade (N = 110), across the teacher-administered Grammar Skills Test, were examined for differences in participants' scores on assessments containing single-step directions and assessments containing multiple-step directions. The results indicated no…
Descriptors: Second Language Learning, Grade 5, Language Proficiency, Educational Testing
Sykes, Robert C.; And Others – 1991
To investigate the psychometric feasibility of replacing a paper-and-pencil licensing examination with a computer-administered test, a validity study was conducted. The computer-administered test (Cadm) was a common set of items for all test takers, distinct from computerized adaptive testing, in which test takers receive items appropriate to…
Descriptors: Adults, Certification, Comparative Testing, Computer Assisted Testing
Appenzellar, Anne B.; Kelley, H. Paul – 1990
A test to be used for placement of students at the University of Texas (UT), Austin, in lower-level courses in Chinese and to award credit-by-examination in some courses was developed, and its validity was tested. Faculty members from the Department of Oriental and African Studies and the Measurement and Evaluation Center of the UT constructed a…
Descriptors: Advanced Placement, Chinese, College Students, Comparative Testing
Bezruczko, Nikolaus; Schroeder, David H. – 1989
An experimental test battery consisting of several tests that measure aspects of artistic judgment was administered to over 1,600 clients of the Johnson O'Connor Research Foundation. The battery consisted of the Visual Aesthetic Sensitivity Test (VAST) of K. O. Gotz (1981); the Design Judgment Test (DJT) of M. Graves (1948); and two tests…
Descriptors: Adults, Aesthetic Values, Aptitude Tests, Art Appreciation
Heller, Eric S.; Rife, Frank N. – 1987
The goal of this study was to assess the relative merit of various ranges and types of response scales in terms of respondent satisfaction and comfort and the nature of the elicited information in a population of seventh grade students. Three versions of an attitudinal questionnaire, each containing the same items but employing a different…
Descriptors: Attitude Measures, Comparative Testing, Grade 7, Junior High Schools
Franklin, Margery; Cobb, Judith – 1967
A current exploratory research project is directed toward developing means for gathering systematic data on nonverbal representation in young children. Tasks involving nonverbal representational functioning have been developed, evaluated in preliminary work with fifteen 4-year-old subjects, and revised. The revised series of tasks consists of four…
Descriptors: Cognitive Tests, Comparative Testing, Data Analysis, Disadvantaged
Peer reviewed Peer reviewed
Carver, Ronald P. – Educational and Psychological Measurement, 1992
Reliability and validity of a new measure of cognitive speed, the Speed of Thinking Test (SST), were investigated with 129 college students, who also completed a vocabulary test, a test of reading speed, and a test of reading comprehension. The SST appears to be a reliable and valid measure. (SLD)
Descriptors: Cognitive Ability, Cognitive Tests, College Students, Comparative Testing
Peer reviewed Peer reviewed
Friedman, Stephen J.; Ansley, Timothy N. – Journal of Experimental Education, 1990
To investigate the relationship between reading and listening test scores, 3 different sets of listening items accompanied by answer sheets requiring varying amounts of reading were administered to 1,200 students in grades 3 through 8. Listening scores increased as more printed information was added to the answer sheet. (SLD)
Descriptors: Answer Sheets, Comparative Testing, Elementary Education, Elementary School Students
Wiggins, Grant – Executive Educator, 1994
Instead of relying on standardized test scores and interdistrict comparisons, school systems must develop a more powerful, timely, and local approach to accountability that is truly client-centered and focused on results. Accountability requires giving successful teachers the freedom and opportunity to take effective ideas beyond their own…
Descriptors: Accountability, Comparative Testing, Elementary Secondary Education, Feedback
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10