Descriptor
Source
Public Libraries | 1 |
Author
Capie, William | 4 |
Cason, Carolyn L. | 4 |
Andrich, David | 2 |
Bliss, Leonard B. | 2 |
Busch, John Christian | 2 |
Cason, Gerald J. | 2 |
Cronin, Linda | 2 |
Haladyna, Thomas M. | 2 |
Hoover, H. D. | 2 |
Huynh, Huynh | 2 |
Jaeger, Richard M. | 2 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 218 |
Practitioners | 11 |
Counselors | 4 |
Teachers | 2 |
Administrators | 1 |
Media Staff | 1 |
Policymakers | 1 |
Location
West Germany | 3 |
Australia | 2 |
Canada | 2 |
Nigeria | 2 |
Austria | 1 |
Connecticut | 1 |
Florida | 1 |
Georgia | 1 |
India | 1 |
Israel | 1 |
Jordan | 1 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Halpin, Gerald; And Others – 1986
Based upon the assumption that the process of peer review of publications and research is flawed, interrater reliability of reviews of 188 research proposals submitted for funding at a major university was studied. The eight dimensions rated were: (1) significance of the research; (2) clarity and reasonableness of the objectives; (3)…
Descriptors: College Faculty, Evaluation Criteria, Evaluators, Grants
Halpin, Glennelle; And Others – 1986
This study was designed as a reconsideration of the weights used in evaluative decisions made with regard to research proposals submitted for funding at a major state university. The specific objective of the study was to determine whether the actual weights for components used in the evaluation of the proposals differed from a priori weights…
Descriptors: College Faculty, Decision Making, Evaluation Methods, Grants
Rose, Janet S.; Huynh, Huynh – 1984
As part of a new teacher evaluation program initiated by the local school board, the Charleston County School District (South Carolina) adopted the Assessments of Performance in Teaching (APT) as a major evaluation tool to assess the teaching performance of annual contract teachers. Since evaluation procedures can ultimately lead to teacher…
Descriptors: Classroom Observation Techniques, Elementary Secondary Education, Evaluation Methods, Interrater Reliability
Christine, Charles T.; And Others – 1982
Thirty-two children aged 7 to 12 participated in a study to determine the reliability of the Ekwall Reading Inventory (ERI) and the Classroom Reading Inventory (CRI). The children were randomly assigned to take one of the two inventories, which were administered by four different specially trained teachers. The study used a test-retest design, in…
Descriptors: Comparative Analysis, Elementary Secondary Education, Informal Reading Inventories, Interrater Reliability
Lehmann, Rainer H. – 1987
A total of 1,487 eleventh grade students from the Hamburg (West Germany) school system were asked to complete four writing assignments used in an International Association for the Evaluation of Educational Achievement (IEA) study of writing assessment. In analyzing the writing samples, the study focused on: (1) between-rater effects; (2)…
Descriptors: Evaluation Problems, Foreign Countries, High Schools, International Programs
Guthrie, Abbie C. – 2000
Too many researchers speak of "the reliability of the test," thus indicating their basic misunderstanding of reliability. This paper explains classical reliability and the score features that influence coefficient alpha. It explains when coefficient alpha can be negative, even though it is conceptually a variance-accounted-for statistic.…
Descriptors: Effect Size, Measurement Techniques, Reliability, Scores
Taylor, Marcia B; Porterfield, William D. – 1984
This paper describes the Measure of Epistemological Reflection (MER), an instrument to assess cognitive developmental level according to the Perry scheme of intellectual and ethical development. It contains sets of questions for each of the six cognitive domains: decision making, learner role, instructor role in the learning process, peer role in…
Descriptors: Cognitive Development, Cognitive Tests, Epistemology, Higher Education
Ackerman, Terry A. – 1986
The purpose of this paper is to compare the precision of direct and indirect measures of writing assessment using the test information functions from a graded response Item Response Theory (IRT) model. Subjects were 192 sophomore English students from a parochial high school in Wisconsin. Both direct and indirect measures of writing ability were…
Descriptors: Correlation, Essay Tests, High Schools, Interrater Reliability
Santmire, Toni E. – 1984
The purpose of this paper is to discuss ways in which developmental psychology suffers from the lack of an appropriate technology of measurement and statistical analysis. The paper begins by noting that developmental psychology is the study of change; that individuals develop through a succession of "stages" which are separated by…
Descriptors: Data Analysis, Data Collection, Developmental Psychology, Developmental Stages
Cason, Carolyn L.; And Others – 1986
Cason and Cason's model of performance rating was used to determine the extent to which variation in reviewer standards affected the reliability and validity of the program review process used to select papers for inclusion in the annual program. Data analyzed were the overall recommendation for acceptance and ratings on seven quality criteria…
Descriptors: Conference Papers, Data Analysis, Educational Research, Evaluation Criteria
Webber, Larry; And Others – 1986
Generalizability theory, which subsumes classical measurement theory as a special case, provides a general model for estimating the reliability of observational rating data by estimating the variance components of the measurement design. Research data from the "Heart Smart" health intervention program were analyzed as a heuristic tool.…
Descriptors: Behavior Rating Scales, Cardiovascular System, Error of Measurement, Generalizability Theory
Mitchell, Karen J.; Anderson, Judith A. – 1987
The Association of American Medical Colleges is conducting research to develop, implement, and evaluate a Medical College Admission Test (MCAT) essay testing program. Essay administration in the spring and fall of 1985 and 1986 suggested that additional research was needed on the development of topics which elicit similar skills and meet standard…
Descriptors: College Entrance Examinations, Essay Tests, Estimation (Mathematics), Generalizability Theory
Shale, Doug – 1986
This study is an attempt at a cohesive characterization of the concept of essay reliability. As such, it takes as a basic premise that previous and current practices in reporting reliability estimates for essay tests have certain shortcomings. The study provides an analysis of these shortcomings--partly to encourage a fuller understanding of the…
Descriptors: Analysis of Variance, Correlation, Error of Measurement, Essay Tests
Yap, Kueh Chin; Capie, William – 1985
The purpose of this study was to compare the relative magnitude of the variance components and generalizability coefficients derived from the Teacher Performance Assessment Instruments (TPAI) data using two different methods of data collection: (1) occasions when observers were in the classroom for simultaneous observation and (2) occasions when…
Descriptors: Analysis of Variance, Classroom Observation Techniques, Data Collection, Elementary Secondary Education
Interrater Reliability and Internal Consistency of Student and Staff Ratings of Medical Instruction.
Dielman, T. E.; Horvatich, Paula K. – 1985
The purposes of this study were to establish the interrater reliability, dimensionality, and internal consistency of an instruction evaluation instrument used at The University of Michigan Medical School. Using the nine-item rating scale, 1,758 student ratings and 88 staff ratings were gathered on 61 faculty. Interrater agreement ranged from .28…
Descriptors: Evaluation Methods, Graduate Medical Education, Higher Education, Interrater Reliability