Publication Date
In 2025 | 0 |
Since 2024 | 5 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 12 |
Since 2006 (last 20 years) | 54 |
Descriptor
Source
Author
Ingham, Roger J. | 6 |
Cordes, Anne K. | 5 |
Fink, Arlene | 5 |
Callahan, Carolyn M. | 4 |
Capie, William | 4 |
Cason, Carolyn L. | 4 |
Matson, Johnny L. | 4 |
Ottenbacher, Kenneth J. | 4 |
Tindal, Gerald | 4 |
Antonak, Richard F. | 3 |
Biggs, John B. | 3 |
More ▼ |
Publication Type
Education Level
Higher Education | 9 |
Early Childhood Education | 6 |
Elementary Education | 5 |
Secondary Education | 3 |
Elementary Secondary Education | 2 |
High Schools | 2 |
Middle Schools | 2 |
Primary Education | 2 |
Grade 1 | 1 |
Grade 2 | 1 |
Grade 3 | 1 |
More ▼ |
Audience
Researchers | 703 |
Practitioners | 117 |
Teachers | 26 |
Administrators | 14 |
Counselors | 12 |
Policymakers | 11 |
Students | 10 |
Media Staff | 2 |
Parents | 1 |
Location
Australia | 10 |
Canada | 4 |
Nigeria | 4 |
Japan | 3 |
United Kingdom | 3 |
West Germany | 3 |
China | 2 |
Greece | 2 |
India | 2 |
Israel | 2 |
Tennessee | 2 |
More ▼ |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating

Cordes, Anne K. – Journal of Speech, Language, and Hearing Research, 2000
In this study, 30 judges identified disfluency types they perceived in audiovisually recorded speech stimuli, first individually and then with a partner. Although intrapair and interpair agreement was higher in the partner than the individual condition, agreement for occurrences still averaged below 50 percent. Findings suggest caution in use of…
Descriptors: Adults, Evaluation Methods, Interrater Reliability, Speech Acts
Baume, David; Yorke, Mantz; Coffey, Martin – Assessment & Evaluation in Higher Education, 2004
In an attempt to gain a fuller understanding of the basis of grading, ten assessors each assessed two portfolios drawn from the course archive which had been produced by participants on a course in teaching in higher education. Assessors gave a grade or judgement on each of a portfolio's 75 portfolio elements, reasons for each judgement they made,…
Descriptors: Portfolio Assessment, Portfolios (Background Materials), Higher Education, Test Reliability
Dockett, Sue; Perry, Bob – Journal of Early Childhood Research, 2007
Much of the current rhetoric in areas of child and family research and in early childhood education emphasizes the importance of listening to children in research that has a direct impact on them. Despite this, there remain qualms in some research contexts and amongst some researchers about the reliability, validity and generalizability of…
Descriptors: Early Childhood Education, Foreign Countries, Ethics, Research Methodology
Cantor, Nancy K.; Hoover, H. D. – 1986
This paper isolates and examines separately three distinct sources of error in essay scores: lack of agreement between raters; inconsistencies in performance within mode of discourse, and inconsistencies in performance between modes of discourse. Essay prompts in the Iowa Tests of Basic Skills (ITBS) Writing Supplement were designed to assess…
Descriptors: Academic Achievement, Cues, Elementary Secondary Education, Error of Measurement
An Observational Study of the Lecture Delivery Style Characteristics of High and Low Rated Lectures.
Albanese, Mark A.; And Others – 1986
This study identifies distinguishing differences in lecture delivery styles of lecturers rated by students in a large multi-instructor course: the Introduction to Clinical Medicine Course (ICM). The 20 lowest- and highest-rated lecturers of the 1982 and 1983 ICM courses served as the target group. Non-student raters observing the 1984 lectures…
Descriptors: Analysis of Variance, Behavior Rating Scales, Higher Education, Interrater Reliability
Micceri, Theodore – 1984
This paper investigates the reliability of the Florida Performance Measurement Systems' Summative Observation instrument. Developed for the Florida Beginning Teacher Evaluation Program, it provides behavioral ratings for teachers in a classroom setting. Data came from ratings of videotapes of nine teachers conducting actual lessons by nine teams…
Descriptors: Analysis of Variance, Classroom Observation Techniques, Elementary Secondary Education, Evaluation Methods
Meier, Scott; Davis, Susan – 1983
For people-helping professionals, the concept of burnout describes the physical and emotional exhaustion they feel on the job. A cognitive-behavioral model defines burnout as a state in which individuals expect few rewards and considerable punishment from work, due to lack of valued reinforcement, controllable outcomes, or personal competence.…
Descriptors: Burnout, Cognitive Processes, Expectation, Human Services

Harper, Lawrence V.; Kraft, R. Harter – Developmental Psychology, 1986
Determines the test-retest reliability of dichotic listening procedures for assessing cerebral lateralization of receptive language in preschoolers.%x$OD)
Descriptors: Auditory Evaluation, Listening Comprehension Tests, Listening Skills, Preschool Children

Ellis, Gary; Witt, Peter A. – Journal of Leisure Research, 1984
The development, reliability, and validity of five scales designed to measure perceived freedom in leisure are explored in this article. Value of the scales for assessment and basic research on the state of mind view of leisure are discussed. (Author/DF)
Descriptors: Competence, Leisure Time, Motivation, Rating Scales

Quereshi, M. Y.; Ostrowski, Michael J. – Journal of Clinical Psychology, 1985
Administered three Wechsler adult intelligence scales to 72 undergraduates and tested the quality of means, variances, and covariances, utilizing subtest scale scores and IQs. Results indicated that the three scales were not parallel. Generally, the subtest scaled scores exhibited less similarity across the three scales than the IQ estimates.…
Descriptors: College Students, Comparative Analysis, Higher Education, Intelligence Tests

Anderson, Daniel R.; And Others – Child Development, 1985
Describes a new observational study of home television viewing by young children which involved placement of time-lapse video cameras in the homes of five-year-olds from middle-class families for a 10-day period. Families maintained TV viewing diaries, and control groups of families were employed to assess the impact of observational equipment in…
Descriptors: Diaries, Estimation (Mathematics), Parents, Questionnaires

Orwin, Robert G.; Cordray, David S. – Psychological Bulletin, 1985
Identifies three sources of reporting deficiency for meta-analytic results: quality (adequacy) of publicizing; quality of macrolevel reporting, and quality of microlevel reporting. Reanalysis of 25 reports from the Smith, Glass and Miller (1980) psychotherapy meta-analysis established two sources of misinformation, interrater reliabilities and…
Descriptors: Confidence Testing, Interrater Reliability, Meta Analysis, Psychotherapy

Teesson, Kathryn; Packman, Ann; Onslow, Mark – Journal of Speech, Language, and Hearing Research, 2003
This study examined intrajudge and interjudge agreement for the Lidcombe Behavioral Data Language (LBDL), a behaviorally based stuttering taxonomy. Ten experienced speech language pathologists and 10 undergraduates applied the LBDL to stuttered speech on two occasions. Intrajudge agreement was high for both groups, but only the experienced judges…
Descriptors: Adults, Classification, Reliability, Speech Evaluation

Kreiman, Jody; And Others – Journal of Speech and Hearing Research, 1992
Sixteen listeners (10 expert, 6 naive) judged the dissimilarity of pairs of voices drawn from pathological and normal populations. Only parameters that showed substantial variability were perceptually salient across listeners. Results suggest that traditional means of assessing listener reliability in voice perception tasks may not be appropriate.…
Descriptors: Evaluation Methods, Individual Differences, Interrater Reliability, Perception

Ingham, Roger J.; And Others – Journal of Speech and Hearing Research, 1993
Two experiments investigating interval-by-interval interjudge and intrajudge agreement for stuttered and nonstuttered speech intervals found that training of judges could improve reliability levels; judges with relatively high intrajudge agreement also showed relatively higher interjudge agreement; and interval-by-interval interjudge agreement was…
Descriptors: Evaluation Methods, Interrater Reliability, Performance Factors, Speech Evaluation