NotesFAQContact Us
Collection
Advanced
Search Tips
What Works Clearinghouse Rating
Showing 706 to 720 of 728 results Save | Export
Micceri, Theodore – 1984
This paper investigates the reliability of the Florida Performance Measurement Systems' Summative Observation instrument. Developed for the Florida Beginning Teacher Evaluation Program, it provides behavioral ratings for teachers in a classroom setting. Data came from ratings of videotapes of nine teachers conducting actual lessons by nine teams…
Descriptors: Analysis of Variance, Classroom Observation Techniques, Elementary Secondary Education, Evaluation Methods
Smith, Teresa A. – 1997
The Third International Mathematics and Science Study (TIMSS) measured mathematics and science achievement of middle school students in more than 40 countries. About one quarter of the tests' nearly 300 items were free response items requiring students to generate their own answers. Scoring these responses used a two-digit diagnostic code rubric…
Descriptors: Comparative Education, English, Error of Measurement, Foreign Countries
Wolfe, Edward W. – 1996
Although portfolio assessment is becoming increasingly popular, it may not survive unless portfolio scoring can meet the demands of large-scale assessment standards. The results of studies of interrater reliability with large-scale portfolio assessments have been mixed. This paper reports the scoring results of a nationwide portfolio pilot in…
Descriptors: Decision Making, Generalizability Theory, Interrater Reliability, Language Arts
Gonzalez-Tamayo, Eulogio – 1987
The concepts of universe of admissible observation and universe of generalization from the generalizability theory were applied to calculate the intraclass correlation coefficient of a licensure test. The internal consistency coefficient of a dichotomously scored test is identical to the intraclass correlation coefficient of a two-facet design.…
Descriptors: Adults, Analysis of Variance, Content Validity, Criterion Referenced Tests
Webb, Norman L. – 1980
This project paper reports the interobserver agreements and reliabilities for the observation procedures used in the Descriptive Study of Phase IV of the Individually Guided Education Evaluation Project. Only data from four observers--at the two Developing Mathematical Processes Schools and the two Wisconsin Design for Reading Skills Development…
Descriptors: Classroom Observation Techniques, Elementary Education, Generalizability Theory, Grade 2
Peer reviewed Peer reviewed
van Weeren, J.; Theunissen, T. J. J. M. – Language Learning, 1987
A systematic and explicit approach to evaluation of pronunciation is proposed. Generalizability theory was applied in order to comprise all relevant factors in one psychomotor model. French and German pronunciation tests (in Appendix) were devised and evaluated. Common pronunciation problems for native Dutch speakers were incorporated. (Author/LMO)
Descriptors: Communicative Competence (Languages), Dutch, Error Analysis (Language), Error Patterns
Peer reviewed Peer reviewed
Crocker, Linda; And Others – Journal of Educational Measurement, 1988
Using generalizability theory as a framework, the problem of assessing the content validity of standardized achievement tests is considered. Four designs to assess test-item fit to a curriculum are described, and procedures for determining the optimal number of raters and schools in a content-validation decision-making study are considered. (TJH)
Descriptors: Achievement Tests, Content Validity, Decision Making, Elementary Education
PDF pending restoration PDF pending restoration
Tay, May Ping; And Others – 1994
This study examined the generalizability of the internal/external (I/E) frame of reference model of academic self-concept development. The "external" component of the model refers to comparing one's achievement with one's peers; in LISREL causal modeling, this external comparison is presented as positive paths. The "internal"…
Descriptors: Academic Achievement, Early Adolescents, Generalizability Theory, Grade 7
Sandler, Andrew B. – 1987
Statistical significance is misused in educational and psychological research when it is applied as a method to establish the reliability of research results. Other techniques have been developed which can be correctly utilized to establish the generalizability of findings. Methods that do provide such estimates are known as invariance or…
Descriptors: Analysis of Covariance, Analysis of Variance, Correlation, Discriminant Analysis
Njora, Hungi; Darmawan, I Gusti Ngurah; Keeves, John P. – International Education Journal, 2004
This article addresses an important problem that faces educators in assessing students' competence levels in learned tasks. Data from 165 students from Massachusetts and Minnesota in the United States are used to examine the validity of five assessment modes (multiple choice test, scenario, portfolio, self-assessment and supervisor rating) in…
Descriptors: Generalizability Theory, Human Services, Academic Achievement, Item Response Theory
Peer reviewed Peer reviewed
Linn, Robert L.; And Others – Educational Researcher, 1991
Increasing emphasis on assessment and concern about assessment techniques have stirred interest in alternative assessment forms, for which evidence is needed about consequences, transfer of performance on specific assessment tests, and assessment fairness. Criteria concerning consequences, fairness, transfer-generalizability, cognitive complexity,…
Descriptors: Achievement Tests, Cost Effectiveness, Educational Assessment, Educational Policy
Wise, Lauress – 1993
Industrial and organizational psychologists for the Department of Defense have been working for the past 10 years to develop high fidelity measures of job performance for use in validating job selection procedures and standards. Information on developing and scoring performance exercises in the Job Performance Measurement (JPM) Project is…
Descriptors: Educational Assessment, Educational Research, Evaluation Methods, Generalizability Theory
Kim, Yang Boon; Lee, Jong Sung – 1990
The empirical validity of generalizability theory was investigated by applying two three-facet designs to data obtained in 1988 from administration of the Scientific Thinking and Research Skill Test (STRST). The decision validity of the STRST was also examined. Subjects were 125 fifth-grade and 125 sixth-grade students who were administered the…
Descriptors: Analysis of Variance, Decision Making, Elementary School Students, Generalizability Theory
Lefebvre, Daniel J.; Suen, Hoi K. – 1990
An empirical investigation of methodological issues associated with evaluating treatment effect in single-subject research (SSR) designs is presented. This investigation: (1) conducted a generalizability (G) study to identify the sources of systematic and random measurement error (SRME); (2) used an analytic approach based on G theory to integrate…
Descriptors: Classroom Observation Techniques, Disabilities, Educational Research, Error of Measurement
Teddlie, Charles; And Others – 1990
The results are provided of an initial analysis of the reliability (generalizability) of the System for Teaching and Learning Assessment and Review (STAR) as a comprehensive measure of classroom teaching and learning for making teacher certification decisions. The STAR contains 140 indicators of teacher effectiveness and student learning, which…
Descriptors: Beginning Teachers, Classroom Observation Techniques, Elementary School Teachers, Elementary Secondary Education
Pages: 1  |  ...  |  39  |  40  |  41  |  42  |  43  |  44  |  45  |  46  |  47  |  48  |  49