ERIC - Search Results

Showing 706 to 720 of 728 results

Establishing the Reliability of the Florida Performance Measurement System's Research Based Observation Instrument.

Micceri, Theodore – 1984

This paper investigates the reliability of the Florida Performance Measurement System's Summative Observation instrument. Developed for the Florida Beginning Teacher Evaluation Program, it provides behavioral ratings for teachers in a classroom setting. Data came from ratings of videotapes of nine teachers conducting actual lessons by nine teams…

Descriptors: Analysis of Variance, Classroom Observation Techniques, Elementary Secondary Education, Evaluation Methods

The Generalizability of Scoring TIMSS Open-Ended Items.

Smith, Teresa A. – 1997

The Third International Mathematics and Science Study (TIMSS) measured mathematics and science achievement of middle school students in more than 40 countries. About one quarter of the tests' nearly 300 items were free response items requiring students to generate their own answers. Scoring these responses used a two-digit diagnostic code rubric…

Descriptors: Comparative Education, English, Error of Measurement, Foreign Countries

A Report on the Reliability of a Large-Scale Portfolio Assessment for Language Arts, Mathematics, and Science.

Wolfe, Edward W. – 1996

Although portfolio assessment is becoming increasingly popular, it may not survive unless portfolio scoring can meet the demands of large-scale assessment standards. The results of studies of interrater reliability with large-scale portfolio assessments have been mixed. This paper reports the scoring results of a nationwide portfolio pilot in…

Descriptors: Decision Making, Generalizability Theory, Interrater Reliability, Language Arts

Content Specifications of a Test and Generalizability Theory.

Gonzalez-Tamayo, Eulogio – 1987

The concepts of universe of admissible observation and universe of generalization from the generalizability theory were applied to calculate the intraclass correlation coefficient of a licensure test. The internal consistency coefficient of a dichotomously scored test is identical to the intraclass correlation coefficient of a two-facet design.…

Descriptors: Adults, Analysis of Variance, Content Validity, Criterion Referenced Tests

Interobserver Agreement for the Observation Procedures for the DMP and WDRSD Observers. Descriptive Study. Phase IV. Project Paper 79-25. Parts 1 and 2.

Webb, Norman L. – 1980

This project paper reports the interobserver agreements and reliabilities for the observation procedures used in the Descriptive Study of Phase IV of the Individually Guided Education Evaluation Project. Only data from four observers--at the two Developing Mathematical Processes Schools and the two Wisconsin Design for Reading Skills Development…

Descriptors: Classroom Observation Techniques, Elementary Education, Generalizability Theory, Grade 2

Testing Pronunciation: An Application of Generalizability Theory.

Peer reviewed

van Weeren, J.; Theunissen, T. J. J. M. – Language Learning, 1987

A systematic and explicit approach to evaluation of pronunciation is proposed. Generalizability theory was applied in order to comprise all relevant factors in one psychomotor model. French and German pronunciation tests (in Appendix) were devised and evaluated. Common pronunciation problems for native Dutch speakers were incorporated. (Author/LMO)

Descriptors: Communicative Competence (Languages), Dutch, Error Analysis (Language), Error Patterns

The Generalizability of Content Validity Ratings.

Peer reviewed

Crocker, Linda; And Others – Journal of Educational Measurement, 1988

Using generalizability theory as a framework, the problem of assessing the content validity of standardized achievement tests is considered. Four designs to assess test-item fit to a curriculum are described, and procedures for determining the optimal number of raters and schools in a content-validation decision-making study are considered. (TJH)

Descriptors: Achievement Tests, Content Validity, Decision Making, Elementary Education

The Internal/External Frame of Reference Model of Academic Self-Concept in Early Adolescents.

Tay, May Ping; And Others – 1994

This study examined the generalizability of the internal/external (I/E) frame of reference model of academic self-concept development. The "external" component of the model refers to comparing one's achievement with one's peers; in LISREL causal modeling, this external comparison is presented as positive paths. The "internal"…

Descriptors: Academic Achievement, Early Adolescents, Generalizability Theory, Grade 7

The Use of Invariance and Bootstrap Procedures as a Method to Establish the Reliability of Research Results.

Sandler, Andrew B. – 1987

Statistical significance is misused in educational and psychological research when it is applied as a method to establish the reliability of research results. Other techniques have been developed which can be correctly utilized to establish the generalizability of findings. Methods that do provide such estimates are known as invariance or…

Descriptors: Analysis of Covariance, Analysis of Variance, Correlation, Discriminant Analysis

Examining the Validity of Different Assessment Modes in Measuring Competence in Performing Human Services

Peer reviewed

Njora, Hungi; Darmawan, I Gusti Ngurah; Keeves, John P. – International Education Journal, 2004

This article addresses an important problem that faces educators in assessing students' competence levels in learned tasks. Data from 165 students from Massachusetts and Minnesota in the United States are used to examine the validity of five assessment modes (multiple choice test, scenario, portfolio, self-assessment and supervisor rating) in…

Descriptors: Generalizability Theory, Human Services, Academic Achievement, Item Response Theory

Complex, Performance-Based Assessment: Expectations and Validation Criteria.

Peer reviewed

Linn, Robert L.; And Others – Educational Researcher, 1991

Increasing emphasis on assessment and concern about assessment techniques have stirred interest in alternative assessment forms, for which evidence is needed about consequences, transfer of performance on specific assessment tests, and assessment fairness. Criteria concerning consequences, fairness, transfer-generalizability, cognitive complexity,…

Descriptors: Achievement Tests, Cost Effectiveness, Educational Assessment, Educational Policy

Scoring Rubrics for Performance Tests: Lessons Learned from Job Performance Assessment in the Military.

Wise, Lauress – 1993

Industrial and organizational psychologists for the Department of Defense have been working for the past 10 years to develop high fidelity measures of job performance for use in validating job selection procedures and standards. Information on developing and scoring performance exercises in the Job Performance Measurement (JPM) Project is…

Descriptors: Educational Assessment, Educational Research, Evaluation Methods, Generalizability Theory

A Study Applying Generalizability Theory to the Scientific Thinking and Research Skill Test.

Kim, Yang Boon; Lee, Jong Sung – 1990

The empirical validity of generalizability theory was investigated by applying two three-facet designs to data obtained in 1988 from administration of the Scientific Thinking and Research Skill Test (STRST). The decision validity of the STRST was also examined. Subjects were 125 fifth-grade and 125 sixth-grade students who were administered the…

Descriptors: Analysis of Variance, Decision Making, Elementary School Students, Generalizability Theory

Applying Generalizability Theory To Evaluate Treatment Effect in Single-Subject Research.

Lefebvre, Daniel J.; Suen, Hoi K. – 1990

An empirical investigation of methodological issues associated with evaluating treatment effect in single-subject research (SSR) designs is presented. This investigation: (1) conducted a generalizability (G) study to identify the sources of systematic and random measurement error (SRME); (2) used an analytic approach based on G theory to integrate…

Descriptors: Classroom Observation Techniques, Disabilities, Educational Research, Error of Measurement

A Study of the Generalizability of the System for Teaching and Learning Assessment and Review (STAR).

Teddlie, Charles; And Others – 1990

The results are provided of an initial analysis of the reliability (generalizability) of the System for Teaching and Learning Assessment and Review (STAR) as a comprehensive measure of classroom teaching and learning for making teacher certification decisions. The STAR contains 140 indicators of teacher effectiveness and student learning, which…

Descriptors: Beginning Teachers, Classroom Observation Techniques, Elementary School Teachers, Elementary Secondary Education
