ERIC - Search Results

Publication Date

In 2025	2
Since 2024	15
Since 2021 (last 5 years)	68
Since 2016 (last 10 years)	171
Since 2006 (last 20 years)	439

Descriptor

Generalizability Theory	728
Reliability	168
Scores	146
Error of Measurement	133
Test Reliability	125
Interrater Reliability	120
Foreign Countries	102
Statistical Analysis	85
Evaluation Methods	82
Psychometrics	75
Research Methodology	67
Validity	66
Test Validity	64
Models	62
Comparative Analysis	59
Correlation	59
Higher Education	59
Scoring	59
Item Response Theory	57
Performance Based Assessment	57
Research Design	57
Test Items	54
Test Construction	49
Elementary School Students	48
Test Theory	47
More ▼

Education Level

Higher Education	115
Postsecondary Education	68
Elementary Education	59
Secondary Education	42
Middle Schools	33
Elementary Secondary Education	29
Early Childhood Education	24
Junior High Schools	22
Grade 8	17
Grade 3	15
Preschool Education	15
Grade 4	14
Grade 5	13
Primary Education	13
Grade 7	12
High Schools	12
Intermediate Grades	11
Adult Education	10
Grade 6	7
Kindergarten	7
Grade 10	6
Grade 9	6
Grade 1	4
Grade 2	4
Two Year Colleges	3
More ▼

Audience

Researchers	28
Practitioners	2
Policymakers	1
Students	1

Location

Turkey	14
Canada	10
United States	10
California	9
Netherlands	9
Australia	6
Germany	6
South Korea	6
Iowa	5
Norway	5
Turkey (Ankara)	5
United Kingdom	5
Florida	4
South Africa	4
Tennessee	4
China	3
Hong Kong	3
Indiana	3
Japan	3
North Carolina	3
Texas	3
Alabama	2
China (Beijing)	2
Colorado	2
Cyprus	2
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 376 to 390 of 728 results Save | Export

Quality Control of an OSCE Using Generalizability Theory and Many-Faceted Rasch Measurement

Peer reviewed

Direct link

Iramaneerat, Cherdsak; Yudkowsky, Rachel; Myford, Carol M.; Downing, Steven M. – Advances in Health Sciences Education, 2008

An Objective Structured Clinical Examination (OSCE) is an effective method for evaluating competencies. However, scores obtained from an OSCE are vulnerable to many potential measurement errors that cases, items, or standardized patients (SPs) can introduce. Monitoring these sources of errors is an important quality control mechanism to ensure…

Descriptors: Generalizability Theory, Rating Scales, Quality Control, Patients

Bootstrap Estimates of Standard Errors in Generalizability Theory

Peer reviewed

Direct link

Tong, Ye; Brennan, Robert L. – Educational and Psychological Measurement, 2007

Estimating standard errors of estimated variance components has long been a challenging task in generalizability theory. Researchers have speculated about the potential applicability of the bootstrap for obtaining such estimates, but they have identified problems (especially bias) in using the bootstrap. Using Brennan's bias-correcting procedures…

Descriptors: Error of Measurement, Generalizability Theory, Computation, Simulation

The Reliability of Workplace-Based Assessment in Postgraduate Medical Education and Training: A National Evaluation in General Practice in the United Kingdom

Peer reviewed

Direct link

Murphy, Douglas J.; Bruce, David A.; Mercer, Stewart W.; Eva, Kevin W. – Advances in Health Sciences Education, 2009

To investigate the reliability and feasibility of six potential workplace-based assessment methods in general practice training: criterion audit, multi-source feedback from clinical and non-clinical colleagues, patient feedback (the CARE Measure), referral letters, significant event analysis, and video analysis of consultations. Performance of GP…

Descriptors: Reliability, Graduate Medical Education, Family Practice (Medicine), Vocational Evaluation

A Generalizability Theory Approach to Standard Error Estimates for Bookmark Standard Settings

Peer reviewed

Direct link

Lee, Guemin; Lewis, Daniel M. – Educational and Psychological Measurement, 2008

The bookmark standard-setting procedure is an item response theory-based method that is widely implemented in state testing programs. This study estimates standard errors for cut scores resulting from bookmark standard settings under a generalizability theory model and investigates the effects of different universes of generalization and error…

Descriptors: Generalizability Theory, Testing Programs, Error of Measurement, Cutting Scores

Open-Book Tests to Complement Assessment-Programmes: Analysis of Open and Closed-Book Tests

Peer reviewed

Direct link

Heijne-penninga, M.; Kuks, J. B. M.; Schonrock-adema, J.; Snijders, T. A. B.; Cohen-schotanus, J. – Advances in Health Sciences Education, 2008

Today's health sciences educational programmes have to deal with a growing and changing amount of knowledge. It is becoming increasingly important for students to be able to use and manage knowledge. We suggest incorporating open-book tests in assessment programmes to meet these changes. This view on the use of open-book tests is discussed and the…

Descriptors: Medical Schools, College Students, Information Management, Test Reliability

Generalizability of Cognitive Interview-Based Measures across Cultural Groups

Peer reviewed

Direct link

Solano-Flores, Guillermo; Li, Min – Educational Measurement: Issues and Practice, 2009

We addressed the challenge of scoring cognitive interviews in research involving multiple cultural groups. We interviewed 123 fourth- and fifth-grade students from three cultural groups to probe how they related a mathematics item to their personal lives. Item meaningfulness--the tendency of students to relate the content and/or context of an item…

Descriptors: Generalizability Theory, Scoring, Error of Measurement, Grade 5

The Effects of the Number of Scale Points and Non-Normality on the Generalizability Coefficient: A Monte Carlo Study

Peer reviewed

Direct link

Shumate, Steven R.; Surles, James; Johnson, Robert L.; Penny, Jim – Applied Measurement in Education, 2007

Increasingly, assessment practitioners use generalizability coefficients to estimate the reliability of scores from performance tasks. Little research, however, examines the relation between the estimation of generalizability coefficients and the number of rubric scale points and score distributions. The purpose of the present research is to…

Descriptors: Generalizability Theory, Monte Carlo Methods, Measures (Individuals), Program Effectiveness

Same-Form Retest Effects on Credentialing Examinations

Peer reviewed

Direct link

Raymond, Mark R.; Neustel, Sandra; Anderson, Dan – Educational Measurement: Issues and Practice, 2009

Examinees who take high-stakes assessments are usually given an opportunity to repeat the test if they are unsuccessful on their initial attempt. To prevent examinees from obtaining unfair score increases by memorizing the content of specific test items, testing agencies usually assign a different test form to repeat examinees. The use of multiple…

Descriptors: Test Results, Test Items, Testing, Aptitude Tests

Gender Effects in the Peer Reviews of Grant Proposals: A Comprehensive Meta-Analysis Comparing Traditional and Multilevel Approaches

Peer reviewed

Direct link

Marsh, Herbert W.; Bornmann, Lutz; Mutz, Rudiger; Daniel, Hans-Dieter; O'Mara, Alison – Review of Educational Research, 2009

Peer review is valued in higher education, but also widely criticized in terms of potential biases, particularly gender. We evaluate gender differences in peer reviews of grant applications, extending Bornmann, Mutz, and Daniel's meta-analyses that reported small gender differences in favor of men (d = 0.04), but a substantial heterogeneity in…

Descriptors: Effect Size, Gender Differences, Grants, Peer Evaluation

Examining the Dependability of Academic Achievement Measures for English Language Learners

Peer reviewed

Direct link

Solano-Flores, Guillermo; Li, Min – Assessment for Effective Intervention, 2008

The dependability of academic achievement measures for English language learners (ELLs) is influenced by three facts: (a) Each ELL has unique strengths and weaknesses in each language mode (listening, speaking, reading, and writing) both in English and in his or her first language, (b) each test item poses a different set of linguistic demands…

Descriptors: Generalizability Theory, Test Items, Dialects, Academic Achievement

Do Teachers, Principals, and Superintendents Perceive Leadership the Same Way? A Structural Equation Modeling Test of the Equivalence of a Multi-Dimensional Construct across Groups

Peer reviewed

Direct link

Rodriguez-Campos, Liliana; Rincones-Gomez, Rigoberto; Shen, Jianping – Frontiers of Education in China, 2008

Structural Equation Modeling (SEM) was used in this study to determine the extent to which teachers, principals, and superintendents perceive the leadership construct in the same way. The researchers found that the two-factor model fits the principal group and particularly the superintendent group better than does the four-factor model. The…

Descriptors: Structural Equation Models, Superintendents, Principals, Teacher Attitudes

An Application of Generalizability Theory on Writing Assessment: Effects of Marking Components Weighting

Direct link

Lam, Ling Chi Tenny – ProQuest LLC, 2010

In writing assessment, there are quite a number of factors influencing the marking stability and the reliability of the assessment such as the attitude towards marking and consistency of markers, the physical environment, the design of the items, and marking rubrics. Even the methods to train markers have effects on the reliability of the…

Descriptors: Foreign Countries, Grading, Scoring Rubrics, Educational Assessment

Using Generalizability Theory to Assess the Score Reliability of the Special Ability Selection Examinations for Music Education Programmes in Higher Education

Direct link

Atilgan, Hakan – International Journal of Research & Method in Education, 2008

The "Special Ability Selection Examination" (SASE), which is used to select appropriate students for the music education departments of educational faculties in Turkey, has many subsections and must evaluate highly competitive cohorts of students according to a broad range of criteria. The test consists of three subsections, with a large…

Descriptors: Generalizability Theory, Schools of Education, Music Education, Music

The Generalizability of Externalizing Behavior Composites and Subscale Scores across Time, Rater, and Instrument

Peer reviewed

Direct link

Bergeron, Renee; Floyd, Randy G.; McCormack, Allison C.; Farmer, William L. – School Psychology Review, 2008

The dependability of externalizing behavior composites and subscale scores from the Behavior Assessment System for Children, Second Edition, Teacher Rating Scale-Child (Reynolds & Kamphaus, 2004) and the Achenbach System of Empirically Based Assessment, Teacher's Report Form for Ages 6-18 (Achenbach & Rescorla, 2001) was investigated.…

Descriptors: Generalizability Theory, Scores, Rating Scales, Error of Measurement

Why Generalisability Is Not Generalisable

Peer reviewed

Direct link

Fendler, Lynn – Journal of Philosophy of Education, 2006

In the United States there is an increasing tendency to view the only educational research worthy of federal funding as that which is designed as an experiment using randomised controls. One of the foundational assumptions underlying this research design is that the results of such research are meant to be generalisable beyond any particular…

Descriptors: Generalizability Theory, Educational Research, Research Design, Research Projects

« Previous Page | Next Page »

Pages: 1 | ... | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | ... | 49

Educational and Psychological…	44
Advances in Health Sciences…	27
Journal of Educational…	24
Applied Measurement in…	23
ProQuest LLC	18
Language Testing	13
Grantee Submission	11
Society for Research on…	11
Psychometrika	10
School Psychology Review	10
Applied Psychological…	9
Educational Measurement:…	9
Online Submission	8
International Journal of…	7
Journal of Educational…	7
Journal of Educational…	7
Multivariate Behavioral…	7
Behavioral Research and…	6
Educational Researcher	6
Educational Sciences: Theory…	6
Journal of Psychoeducational…	6
Measurement and Evaluation in…	6
Review of Educational Research	6
School Psychology Quarterly	6
Assessment for Effective…	5
More ▼

Brennan, Robert L.	18
Lee, Guemin	13
Briesch, Amy M.	11
Clauser, Brian E.	9
Chafouleas, Sandra M.	8
Riley-Tillman, T. Chris	8
Solano-Flores, Guillermo	8
Volpe, Robert J.	8
Christ, Theodore J.	7
Lee, Yong-Won	7
Marcoulides, George A.	7
Shavelson, Richard J.	7
Tindal, Gerald	7
Alonzo, Julie	6
Anderson, Daniel	6
Hagtvet, Knut A.	5
Harik, Polina	5
Miller, M. David	5
Raymond, Mark R.	5
Atilgan, Hakan	4
Chang, Lei	4
Fitzpatrick, Anne R.	4
French, Brian F.	4
Guler, Nese	4
More ▼

Journal Articles	534
Reports - Research	426
Reports - Evaluative	180
Speeches/Meeting Papers	115
Reports - Descriptive	57
Opinion Papers	27
Information Analyses	25
Dissertations/Theses -…	19
Numerical/Quantitative Data	19
Tests/Questionnaires	11
Guides - Non-Classroom	6
Books	5
Collected Works - General	3
Book/Product Reviews	2
Reference Materials -…	2
Reports - General	2
Collected Works - Serials	1
Dissertations/Theses -…	1
Guides - Classroom - Learner	1
Guides - General	1
Non-Print Media	1
Reference Materials - General	1
More ▼

Test of English as a Foreign…	6
Program for International…	4
Teacher Performance…	4
Dynamic Indicators of Basic…	3
Trends in International…	3
ACT Assessment	2
Childrens Depression Inventory	2
National Assessment of…	2
National Survey of Student…	2
Progress in International…	2
SAT (College Admission Test)	2
Students Evaluation of…	2
Test of English for…	2
United States Medical…	2
Wechsler Adult Intelligence…	2
Advanced Placement…	1
Battelle Developmental…	1
Behavior Assessment System…	1
Big Five Inventory	1
Classroom Assessment Scoring…	1
Cognitive Abilities Test	1
Conners Teacher Rating Scale	1
Early Childhood Environment…	1
Eating Disorder Inventory	1
Eysenck Personality Inventory	1
More ▼