Showing 376 to 390 of 728 results
Peer reviewed
Iramaneerat, Cherdsak; Yudkowsky, Rachel; Myford, Carol M.; Downing, Steven M. – Advances in Health Sciences Education, 2008
An Objective Structured Clinical Examination (OSCE) is an effective method for evaluating competencies. However, scores obtained from an OSCE are vulnerable to many potential measurement errors that cases, items, or standardized patients (SPs) can introduce. Monitoring these sources of errors is an important quality control mechanism to ensure…
Descriptors: Generalizability Theory, Rating Scales, Quality Control, Patients
Peer reviewed
Tong, Ye; Brennan, Robert L. – Educational and Psychological Measurement, 2007
Estimating standard errors of estimated variance components has long been a challenging task in generalizability theory. Researchers have speculated about the potential applicability of the bootstrap for obtaining such estimates, but they have identified problems (especially bias) in using the bootstrap. Using Brennan's bias-correcting procedures…
Descriptors: Error of Measurement, Generalizability Theory, Computation, Simulation
Peer reviewed
Murphy, Douglas J.; Bruce, David A.; Mercer, Stewart W.; Eva, Kevin W. – Advances in Health Sciences Education, 2009
To investigate the reliability and feasibility of six potential workplace-based assessment methods in general practice training: criterion audit, multi-source feedback from clinical and non-clinical colleagues, patient feedback (the CARE Measure), referral letters, significant event analysis, and video analysis of consultations. Performance of GP…
Descriptors: Reliability, Graduate Medical Education, Family Practice (Medicine), Vocational Evaluation
Peer reviewed
Lee, Guemin; Lewis, Daniel M. – Educational and Psychological Measurement, 2008
The bookmark standard-setting procedure is an item response theory-based method that is widely implemented in state testing programs. This study estimates standard errors for cut scores resulting from bookmark standard settings under a generalizability theory model and investigates the effects of different universes of generalization and error…
Descriptors: Generalizability Theory, Testing Programs, Error of Measurement, Cutting Scores
Peer reviewed
Heijne-Penninga, M.; Kuks, J. B. M.; Schonrock-Adema, J.; Snijders, T. A. B.; Cohen-Schotanus, J. – Advances in Health Sciences Education, 2008
Today's health sciences educational programmes have to deal with a growing and changing amount of knowledge. It is becoming increasingly important for students to be able to use and manage knowledge. We suggest incorporating open-book tests in assessment programmes to meet these changes. This view on the use of open-book tests is discussed and the…
Descriptors: Medical Schools, College Students, Information Management, Test Reliability
Peer reviewed
Solano-Flores, Guillermo; Li, Min – Educational Measurement: Issues and Practice, 2009
We addressed the challenge of scoring cognitive interviews in research involving multiple cultural groups. We interviewed 123 fourth- and fifth-grade students from three cultural groups to probe how they related a mathematics item to their personal lives. Item meaningfulness--the tendency of students to relate the content and/or context of an item…
Descriptors: Generalizability Theory, Scoring, Error of Measurement, Grade 5
Peer reviewed
Shumate, Steven R.; Surles, James; Johnson, Robert L.; Penny, Jim – Applied Measurement in Education, 2007
Increasingly, assessment practitioners use generalizability coefficients to estimate the reliability of scores from performance tasks. Little research, however, examines the relation between the estimation of generalizability coefficients and the number of rubric scale points and score distributions. The purpose of the present research is to…
Descriptors: Generalizability Theory, Monte Carlo Methods, Measures (Individuals), Program Effectiveness
Peer reviewed
Raymond, Mark R.; Neustel, Sandra; Anderson, Dan – Educational Measurement: Issues and Practice, 2009
Examinees who take high-stakes assessments are usually given an opportunity to repeat the test if they are unsuccessful on their initial attempt. To prevent examinees from obtaining unfair score increases by memorizing the content of specific test items, testing agencies usually assign a different test form to repeat examinees. The use of multiple…
Descriptors: Test Results, Test Items, Testing, Aptitude Tests
Peer reviewed
Marsh, Herbert W.; Bornmann, Lutz; Mutz, Rudiger; Daniel, Hans-Dieter; O'Mara, Alison – Review of Educational Research, 2009
Peer review is valued in higher education, but also widely criticized in terms of potential biases, particularly gender. We evaluate gender differences in peer reviews of grant applications, extending Bornmann, Mutz, and Daniel's meta-analyses that reported small gender differences in favor of men (d = 0.04), but a substantial heterogeneity in…
Descriptors: Effect Size, Gender Differences, Grants, Peer Evaluation
Peer reviewed
Solano-Flores, Guillermo; Li, Min – Assessment for Effective Intervention, 2008
The dependability of academic achievement measures for English language learners (ELLs) is influenced by three facts: (a) Each ELL has unique strengths and weaknesses in each language mode (listening, speaking, reading, and writing) both in English and in his or her first language, (b) each test item poses a different set of linguistic demands…
Descriptors: Generalizability Theory, Test Items, Dialects, Academic Achievement
Peer reviewed
Rodriguez-Campos, Liliana; Rincones-Gomez, Rigoberto; Shen, Jianping – Frontiers of Education in China, 2008
Structural Equation Modeling (SEM) was used in this study to determine the extent to which teachers, principals, and superintendents perceive the leadership construct in the same way. The researchers found that the two-factor model fits the principal group and particularly the superintendent group better than does the four-factor model. The…
Descriptors: Structural Equation Models, Superintendents, Principals, Teacher Attitudes
Lam, Ling Chi Tenny – ProQuest LLC, 2010
In writing assessment, there are quite a number of factors influencing the marking stability and the reliability of the assessment such as the attitude towards marking and consistency of markers, the physical environment, the design of the items, and marking rubrics. Even the methods to train markers have effects on the reliability of the…
Descriptors: Foreign Countries, Grading, Scoring Rubrics, Educational Assessment
Atilgan, Hakan – International Journal of Research & Method in Education, 2008
The "Special Ability Selection Examination" (SASE), which is used to select appropriate students for the music education departments of educational faculties in Turkey, has many subsections and must evaluate highly competitive cohorts of students according to a broad range of criteria. The test consists of three subsections, with a large…
Descriptors: Generalizability Theory, Schools of Education, Music Education, Music
Peer reviewed
Bergeron, Renee; Floyd, Randy G.; McCormack, Allison C.; Farmer, William L. – School Psychology Review, 2008
The dependability of externalizing behavior composites and subscale scores from the Behavior Assessment System for Children, Second Edition, Teacher Rating Scale-Child (Reynolds & Kamphaus, 2004) and the Achenbach System of Empirically Based Assessment, Teacher's Report Form for Ages 6-18 (Achenbach & Rescorla, 2001) was investigated.…
Descriptors: Generalizability Theory, Scores, Rating Scales, Error of Measurement
Peer reviewed
Fendler, Lynn – Journal of Philosophy of Education, 2006
In the United States there is an increasing tendency to view the only educational research worthy of federal funding as that which is designed as an experiment using randomised controls. One of the foundational assumptions underlying this research design is that the results of such research are meant to be generalisable beyond any particular…
Descriptors: Generalizability Theory, Educational Research, Research Design, Research Projects