Publication Date
In 2025 | 2 |
Since 2024 | 15 |
Since 2021 (last 5 years) | 68 |
Since 2016 (last 10 years) | 171 |
Since 2006 (last 20 years) | 439 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
Researchers | 28 |
Practitioners | 2 |
Policymakers | 1 |
Students | 1 |
Location
Turkey | 14 |
Canada | 10 |
United States | 10 |
California | 9 |
Netherlands | 9 |
Australia | 6 |
Germany | 6 |
South Korea | 6 |
Iowa | 5 |
Norway | 5 |
Turkey (Ankara) | 5 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 2 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Volpe, Robert J.; Briesch, Amy M.; Gadow, Kenneth D. – Journal of School Psychology, 2011
Although the efficiency with which a wide range of behavioral data can be obtained makes behavior rating scales particularly attractive tools for the purposes of screening and evaluation, feasibility concerns arise in the context of formative assessment. Specifically, informant load, or the amount of time informants are asked to contribute to the…
Descriptors: Generalizability Theory, Formative Evaluation, Behavior Rating Scales, Measures (Individuals)
Praetorius, Anna-Katharina; Lenske, Gerlinde; Helmke, Andreas – Learning and Instruction, 2012
Despite considerable interest in the topic of instructional quality in research as well as practice, little is known about the quality of its assessment. Using generalizability analysis as well as content analysis, the present study investigates how reliably and validly instructional quality is measured by observer ratings. Twelve trained raters…
Descriptors: Student Teachers, Interrater Reliability, Content Analysis, Observation
Keller, Lisa A.; Clauser, Brian E.; Swanson, David B. – Advances in Health Sciences Education, 2010
In recent years, demand for performance assessments has continued to grow. However, performance assessments are notorious for lower reliability, and in particular, low reliability resulting from task specificity. Since reliability analyses typically treat the performance tasks as randomly sampled from an infinite universe of tasks, these estimates…
Descriptors: Generalizability Theory, Test Reliability, Performance Based Assessment, Error of Measurement
Alkahtani, Saif F. – ProQuest LLC, 2012
The principal aim of the present study was to better guide the Quranic recitation appraisal practice by presenting an application of Generalizability theory and Many-facet Rasch Measurement Model for assessing the dependability and fit of two suggested rubrics. Recitations of 93 students were rated holistically and analytically by 3 independent…
Descriptors: Generalizability Theory, Item Response Theory, Verbal Tests, Islam
Maier, Kimberly S.; Maiti, Tapabrata; Dass, Sarat C.; Lim, Chae Young – Society for Research on Educational Effectiveness, 2012
The purpose of this study is to develop an estimate of Adequate Yearly Progress (AYP) that will allow for reliable and valid comparisons among student subgroups, schools, and districts. A shrinkage-type estimator of AYP using the Bayesian framework is described. Using simulated data, the performance of the Bayes estimator will be compared to…
Descriptors: Educational Improvement, Federal Programs, Academic Achievement, Educational Indicators
Heilmann, John; DeBrock, Lindsay; Riley-Tillman, T. Chris – American Journal of Speech-Language Pathology, 2013
Purpose: The purpose of this study was to examine the reliability of, and sources of variability in, language measures from interviews collected from young school-age children. Method: Two 10-min interviews were collected from 20 at-risk kindergarten children by an examiner using a standardized set of questions. Test-retest reliability…
Descriptors: Measures (Individuals), Structured Interviews, Reliability, Kindergarten
Carman, Carol A. – Journal of Advanced Academics, 2013
The lack of a unified definition of giftedness leads researchers to use very different operationalizations when selecting a sample of gifted individuals for use in research. We found 104 empirical articles from 38 journals that differentiated between gifted and nongifted students which were analyzed to determine the most common methods of…
Descriptors: Gifted, Educational Research, Educational History, Bibliometrics
Orem, Chris D. – ProQuest LLC, 2012
Meta-assessment, or the assessment of assessment, can provide meaningful information about the trustworthiness of an academic program's assessment results (Bresciani, Gardner, & Hickmott, 2009; Palomba & Banta, 1999; Suskie, 2009). Many institutions conduct meta-assessments for their academic programs (Fulcher, Swain, & Orem, 2012),…
Descriptors: Validity, Evidence, Evaluation Methods, Meta Analysis
Mercer, Sterett H.; Dufrene, Brad A.; Zoder-Martell, Kimberly; Harpole, Lauren Lestremau; Mitchell, Rachel R.; Blaze, John T. – Assessment for Effective Intervention, 2012
Despite growing use of CBM Maze in universal screening and research, little information is available regarding the number of CBM Maze probes needed for reliable decisions. The current study extends existing research on the technical adequacy of CBM Maze by investigating the number of probes and assessment durations (1-3 min) needed for reliable…
Descriptors: Generalizability Theory, Curriculum Based Assessment, Reading Tests, Cloze Procedure
Thipwiwatpotjana, Phantipa – ProQuest LLC, 2010
Uncertainty occurs when there is more than one realization that can represent an information. This dissertation concerns merely discrete realizations of an uncertainty. Different interpretations of an uncertainty and their relationships are addressed when the uncertainty is not a probability of each realization. A well known model that can handle…
Descriptors: Intervals, Programming, Mathematical Applications, Probability
Zhou, Hong; Muellerleile, Paige; Ingram, Debra; Wong, Seok P. – Journal of Educational and Behavioral Statistics, 2011
Intraclass correlation coefficients (ICCs) are commonly used in behavioral measurement and psychometrics when a researcher is interested in the relationship among variables of a common class. The formulas for deriving ICCs, or generalizability coefficients, vary depending on which models are specified. This article gives the equations for…
Descriptors: Computation, Statistical Analysis, Generalizability Theory, Correlation
Williams, Judith C.; Alwis, W. A. M.; Rotgans, Jerome I. – Advances in Health Sciences Education, 2011
The purpose of this study was to investigate the stability of three distinct tutor behaviors (1) use of subject-matter expertise, (2) social congruence and (3) cognitive congruence, in a problem-based learning (PBL) environment. The data comprised the input from 16,047 different students to a survey of 762 tutors administered in three consecutive…
Descriptors: Expertise, Generalizability Theory, Tutor Training, Problem Based Learning
Sao Pedro, Michael A.; Baker, Ryan S. J. d.; Gobert, Janice D. – Grantee Submission, 2013
When validating assessment models built with data mining, generalization is typically tested at the student-level, where models are tested on new students. This approach, though, may fail to find cases where model performance suffers if other aspects of those cases relevant to prediction are not well represented. We explore this here by testing if…
Descriptors: Educational Research, Data Collection, Data Analysis, Generalizability Theory
Gugiu, Mihaiela R.; Gugiu, Paul C.; Baldus, Robert – Journal of MultiDisciplinary Evaluation, 2012
Background: Educational researchers have long espoused the virtues of writing with regard to student cognitive skills. However, research on the reliability of the grades assigned to written papers reveals a high degree of contradiction, with some researchers concluding that the grades assigned are very reliable whereas others suggesting that they…
Descriptors: Grades (Scholastic), Grading, Scoring Rubrics, Research Design
Crits-Christoph, Paul; Gibbons, Mary Beth Connolly; Hamilton, Jessica; Ring-Kurtz, Sarah; Gallop, Robert – Journal of Consulting and Clinical Psychology, 2011
Objective: To examine the dependability of alliance scores at the patient and therapist level, to evaluate the potential causal direction of session-to-session changes in alliance and depressive symptoms, and to investigate the impact of aggregating the alliance over progressively more sessions on the size of the alliance-outcome relationship.…
Descriptors: Counselor Client Relationship, Generalizability Theory, Patients, Psychotherapy