NotesFAQContact Us
Collection
Advanced
Search Tips
Laws, Policies, & Programs
Elementary and Secondary…1
Assessments and Surveys
National Assessment of…1
What Works Clearinghouse Rating
Showing 1 to 15 of 39 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Pere J. Ferrando; David Navarro-González; Fabia Morales-Vives – Educational and Psychological Measurement, 2025
The problem of local item dependencies (LIDs) is very common in personality and attitude measures, particularly in those that measure narrow-bandwidth dimensions. At the structural level, these dependencies can be modeled by using extended factor analytic (FA) solutions that include correlated residuals. However, the effects that LIDs have on the…
Descriptors: Scores, Accuracy, Evaluation Methods, Factor Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Raykov, Tenko; Marcoulides, George A. – Educational and Psychological Measurement, 2015
A latent variable modeling approach for scale reliability evaluation in heterogeneous populations is discussed. The method can be used for point and interval estimation of reliability of multicomponent measuring instruments in populations representing mixtures of an unknown number of latent classes or subpopulations. The procedure is helpful also…
Descriptors: Test Reliability, Evaluation Methods, Measurement Techniques, Computation
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Dziuban, Charles; Moskal, Patsy; Thompson, Jessica; Kramer, Lauren; DeCantis, Genevieve; Hermsdorfer, Andrea – Online Learning, 2015
The authors explore the possible relationship between student satisfaction with online learning and the theory of psychological contracts. The study incorporates latent trait models using the image analysis procedure and computation of Anderson and Rubin factors scores with contrasts for students who are satisfied, ambivalent, or dissatisfied with…
Descriptors: Student Attitudes, Online Courses, Scores, Learning Experience
Peer reviewed Peer reviewed
Direct linkDirect link
Evans, C.; Kandiko Howson, C.; Forsythe, A. – Higher Education Pedagogies, 2018
Internationally, the political appetite for educational measurement capable of capturing a metric of value for money and effectiveness has momentum. While most would agree with the need to assess costs relevant to quality to help support better governmental policy decisions about public spending, poorly understood measurement comes with unintended…
Descriptors: Higher Education, Achievement Gains, Political Issues, Quality Assurance
Smith, Julie M. – ProQuest LLC, 2011
This study examines the proposed Reliability Generalization (RG) method for studying reliability. RG employs the application of meta-analytic techniques similar to those used in validity generalization studies to examine reliability coefficients. This study explains why RG does not provide a proper research method for the study of reliability,…
Descriptors: Reliability, Generalization, Sampling, Research Methodology
Peer reviewed Peer reviewed
Direct linkDirect link
Casby, Michael W. – Child Language Teaching and Therapy, 2011
Mean length of utterance (MLU) is a frequently used measure of the expressive language of young children. The suggested conventional, contemporary, clinical practice is to calculate it from a language sample of a minimum of 50 to 100 contiguous intelligible utterances. This practice places considerable strain on professionals working with young…
Descriptors: Language Impairments, Young Children, Expressive Language, Developmental Delays
Peer reviewed Peer reviewed
Direct linkDirect link
Phillips, Gary W. – Applied Measurement in Education, 2015
This article proposes that sampling design effects have potentially huge unrecognized impacts on the results reported by large-scale district and state assessments in the United States. When design effects are unrecognized and unaccounted for they lead to underestimating the sampling error in item and test statistics. Underestimating the sampling…
Descriptors: State Programs, Sampling, Research Design, Error of Measurement
Peer reviewed Peer reviewed
Direct linkDirect link
Calmettes, Guillaume; Drummond, Gordon B.; Vowler, Sarah L. – Advances in Physiology Education, 2012
A jack knife is a pocket knife that is put to many tasks, because it's ready to hand. Often there could be a better tool for the job, such as a screwdriver, a scraper, or a can-opener, but these are not usually pocket items. In statistical terms, the expression implies making do with what's available. Another simile, of an extreme situation, is…
Descriptors: Statistical Analysis, Computation, Population Distribution, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Soslau, Elizabeth; Lewis, Kandia – Action in Teacher Education, 2014
For accreditation and programmatic decision making, education school administrators use inter-rater reliability analyses to judge credibility of student-teacher assessments. Although weak levels of agreement between university-appointed supervisors and cooperating teachers are usually interpreted to indicate that the process is not being…
Descriptors: Interrater Reliability, Accreditation (Institutions), Student Teacher Evaluation, Focus Groups
Bill & Melinda Gates Foundation, 2012
No one has a bigger stake in teaching effectiveness than students. Nor are there any better experts on how teaching is experienced by its intended beneficiaries. Only recently have many policymakers and practitioners come to recognize that--when asked the right questions, in the right ways--students can be an important source of information on the…
Descriptors: Student Surveys, Student Attitudes, Feedback (Response), Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Eva, Kevin W.; Solomon, Patty; Neville, Alan J.; Ladouceur, Michael; Kaufman, Karyn; Walsh, Allyn; Norman, Geoffrey R. – Advances in Health Sciences Education, 2007
Introduction: Tutorial-based assessment, despite providing a good match with the philosophy adopted by educational programmes that emphasize small group learning, remains one of the greatest challenges for educators working in this context. The current study was performed in an attempt to assess the psychometric characteristics of tutorial-based…
Descriptors: Construct Validity, Sampling, Psychometrics, Evaluation Methods
Peer reviewed Peer reviewed
Direct linkDirect link
Runyan, Desmond K.; Dunne, Michael P.; Zolotor, Adam J. – Child Abuse & Neglect: The International Journal, 2009
The "World Report on Children and Violence", (Pinheiro, 2006) was produced at the request of the UN Secretary General and the UN General Assembly. This report recommended improvement in research on child abuse. ISPCAN representatives took this charge and developed 3 new instruments. We describe this background and introduce three new measures…
Descriptors: Child Abuse, Screening Tests, Child Welfare, Test Construction
Peer reviewed Peer reviewed
Direct linkDirect link
Raykov, Tenko; Marcoulides, George A. – International Journal of Testing, 2006
A structural equation modeling approach to scale reliability evaluation can be employed to estimate generalizability theory indexes in settings where sampling of subjects and conditions is carried out. In one- and two-facet crossed designs, it is demonstrated how this method can be used to obtain estimates of relative generalizability…
Descriptors: Computation, Generalizability Theory, Structural Equation Models, Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
McAuliffe, Megan J.; Robb, Michael P.; Murdoch, Bruce E. – Clinical Linguistics & Phonetics, 2007
The study investigated adaptation to a standard electropalatographic (EPG) practise palate in a group of eight adults (mean age = 24 years). The participants read the phrase "a CVC" over four sampling conditions: prior to inserting the palate, immediately following insertion of the palate, 45 minutes after palate insertion, and 3 hours after…
Descriptors: Articulation (Speech), Phonology, Sampling, Acoustics
Peer reviewed Peer reviewed
Flack, Virginia F.; And Others – Psychometrika, 1988
A method is presented for determining sample size that will achieve a pre-specified bound on confidence interval width for the interrater agreement measure "kappa." The same results can be used when a pre-specified power is desired for testing hypotheses about the value of kappa. (Author/SLD)
Descriptors: Evaluation Methods, Interrater Reliability, Research Methodology, Research Problems
Previous Page | Next Page »
Pages: 1  |  2  |  3