NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 1 to 15 of 1,612 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Hulteen, Ryan M.; True, Larissa; Kroc, Edward – Measurement in Physical Education and Exercise Science, 2023
The typical process for assessing inter-rater reliability is facilitated by training raters within a research team. Lacking is an understanding if inter-rater reliability scores "between" research teams demonstrate adequate reliability. This study examined inter-rater reliability between 16 researchers who assessed fundamental motor…
Descriptors: Psychomotor Skills, Scores, Reliability, Interrater Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Verdugo, Miguel Angel; Vicente, Eva; Guillén, Verónica Marina; Sánchez, Sergio; Ibáñez, Alba; Gómez, Laura Elisabet – International Journal of Developmental Disabilities, 2023
Background: Appropriate supports and instructional practices contribute to the development of self-determination. Also, research shows that the promotion of skills related to self-determination has been linked to the achievement of desired outcomes over the different life stages. Advances in self-determination require the development of assessment…
Descriptors: Measures (Individuals), Self Determination, Intellectual Disability, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Arielle Boguslav; Julie Cohen – Journal of Teacher Education, 2024
Teacher preparation programs are increasingly expected to use data on preservice teacher (PST) skills to drive program improvement and provide targeted supports. Observational ratings are especially vital, but also prone to measurement issues. Scores may be influenced by factors unrelated to PSTs' instructional skills, including rater standards.…
Descriptors: Preservice Teachers, Measures (Individuals), Evaluation Problems, Teaching Skills
Peer reviewed Peer reviewed
Direct linkDirect link
Kristen Bottema-Beutel; Shannon Crowley LaPoint; So Yoon Kim; Sarah Mohiuddin; Qun Yu; Rachael McKinnon – Exceptional Children, 2024
In this secondary analysis of a previously conducted systematic review, we analyze social validity assessments in intervention research for transition-age autistic youth. Social validity is concerned with the acceptability of the intervention goals, the acceptability and feasibility of the intervention procedures, and the perceived importance of…
Descriptors: Autism Spectrum Disorders, Intervention, Validity, Psychometrics
Peer reviewed Peer reviewed
Direct linkDirect link
Pereira, Valerie J.; Tuomainen, Jyrki; Lee, Kathy Y. S.; Tong, Michael C. F.; Sell, Debbie A. – International Journal of Language & Communication Disorders, 2021
Background: The status of the velopharyngeal mechanism can be inferred from perceptual ratings of specified speech parameters. Several studies have proposed the measure of an overall velopharyngeal composite score based on these perceptual ratings and have reported good validity. The Cleft Audit Protocol for Speech--Augmented (CAPS-A) is a…
Descriptors: Congenital Impairments, Speech Tests, Outcome Measures, Test Validity
Peer reviewed Peer reviewed
Direct linkDirect link
Brogan L. Barr; Virginia V. W. McIntosh; Eileen F. Britt; Jennifer Jordan; Janet D. Carter – Measurement: Interdisciplinary Research and Perspectives, 2024
Even when raters demonstrate agreement in the use of a measure, limited score variability or violation of often-ignored statistical assumptions can result in lower reliability estimates than intuitively expected. This article uses data drawn from two randomized controlled trials of schema therapy and cognitive behavioral therapy for the treatment…
Descriptors: Evaluators, Interrater Reliability, Reliability, Measurement Techniques
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Metsämuuronen, Jari – Practical Assessment, Research & Evaluation, 2023
Traditional estimators of reliability such as coefficients alpha, theta, omega, and rho (maximal reliability) are prone to give radical underestimates of reliability for the tests common when testing educational achievement. These tests are often structured by widely deviating item difficulties. This is a typical pattern where the traditional…
Descriptors: Test Reliability, Achievement Tests, Computation, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Orhan, Ali – Journal of Psychoeducational Assessment, 2022
The aims of this reliability generalization study were to provide the overall alpha values of the California critical thinking disposition inventory (CCTDI) total score and subscales scores and investigate the characteristics of the studies that may be associated with the variability in the reliability values of the CCTDI total score and subscales…
Descriptors: Critical Thinking, Measures (Individuals), Test Reliability, Generalization
Peer reviewed Peer reviewed
Direct linkDirect link
M. Arda Atakaya; Ugur Sak; M. Bahadir Ayas – Creativity Research Journal, 2024
Scoring in creativity research has been a central problem since creativity became an important issue in psychology and education in the 1950s. The current study examined the psychometric properties of 27 creativity indices derived from summed and averaged scores using 15 scoring methods. Participants included 2802 middle-school students. Data…
Descriptors: Psychometrics, Creativity, Creativity Tests, Scoring
Peer reviewed Peer reviewed
Direct linkDirect link
Ehri Ryu – Society for Research on Educational Effectiveness, 2024
Background/Context: Confirmatory factor analysis (CFA) model is a commonly adopted framework to estimate and test a measurement model. Once a well-fitting final CFA model is selected, the selected model may be used to test structural relationships of the latent constructs with other variables, to construct a test with desired reliability and…
Descriptors: Research Problems, Factor Analysis, Scores, Computation
Peer reviewed Peer reviewed
Direct linkDirect link
Moore, C. Missy; Crawford, Carey C.; Tertichny, Alissa – Measurement and Evaluation in Counseling and Development, 2023
We examined dimensionality and temporal stability of the Interpersonal Stress Scale-Counselor (ISS-C) scores in a sample of professional counselors (n = 518). Confirmatory factor analyses provided support for a four-factor model previously identified through exploratory factor analysis and a bifactor model. Using a randomized test-retest, temporal…
Descriptors: Counselors, Interpersonal Relationship, Stress Variables, Measures (Individuals)
Peer reviewed Peer reviewed
Direct linkDirect link
Bouwer, Renske; Koster, Monica; van den Bergh, Huub – Assessment in Education: Principles, Policy & Practice, 2023
Assessing students' writing performance is essential to adequately monitor and promote individual writing development, but it is also a challenge. The present research investigates a benchmark rating procedure for assessing texts written by upper-elementary students. In two studies we examined whether a benchmark rating procedure (1) leads to…
Descriptors: Benchmarking, Writing Evaluation, Evaluation Methods, Elementary School Students
Peer reviewed Peer reviewed
Direct linkDirect link
Almehrizi, Rashid S. – Educational Measurement: Issues and Practice, 2022
Coefficient alpha reliability persists as the most common reliability coefficient reported in research. The assumptions for its use are, however, not well-understood. The current paper challenges the commonly used expressions of coefficient alpha and argues that while these expressions are correct when estimating reliability for summed scores,…
Descriptors: Reliability, Scores, Scaling, Statistical Analysis
Peer reviewed Peer reviewed
Direct linkDirect link
Steinmann, Isa; Sánchez, Daniel; van Laar, Saskia; Braeken, Johan – Assessment in Education: Principles, Policy & Practice, 2022
Questionnaire scales that are mixed-worded, i.e. include both positively and negatively worded items, often suffer from issues like low reliability and more complex latent structures than intended. Part of the problem might be that some responders fail to respond consistently to the mixed-worded items. We investigated the prevalence and impact of…
Descriptors: Response Style (Tests), Test Items, Achievement Tests, Foreign Countries
Peer reviewed Peer reviewed
Direct linkDirect link
Aberdine R. Dwight; Amy M. Briesch; Jessica A. Hoffman; Christopher Rutt – Child & Youth Care Forum, 2024
Background: Although the Depression Anxiety Stress Scales, Short Form (DASS-21) was developed for adults, its authors noted no compelling reasons to not use the measure with youth as young as 12 years. Despite increasingly widespread use with youth, psychometric evidence in support of its use with this population needs to be investigated to fully…
Descriptors: Depression (Psychology), Measures (Individuals), Anxiety, Stress Variables
Previous Page | Next Page »
Pages: 1  |  2  |  3  |  4  |  5  |  6  |  7  |  8  |  9  |  10  |  11  |  ...  |  108