Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 9 |
Since 2006 (last 20 years) | 43 |
Descriptor
Correlation | 61 |
Reliability | 38 |
Test Reliability | 15 |
Factor Analysis | 12 |
Interrater Reliability | 11 |
Models | 11 |
Statistical Analysis | 11 |
Test Validity | 11 |
Validity | 11 |
Error of Measurement | 9 |
Computation | 8 |
More ▼ |
Source
Author
Raykov, Tenko | 3 |
Daniel, Larry G. | 2 |
Fan, Xitao | 2 |
Goldstein, Miriam D. | 2 |
Haberman, Shelby J. | 2 |
Onwuegbuzie, Anthony J. | 2 |
Ables, Adrienne Z. | 1 |
Aleong, Chandra | 1 |
Anthony, James C. | 1 |
Arth, Thomas O. | 1 |
Atkins, David C. | 1 |
More ▼ |
Publication Type
Reports - Descriptive | 61 |
Journal Articles | 49 |
Speeches/Meeting Papers | 4 |
Books | 2 |
Tests/Questionnaires | 2 |
Numerical/Quantitative Data | 1 |
Education Level
Higher Education | 9 |
Elementary Secondary Education | 7 |
Postsecondary Education | 6 |
Elementary Education | 3 |
Adult Education | 1 |
Grade 4 | 1 |
High Schools | 1 |
Middle Schools | 1 |
Secondary Education | 1 |
Audience
Practitioners | 2 |
Teachers | 2 |
Administrators | 1 |
Researchers | 1 |
Location
California | 2 |
Australia | 1 |
Florida | 1 |
Ireland (Dublin) | 1 |
Russia | 1 |
Sweden | 1 |
United Kingdom (England) | 1 |
United States | 1 |
Wisconsin | 1 |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 2 |
Assessments and Surveys
What Works Clearinghouse Rating
Raykov, Tenko; Anthony, James C.; Menold, Natalja – Educational and Psychological Measurement, 2023
The population relationship between coefficient alpha and scale reliability is studied in the widely used setting of unidimensional multicomponent measuring instruments. It is demonstrated that for any set of component loadings on the common factor, regardless of the extent of their inequality, the discrepancy between alpha and reliability can be…
Descriptors: Correlation, Evaluation Research, Reliability, Measurement Techniques
Ferrando, Pere J.; Lorenzo-Seva, Urbano – Educational and Psychological Measurement, 2019
Measures initially designed to be single-trait often yield data that are compatible with both an essentially unidimensional factor-analysis (FA) solution and a correlated-factors solution. For these cases, this article proposes an approach aimed at providing information for deciding which of the two solutions is the most appropriate and useful.…
Descriptors: Factor Analysis, Computation, Reliability, Goodness of Fit
Raykov, Tenko; Marcoulides, George A.; Li, Tenglong – Educational and Psychological Measurement, 2018
This note extends the results in the 2016 article by Raykov, Marcoulides, and Li to the case of correlated errors in a set of observed measures subjected to principal component analysis. It is shown that when at least two measures are fallible, the probability is zero for any principal component--and in particular for the first principal…
Descriptors: Factor Analysis, Error of Measurement, Correlation, Reliability
Trafimow, David – Teaching Statistics: An International Journal for Teachers, 2016
Much of the science reported in the media depends on correlation coefficients. But the size of correlation coefficients depends, in part, on the reliability with which the correlated variables are measured. Understanding this is a statistical literacy issue.
Descriptors: Statistics, Statistical Analysis, Correlation, Reliability
Leckie, George – Journal of Educational and Behavioral Statistics, 2018
The traditional approach to estimating the consistency of school effects across subject areas and the stability of school effects across time is to fit separate value-added multilevel models to each subject or cohort and to correlate the resulting empirical Bayes predictions. We show that this gives biased correlations and these biases cannot be…
Descriptors: Value Added Models, Reliability, Statistical Bias, Computation
Vaske, Jerry J. – Sagamore-Venture, 2019
Data collected from surveys can result in hundreds of variables and thousands of respondents. This implies that time and energy must be devoted to (a) carefully entering the data into a database, (b) running preliminary analyses to identify any problems (e.g., missing data, potential outliers), (c) checking the reliability and validity of the…
Descriptors: Surveys, Theories, Hypothesis Testing, Effect Size
Bichi, Ado Abdu; Talib, Rohaya – International Journal of Evaluation and Research in Education, 2018
Testing in educational system perform a number of functions, the results from a test can be used to make a number of decisions in education. It is therefore well accepted in the education literature that, testing is an important element of education. To effectively utilize the tests in educational policies and quality assurance its validity and…
Descriptors: Item Response Theory, Test Items, Test Construction, Decision Making
Teo, Timothy; Fan, Xitao – Asia-Pacific Education Researcher, 2013
Cronbach's coefficient alpha has been widely known and used in educational research. Many education research practitioners, however, may not be aware of the potential issues when the main assumptions for coefficient alpha are violated in research practice. This paper provides a brief discussion about two assumptions that may make the use and…
Descriptors: Educational Research, Reliability, Test Items, Correlation
Augustyniak, Robert A.; Ables, Adrienne Z.; Guilford, Philip; Lujan, Heidi L.; Cortright, Ronald N.; DiCarlo, Stephen E. – Advances in Physiology Education, 2016
Intrinsic motivation to learn involves engaging in learning opportunities because they are seen as enjoyable, interesting, or relevant to meeting one's core psychological needs. As a result, intrinsic motivation is associated with high levels of effort and task performance. Students with greater levels of intrinsic motivation demonstrate strong…
Descriptors: Student Motivation, Academic Achievement, Physiology, Student Interests
Humphry, Stephen M.; McGrane, Joshua A. – Australian Educational Researcher, 2015
This paper presents a method for equating writing assessments using pairwise comparisons which does not depend upon conventional common-person or common-item equating designs. Pairwise comparisons have been successfully applied in the assessment of open-ended tasks in English and other areas such as visual art and philosophy. In this paper,…
Descriptors: Writing Evaluation, Evaluation Methods, Comparative Analysis, Writing Tests
Gorbunova, Tatiana N. – European Journal of Contemporary Education, 2017
The subject of the research is to build methodologies to evaluate the student knowledge by testing. The author points to the importance of feedback about the mastering level in the learning process. Testing is considered as a tool. The object of the study is to create the test system models for defence practice problems. Special attention is paid…
Descriptors: Testing, Evaluation Methods, Feedback (Response), Simulation
Kim, Seonghoon – Psychometrika, 2012
Assuming item parameters on a test are known constants, the reliability coefficient for item response theory (IRT) ability estimates is defined for a population of examinees in two different ways: as (a) the product-moment correlation between ability estimates on two parallel forms of a test and (b) the squared correlation between the true…
Descriptors: Reliability, Item Response Theory, Tests, Correlation
Cropley, David H.; Kaufman, James C. – Journal of Creative Behavior, 2012
The Creative Solution Diagnosis Scale (CSDS) is a 30-item scale based on a core of four criteria: Relevance & Effectiveness, Novelty, Elegance, and Genesis. The CSDS offers potential for the consensual assessment of functional product creativity. This article describes an empirical study in which non-expert judges rated a series of mousetrap…
Descriptors: Expertise, Creativity, Identification, Measures (Individuals)
Nunan, Anna – Language Learning in Higher Education, 2014
The Applied Language Centre at University College Dublin offers foreign language modules to students in ten languages at CEFR [Common European Framework of Reference for Languages] levels ranging from A1 to B2. Efforts have been underway in the Centre to standardise the assessment components across languages to ensure parity between module credits…
Descriptors: Second Language Learning, Second Language Instruction, College Students, Standards
Fan, Xitao; Sun, Shaojing – Journal of Early Adolescence, 2014
In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…
Descriptors: Generalizability Theory, Measurement, Reliability, Correlation