Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 4 |
Since 2006 (last 20 years) | 9 |
Descriptor
Tests | 19 |
Test Reliability | 12 |
Test Validity | 7 |
Reliability | 6 |
Evaluation Methods | 5 |
Scores | 5 |
Student Evaluation | 5 |
College Students | 4 |
Foreign Countries | 4 |
Measurement | 3 |
Psychometrics | 3 |
More ▼ |
Source
Author
Wesolowski, Brian C. | 2 |
Caldwell, Michael S. | 1 |
Callahan, Carolyn M. | 1 |
Funke, Joachim | 1 |
Greiff, Samuel | 1 |
Hepburn, Mary A. | 1 |
Jones, Philip | 1 |
Kim, Seonghoon | 1 |
Kolen, Michael J. | 1 |
Lee, Won-Chan | 1 |
Mahar, Matthew T. | 1 |
More ▼ |
Publication Type
Reports - Descriptive | 19 |
Journal Articles | 18 |
Education Level
Higher Education | 3 |
Postsecondary Education | 2 |
Laws, Policies, & Programs
Assessments and Surveys
Kaufman Assessment Battery… | 1 |
What Works Clearinghouse Rating
Wesolowski, Brian C. – Music Educators Journal, 2020
Validity, reliability, and fairness are three prominent indicators for evaluating the quality of assessment processes. Each of the indicators is most often written about and applied in the context of large-scale assessment. As a result, the technical properties of these indicators make them limited in both their practicality and relevance for…
Descriptors: Music Education, Test Validity, Test Reliability, Student Evaluation
Wesolowski, Brian C.; Wind, Stefanie A. – Journal of Educational Measurement, 2019
Rater-mediated assessments are a common methodology for measuring persons, investigating rater behavior, and/or defining latent constructs. The purpose of this article is to provide a pedagogical framework for examining rater variability in the context of rater-mediated assessments using three distinct models. The first model is the observation…
Descriptors: Interrater Reliability, Models, Observation, Measurement
Nicewander, W. Alan – Educational and Psychological Measurement, 2019
This inquiry is focused on three indicators of the precision of measurement--conditional on fixed values of ?, the latent variable of item response theory (IRT). The indicators that are compared are (1) The traditional, conditional standard errors, s(eX|?) = CSEM; (2) the IRT-based conditional standard errors, s[subscript irt](eX|?)=C[subscript…
Descriptors: Measurement, Accuracy, Scores, Error of Measurement
Center on Standards and Assessments Implementation, 2018
Reliability is a measure of consistency. It is the degree to which student results are the same when they take the same test on different occasions, when different scorers score the same item or task, and when different but equivalent tests are taken at the same time or at different times. Reliability is about making sure that different test forms…
Descriptors: Test Reliability, Test Validity, Student Evaluation, Test Bias
Kim, Seonghoon – Psychometrika, 2012
Assuming item parameters on a test are known constants, the reliability coefficient for item response theory (IRT) ability estimates is defined for a population of examinees in two different ways: as (a) the product-moment correlation between ability estimates on two parallel forms of a test and (b) the squared correlation between the true…
Descriptors: Reliability, Item Response Theory, Tests, Correlation
Naumann, Fiona; Moore, Keri; Mildon, Sally; Jones, Philip – Asia-Pacific Journal of Cooperative Education, 2014
This paper aims to develop a valid method to assess the key competencies of the exercise physiology profession acquired through work-integrated learning (WIL). In order to develop a competency-based assessment, the key professional tasks needed to be identified and the test designed so students' competency in different tasks and settings could be…
Descriptors: Exercise Physiology, Competence, Test Construction, Work Experience
Greiff, Samuel; Wustenberg, Sascha; Funke, Joachim – Applied Psychological Measurement, 2012
This article addresses two unsolved measurement issues in dynamic problem solving (DPS) research: (a) unsystematic construction of DPS tests making a comparison of results obtained in different studies difficult and (b) use of time-intensive single tasks leading to severe reliability problems. To solve these issues, the MicroDYN approach is…
Descriptors: Problem Solving, Tests, Measurement, Structural Equation Models
Kolen, Michael J.; Lee, Won-Chan – Educational Measurement: Issues and Practice, 2011
This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
Descriptors: Test Use, Test Format, Error of Measurement, Raw Scores
Mahar, Matthew T.; Rowe, David A. – Measurement in Physical Education and Exercise Science, 2008
Accurate measures of youth fitness are needed by researchers and practitioners. Evidence of validity and reliability are essential before results of youth fitness tests can be used to make sound decisions. This article describes a three-stage paradigm for validation research and provides guidance for conducting and understanding norm-referenced…
Descriptors: Test Reliability, Test Validity, Guidelines, Physical Education Teachers

Sawilowsky, Shlomo S. – Educational and Psychological Measurement, 2000
Reviews issues, raised in part because of "Educational and Psychological Measurement" (EPM) policies, regarding "test reliability," which is psychometric terminology, and "score reliability," score-centric terminology. Discusses datametrics and provides a critique of T. Vacha-Haase's proposed meta-analytic reliability generalization via…
Descriptors: Generalization, Meta Analysis, Psychometrics, Reliability

Thompson, Bruce; Vacha-Haase, Tammi – Educational and Psychological Measurement, 2000
Responds to criticisms of some "Educational and Psychological Measurement" policies and the reliability generalization meta-analytic methods of T. Vacha-Haase. Explores consequences of misunderstanding score reliability. (SLD)
Descriptors: Generalization, Meta Analysis, Psychometrics, Reliability

Hepburn, Mary A.; Strickland, Joseph B. – Journal of Social Studies Research, 1979
Describes the development and assessment of evaluation instruments designed to test student political-citizenship knowledge, skills, and attitudes. The tests are a part of the Improving Citizenship Education Project in Fulton County, Georgia. (Author/CK)
Descriptors: Citizenship, Educational Assessment, Elementary Secondary Education, Evaluation Methods
Urban, Klaus K. – International Education Journal, 2005
The Test for Creative Thinking-Drawing Production (TCT-DP), its design, concept and evaluation scheme as well as experiences and results of application are described. The test was designed to mirror a more holistic concept of creativity than the mere quantitatively oriented, traditional divergent thinking tests. The specific design using figural…
Descriptors: Creativity, Ability Grouping, Creative Thinking, Tests

Stalder, Daniel R. – Teaching of Psychology, 2001
Evaluates the use of discrimination indexes (or item-total correlation) for examining the reliability of examinations. States this technique has drawbacks and may cause examination validity to be lower. Discusses the idea of discrimination power and why poor students may answer an item correctly. (CMK)
Descriptors: Academic Failure, Educational Research, Higher Education, Psychology

Callahan, Carolyn M.; Caldwell, Michael S. – Journal for the Education of the Gifted, 1993
This article describes the database of the National Repository for Instruments and Strategies Used in the Identification and Evaluation of Gifted Programs (University of Virginia). The Scale for the Evaluation of Gifted Identification Instruments is applied to the Kaufman Assessment Battery for Children. A sample bibliographic reference from the…
Descriptors: Ability Identification, Bibliographic Databases, Databases, Elementary Secondary Education
Previous Page | Next Page ยป
Pages: 1 | 2