Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 0 |
Since 2006 (last 20 years) | 3 |
Descriptor
Source
American Psychologist | 1 |
Educational Testing Service | 1 |
Executive Review | 1 |
Language Testing | 1 |
NCME Measurement in Education | 1 |
Online Submission | 1 |
Public Libraries | 1 |
Author
Coffman, William E. | 3 |
Thompson, Bruce | 2 |
Alderson, J. Charles | 1 |
Atkinson, Dianne | 1 |
Baker, C. Scott | 1 |
Barnes, Robert E. | 1 |
Bickman, Leonard | 1 |
Bobie, Allen | 1 |
Booth, Mary W. | 1 |
Brittain, Clay V. | 1 |
Brittain, Mary M. | 1 |
More ▼ |
Publication Type
Opinion Papers | 75 |
Speeches/Meeting Papers | 75 |
Information Analyses | 14 |
Reports - Evaluative | 9 |
Reports - Descriptive | 6 |
Journal Articles | 4 |
Reports - Research | 3 |
Collected Works - Serials | 1 |
Guides - Non-Classroom | 1 |
Tests/Questionnaires | 1 |
Education Level
Audience
Researchers | 9 |
Practitioners | 2 |
Media Staff | 1 |
Location
Canada | 1 |
Ireland | 1 |
Kentucky | 1 |
United Kingdom | 1 |
United Kingdom (England) | 1 |
United Kingdom (Great Britain) | 1 |
United Kingdom (Wales) | 1 |
Laws, Policies, & Programs
Elementary and Secondary… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
National Assessment of… | 2 |
Test of Economic Literacy | 1 |
Test of Understanding in… | 1 |
What Works Clearinghouse Rating
Kane, Michael – Educational Testing Service, 2010
The 12th annual William H. Angoff Memorial Lecture was presented by Dr. Michael T. Kane, ETS's (Educational Testing Service) Samuel J. Messick Chair in Test Validity and the former Director of Research at the National Conference of Bar Examiners. Dr. Kane argues that it is important for policymakers to recognize the impact of errors of measurement…
Descriptors: Error of Measurement, Scores, Public Policy, Test Theory
Stansfield, Charles W. – Language Testing, 2008
In this speech, the author covers a lot of ground. In the first half of his speech, the author gives a brief summary of the last 40 years of the history of language testing, from his perspective. The author reviews these years more or less by decade. Additionally, he discusses the evolution of the profession of language testing during this period,…
Descriptors: History, Testing, Language Tests, Role
Roberts, J. Kyle; Onwuegbuzie, Anthony J. – 2000
Much of the current research concerning reliability emphatically suggests that researchers should gather their own reliability estimates when administering an instrument. It has also been recommended that data with low reliability be discarded. While some data obtained from instruments that originally yielded reliable results may be unreliable, it…
Descriptors: Estimation (Mathematics), Reliability, Researchers
Lees, Elaine O. – 1981
Given the concern for reliability in essay evaluation and the prospect of "error" variance in its absence, methods to promote interrater reliability in the evaluation of written compositions have been developed. These methods reduce variation in the value systems being applied by readers to texts, either by limiting the group of readers…
Descriptors: Elementary Secondary Education, Evaluation Criteria, Evaluation Methods, Evaluative Thinking
Robertson, Gary J. – 1981
Some fundamental concepts of criterion referenced test (CRT) reliability are highlighted. Emphasis is given to the procedures for determining reliability of scores for individual pupils because this is an area requiring increased awareness by classroom teachers and practitioners. Reliability issues encountered in the evaluation of instructional…
Descriptors: Criterion Referenced Tests, Reading Tests, Scores, Test Reliability
Thompson, Bruce – 1996
The program evaluation standards approved by the American National Standards Institute (ANSI) in 1994 that deal with reliability and validity accurately represent contemporary views of the psychometric community with regard to reliability and validity. As such, these standards move the field forward. The ANSI standards recognize that reliability…
Descriptors: Program Evaluation, Psychometrics, Reliability, Scores
Munby, Hugh – 2001
This paper explores how facets of the concept "rigor" might be applied to questions about the validity and reliability of research independently of the research modes. The focus of the critical lens could then be on how to assess the contribution of various forms of research rather than on the "paradigm wars" and arguments…
Descriptors: Educational Research, Ethics, Models, Qualitative Research
Henson, Robin K. – 2000
The purpose of this paper is to highlight some psychometric cautions that should be observed when seeking to develop short form versions of tests. Several points are made: (1) score reliability is impacted directly by the characteristics of the sample and testing conditions; (2) sampling error has a direct influence on reliability and factor…
Descriptors: Factor Structure, Psychometrics, Reliability, Sampling
Wainer, Howard – 1982
This paper is the transcript of a talk given to those who use test information but who have little technical background in test theory. The concepts of modern test theory are compared with traditional test theory, as well as a probable future test theory. The explanations given are couched within an extended metaphor that allows a full description…
Descriptors: Difficulty Level, Latent Trait Theory, Metaphors, Test Items
Wilkinson, Rebecca L. – 1992
Problems inherent in relying solely on statistical significance testing as a means of data interpretation are reviewed. The biggest problem with statistical significance testing is that researchers have used the results of this testing to ascribe importance or meaning to their studies where such meaning often does not exist. Often researchers…
Descriptors: Data Interpretation, Effect Size, Power (Statistics), Reliability
Kvale, Steinar – 1994
Arguments are presented for conceptualizing validity within a postmodern approach. Validity, reliability, and generalizability have been a holy trinity of social science research, and standard definitions of validity have been taken from criteria developed for psychometric tests. From a postmodern point of view, validity is sometimes discarded as…
Descriptors: Communication (Thought Transfer), Constructivism (Learning), Definitions, Generalizability Theory
Baker, C. Scott; Fadely, Dean – 1986
A study examined identification and consistency theory of interface between rhetorical and communication theory, to demonstrate the compatibility of specific principles in Kenneth Burke's theory with those in the work of Fritz Heider and other consistency theorists and to make suggestions toward a Burkeian theory of identification through…
Descriptors: Communication (Thought Transfer), Identification (Psychology), Political Influences, Reliability
Hedge, Jerry W.; Laue, Frances J. – 1988
The ability of individuals to make accurate judgments about others is examined and literature on this subject is reviewed. A wide variety of situational factors affects the appraisal of performance. It is generally accepted that the purpose of the appraisal influences the accuracy of the appraiser. The instrumentation, or tools, available to the…
Descriptors: Evaluation Criteria, Evaluation Methods, Evaluation Problems, Performance Factors
Ekstrom, Ruth B. – 1979
Three areas of concern related to test bias and validity should be considered during the revision of the Standards for Educational and Psychological Tests. The first area concerns the sources and consequences of test bias. Five sources of bias have been identified: numerical bias, role bias, status bias, stereotypic bias, and familiarity bias. The…
Descriptors: Evaluation Criteria, Psychometrics, Test Bias, Test Construction
Atkinson, Dianne; Murray, Mary – 1987
Noting that improvement in rater reliability means eliminating differences among raters, this paper discusses ways to assess writing evaluator reliability and methods for achieving higher levels of interrater reliability. After showing that reliability can be improved two ways--by increasing the number of raters or measurements made, and by…
Descriptors: Evaluation Methods, Holistic Evaluation, Interrater Reliability, Measurement Techniques