ERIC - Search Results

Showing 1 to 15 of 75 results

Errors of Measurement, Theory, and Public Policy. William H. Angoff Memorial Lecture Series

Kane, Michael – Educational Testing Service, 2010

The 12th annual William H. Angoff Memorial Lecture was presented by Dr. Michael T. Kane, ETS's (Educational Testing Service) Samuel J. Messick Chair in Test Validity and the former Director of Research at the National Conference of Bar Examiners. Dr. Kane argues that it is important for policymakers to recognize the impact of errors of measurement…

Descriptors: Error of Measurement, Scores, Public Policy, Test Theory

Lecture: "Where We Have Been and Where We Should Go"

Peer reviewed

Stansfield, Charles W. – Language Testing, 2008

In this speech, the author covers a lot of ground. In the first half of his speech, the author gives a brief summary of the last 40 years of the history of language testing, from his perspective. The author reviews these years more or less by decade. Additionally, he discusses the evolution of the profession of language testing during this period,…

Descriptors: History, Testing, Language Tests, Role

Alternative Approaches for Interpreting Alpha with Homogeneous Subsamples.

Roberts, J. Kyle; Onwuegbuzie, Anthony J. – 2000

Much of the current research concerning reliability emphatically suggests that researchers should gather their own reliability estimates when administering an instrument. It has also been recommended that data with low reliability be discarded. While some data obtained from instruments that originally yielded reliable results may be unreliable, it…

Descriptors: Estimation (Mathematics), Reliability, Researchers

Critical Monism, Critical Pluralism, and the Ideal of Inter-Rater Reliability.

Lees, Elaine O. – 1981

Given the concern for reliability in essay evaluation and the prospect of "error" variance in its absence, methods to promote interrater reliability in the evaluation of written compositions have been developed. These methods reduce variation in the value systems being applied by readers to texts, either by limiting the group of readers…

Descriptors: Elementary Secondary Education, Evaluation Criteria, Evaluation Methods, Evaluative Thinking

Assessing Reliability of Criterion-Referenced Instruments.

Robertson, Gary J. – 1981

Some fundamental concepts of criterion referenced test (CRT) reliability are highlighted. Emphasis is given to the procedures for determining reliability of scores for individual pupils because this is an area requiring increased awareness by classroom teachers and practitioners. Reliability issues encountered in the evaluation of instructional…

Descriptors: Criterion Referenced Tests, Reading Tests, Scores, Test Reliability

The Treatment of Score Reliability and Validity in the New ANSI-Approved Program Evaluation Standards.

Thompson, Bruce – 1996

The program evaluation standards approved by the American National Standards Institute (ANSI) in 1994 that deal with reliability and validity accurately represent contemporary views of the psychometric community with regard to reliability and validity. As such, these standards move the field forward. The ANSI standards recognize that reliability…

Descriptors: Program Evaluation, Psychometrics, Reliability, Scores

Educational Research as Disciplined Inquiry: Examining the Facets of Rigor in Our Work.

Munby, Hugh – 2001

This paper explores how facets of the concept "rigor" might be applied to questions about the validity and reliability of research independently of the research modes. The focus of the critical lens could then be on how to assess the contribution of various forms of research rather than on the "paradigm wars" and arguments…

Descriptors: Educational Research, Ethics, Models, Qualitative Research

Sacrificing Reliability and Exalting Sampling Error at the Altar of Parsimony: Some Cautions Concerning Short-Form Test Development.

Henson, Robin K. – 2000

The purpose of this paper is to highlight some psychometric cautions that should be observed when seeking to develop short form versions of tests. Several points are made: (1) score reliability is impacted directly by the characteristics of the sample and testing conditions; (2) sampling error has a direct influence on reliability and factor…

Descriptors: Factor Structure, Psychometrics, Reliability, Sampling

Testing and Test Theory: Whither and Whence.

Wainer, Howard – 1982

This paper is the transcript of a talk given to those who use test information but who have little technical background in test theory. The concepts of modern test theory are compared with traditional test theory, as well as a probable future test theory. The explanations given are couched within an extended metaphor that allows a full description…

Descriptors: Difficulty Level, Latent Trait Theory, Metaphors, Test Items

Statistical Significance Testing: Alternatives and Considerations.

Wilkinson, Rebecca L. – 1992

Problems inherent in relying solely on statistical significance testing as a means of data interpretation are reviewed. The biggest problem with statistical significance testing is that researchers have used the results of this testing to ascribe importance or meaning to their studies where such meaning often does not exist. Often researchers…

Descriptors: Data Interpretation, Effect Size, Power (Statistics), Reliability

Validation as Communication and Action: On the Social Construction of Validity.

Kvale, Steinar – 1994

Arguments are presented for conceptualizing validity within a postmodern approach. Validity, reliability, and generalizability have been a holy trinity of social science research, and standard definitions of validity have been taken from criteria developed for psychometric tests. From a postmodern point of view, validity is sometimes discarded as…

Descriptors: Communication (Thought Transfer), Constructivism (Learning), Definitions, Generalizability Theory

Rhetorical and Communication Theory: An Area of Interface.

Baker, C. Scott; Fadely, Dean – 1986

A study examined identification and consistency theory of interface between rhetorical and communication theory, to demonstrate the compatibility of specific principles in Kenneth Burke's theory with those in the work of Fritz Heider and other consistency theorists and to make suggestions toward a Burkeian theory of identification through…

Descriptors: Communication (Thought Transfer), Identification (Psychology), Political Influences, Reliability

Can Appraisers Rate Work Performance Accurately?

Hedge, Jerry W.; Laue, Frances J. – 1988

The ability of individuals to make accurate judgments about others is examined and literature on this subject is reviewed. A wide variety of situational factors affects the appraisal of performance. It is generally accepted that the purpose of the appraisal influences the accuracy of the appraiser. The instrumentation, or tools, available to the…

Descriptors: Evaluation Criteria, Evaluation Methods, Evaluation Problems, Performance Factors

Issues of Test Bias and Validity.

Ekstrom, Ruth B. – 1979

Three areas of concern related to test bias and validity should be considered during the revision of the Standards for Educational and Psychological Tests. The first area concerns the sources and consequences of test bias. Five sources of bias have been identified: numerical bias, role bias, status bias, stereotypic bias, and familiarity bias. The…

Descriptors: Evaluation Criteria, Psychometrics, Test Bias, Test Construction

Improving Interrater Reliability.

Atkinson, Dianne; Murray, Mary – 1987

Noting that improvement in rater reliability means eliminating differences among raters, this paper discusses ways to assess writing evaluator reliability and methods for achieving higher levels of interrater reliability. After showing that reliability can be improved two ways--by increasing the number of raters or measurements made, and by…

Descriptors: Evaluation Methods, Holistic Evaluation, Interrater Reliability, Measurement Techniques
