Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 0 |
Since 2016 (last 10 years) | 1 |
Since 2006 (last 20 years) | 1 |
Descriptor
Source
Author
Publication Type
Opinion Papers | 38 |
Journal Articles | 24 |
Reports - Evaluative | 7 |
Speeches/Meeting Papers | 6 |
Reports - Descriptive | 3 |
Information Analyses | 2 |
Reports - Research | 2 |
Collected Works - Serials | 1 |
Reference Materials -… | 1 |
Education Level
Elementary Education | 1 |
Audience
Practitioners | 2 |
Researchers | 1 |
Location
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Greathouse, Dan; Shaughnessy, Michael F. – Journal of Psychoeducational Assessment, 2016
Whenever a major intelligence or achievement test is revised, there is always renewed interest in the underlying structure of the test as well as a renewed interest in the scoring, administration, and interpretation changes. In this interview, Amy Gabel discusses the most recent revision of the "Wechsler Intelligence Scale for Children-Fifth…
Descriptors: Children, Intelligence Tests, Test Use, Test Validity

Fisicaro, Sebastiano A.; Vance, Robert J. – Educational and Psychological Measurement, 1994
This article presents arguments that the correlation measure "r" of halo is not conceptually more appropriate than the standard deviation (SD) measure. It also describes conditions under which halo effects occur and when the SD and r measures can be used. Neither measure is uniformly superior to the other. (SLD)
Descriptors: Correlation, Evaluation Methods, Interrater Reliability, Measurement Techniques

Gottfredson, Don M., Ed. – Criminal Justice and Behavior, 1983
Examines the use of the Minnesota Multiphasic Personality Inventory (Megargee typology) as a valid classification system in correctional decision making in a series of six articles. Most results urge caution in the use of the Megargee typology, finding poor test validity and test-retest reliability. (WAS)
Descriptors: Classification, Correctional Institutions, Personality Assessment, Position Papers

MacKay, Gilbert; Lundie, Jennifer – International Journal of Disability, Development and Education, 1998
Recognizes the attraction of Goal Attainment Scaling (GAS), a technique that uses a scale to measure client's achievement, but suggests that there are concerns about the calculation of its standard scores. Examples show how GAS may be used in service development, whether or not numerical values are attached. (Author/CR)
Descriptors: Achievement Gains, Achievement Rating, Adults, Children

Schutz, Richard E. – Educational Evaluation and Policy Analysis, 1985
This paper updates the concept of test validity. This new conception entails a set of 10 categories combined together in pairs: curriculum and instructional validity, statutory and forensic validity, media and journalistic validity, political and legislative validity, and partisan and activist validity. (Author/DWH)
Descriptors: Educational Testing, Politics of Education, Predictive Validity, Psychometrics
The Constant Danger of Sacrificing Validity to Reliability: Making Writing Assessment Serve Writers.

Wiggins, Grant – Assessing Writing, 1994
Suggests that assessment must be built into the curriculum and focused upon the kinds of skills students need. Considers much educational testing in writing to be reductionist, unrealistic, and detrimental to learning. Critiques writing assessment's trust and reliance on a single or small sample of student work collected and scored outside of a…
Descriptors: Elementary Secondary Education, Evaluation Methods, Reliability, Student Evaluation

Pittenger, David J. – Journal of Career Planning and Employment, 1993
Considers problems with use of Myers-Briggs Type Indicator (MBTI). Presents brief history of MBTI and brief theory of type. Examines MBTI's statistical structure, reliability, and validity. Concludes that MBTI does not conform to many basic standards expected of psychological tests. (NB)
Descriptors: Career Choice, Evaluation Problems, Labeling (of Persons), Personality Measures

Butler, Katherine G. – Reading Teacher, 1985
Concludes that the lack of normative data, the suggestion that even one failure on the test (with approximately 111 items) makes a child suspect for "at risk" labelling, along with the brevity of directions and interpretation of data require that the test be used with great caution. (FL)
Descriptors: Kindergarten, Language Skills, Screening Tests, Speech Skills
Ediger, Marlow – 2001
It is difficult to know the information that should be included on state report cards to enable comparisons among school districts and among different states. There may be many problems with such report cards, ranging from the possibility of computer error to the chance of reporting test scores that are not reliable or valid or the use of tests…
Descriptors: Academic Achievement, Comparative Analysis, Elementary Secondary Education, Reliability

Carver, Ronald P. – Journal of Reading, 1985
Argues that the Degrees of Reading Power test is not a valid tool for its main purpose, matching students to appropriate texts, because the test's units of text difficulty are not uniformly comparable to the test's units used to reflect a reader's ability. (HOD)
Descriptors: Elementary Secondary Education, Readability, Readability Formulas, Reading Ability

Amberg, Jay – American Scholar, 1982
The fact that the Scholastic Aptitude Test (SAT) is susceptible to coaching does not mean it is a poor test. The abilities measured by it are acquired, apart from test-wiseness. Even though some uses of the scores in admissions may be discriminatory, the test itself is fair, uniform, and judiciously administered. (MSE)
Descriptors: Admission Criteria, Advance Organizers, College Entrance Examinations, Higher Education

Imrie, Bradford W. – Assessment and Evaluation in Higher Education, 1982
Evaluation of the final examination should be part of the course evaluation and should include student perceptions of the exam's nature and the questions' quality. The final examination experience of two groups of undergraduate and graduate students are considered. (MSE)
Descriptors: Course Evaluation, Higher Education, Student Attitudes, Student Evaluation

Taylor, Catherine S.; Nolen, Susan Bobbitt – Education Policy Analysis Archives, 1996
The usefulness of traditional concepts of validity and reliability, developed for large-scale assessments, for the classroom context is explored. Alternate frameworks that situate these constructs in teachers' work in classrooms are presented, and their use in an assessment course for preservice teachers is described. (SLD)
Descriptors: Educational Assessment, Learning, Models, Preservice Teachers

Woodburn, Mary Stuart – Reading Teacher, 1986
Concludes that the test has a well-designed reading booklet and a carefully constructed manual, but that it has a narrow applicability. (FL)
Descriptors: Elementary Secondary Education, Oral Reading, Reading Achievement, Reading Diagnosis
Aiken, Lewis R. – 1979
The research literature on oral achievement testing is reviewed, and advantages and disadvantages of oral tests are described. A number of suggestions are made for improving the objectivity, reliability, and validity of oral tests. The results of a survey of the attitudes and experiences of a selected sample of college students with regard to…
Descriptors: Achievement Tests, Evaluation Methods, Interpretive Skills, Speech Skills