ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	3
Since 2006 (last 20 years)	6

Descriptor

Evaluation Methods	13
Scores	13
Test Content	13
Test Validity	5
Achievement Tests	4
Comparative Analysis	4
Student Evaluation	4
Test Construction	4
Test Items	4
Test Reliability	4
Test Results	4
Test Use	4
Higher Education	3
Testing	3
Academic Achievement	2
Accountability	2
Elementary Secondary Education	2
Grading	2
Psychometrics	2
Standardized Tests	2
Test Bias	2
Test Format	2
Writing Tests	2
Adult Basic Education	1
Aptitude Tests	1
More ▼

Source

Academic Medicine	1
Applied Measurement in…	1
ERS Spectrum	1
Education Policy Analysis…	1
Journal of Applied Testing…	1
Journal of Chemical Education	1
Journal of Educational…	1
National Technical Assistance…	1
Regional Educational…	1

Publication Type

Journal Articles	7
Reports - Research	7
Speeches/Meeting Papers	4
Information Analyses	3
Reports - Evaluative	2
Books	1
Guides - Classroom - Teacher	1
Guides - Non-Classroom	1
Reports - Descriptive	1
Tests/Questionnaires	1

Education Level

Elementary Education	2
Elementary Secondary Education	2
Adult Basic Education	1
Adult Education	1
Higher Education	1
Middle Schools	1
Postsecondary Education	1

Audience

Teachers	2
Practitioners	1

Location

Canada	1
Florida	1

Laws, Policies, & Programs

Every Student Succeeds Act…

Assessments and Surveys

Florida Comprehensive…	1
Measures of Academic Progress	1
National Assessment of…	1
SAT (College Admission Test)	1
Test of Adult Basic Education	1
Wide Range Achievement Test	1
Woodcock Johnson Tests of…	1

What Works Clearinghouse Rating

Showing all 13 results Save | Export

Releasing Content to Deter Cheating: An Analysis of the Impact on Candidate Performance

Peer reviewed

Direct link

Wolkowitz, Amanda A.; Davis-Becker, Susan L.; Gerrow, Jack D. – Journal of Applied Testing Technology, 2016

The purpose of this study was to investigate the impact of a cheating prevention strategy employed for a professional credentialing exam that involved releasing over 7,000 active and retired exam items. This study evaluated: 1) If any significant differences existed between examinee performance on released versus non-released items; 2) If item…

Descriptors: Cheating, Test Content, Test Items, Foreign Countries

The Comparability of Scores from Different Digital Devices: A Literature Review and Synthesis with Recommendations for Practice

Peer reviewed

Direct link

Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018

Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…

Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education

A Brief Guide to Selecting and Using Pre-Post Assessments

Download full text

Sanders, Sara – National Technical Assistance Center for the Education of Neglected or Delinquent Children and Youth (NDTAC), 2019

This guide is designed to assist States, agencies, and/or facilities who work with youth who are neglected, delinquent, or at-risk (N or D). The information in the guide will benefit those who are (a) interested in implementing pre-posttests, (b) in the process of identifying an appropriate pre-posttest, or (c) ready to evaluate current testing…

Descriptors: At Risk Students, Delinquency, Pretests Posttests, Testing

Understanding the State of the Art for Measurement in Chemistry Education Research: Examining the Psychometric Evidence

Peer reviewed

Direct link

Arjoon, Janelle A.; Xu, Xiaoying; Lewis, Jennifer E. – Journal of Chemical Education, 2013

Many of the instruments developed for research use by the chemistry education community are relatively new. Because psychometric evidence dictates the validity of interpretations made from test scores, gathering and reporting validity and reliability evidence is of utmost importance. Therefore, the purpose of this study was to investigate what…

Descriptors: Science Instruction, Measurement Techniques, Psychometrics, Evidence

The Predictive Validity of Selected Benchmark Assessments Used in the Mid-Atlantic Region. Issues & Answers. REL 2007-No. 017

Peer reviewed
PDF on ERIC

Download full text

Brown, Richard S.; Coughlin, Ed – Regional Educational Laboratory Mid-Atlantic, 2007

This report examines the availability and quality of predictive validity data for a selection of benchmark assessments identified by state and district personnel as in use within Mid-Atlantic Region jurisdictions. Based on a review of practices within the school districts in the region, this report details the benchmark assessments being used, in…

Descriptors: Test Content, Academic Achievement, Predictive Validity, Program Effectiveness

An Application of Score Equity Assessment: Invariance of Linkage of New SAT[R] to Old SAT across Gender Groups

Peer reviewed

Direct link

Liu, Jinghua; Cahn, Miriam F.; Dorans, Neil J. – Journal of Educational Measurement, 2006

The College Board's SAT[R] data are used to illustrate how the score equity assessment (SEA) can help inform the program about equatability. SEA is used to examine whether the content change(s) to the revised new SAT result in differential linking functions across gender groups. Results of population sensitivity analyses are reported on the…

Descriptors: Aptitude Tests, Comparative Analysis, Gender Differences, Scores

Setting Content-Based Standards for National Board Exams: Initial Research for the Comprehensive Part I Examination.

Peer reviewed

Swanson, David B.; And Others – Academic Medicine, 1990

This study is the National Board of Medical Examiners exploration of content-based techniques (standard-setting techniques in which pass/fail decisions are based upon the performance of examinees in relation to test content). Two content-based techniques (Angoff and Ebel) and three methods of evaluating examinee performance were studied. (MLW)

Descriptors: Content Validity, Evaluation Methods, Higher Education, Medical Education

Should Achievement Tests Be Used To Judge School Quality?

Peer reviewed

Bauer, Scott C. – Education Policy Analysis Archives, 2000

Studied whether student scores on standardized tests represent reasonable measures of instructional quality using ratings by 10 parents and 11 educators (school principals) of the degree to which test items from a nationally marketed standardized achievement test represent the content actually taught. On average, raters felt that test items…

Descriptors: Achievement Tests, Educational Quality, Elementary Secondary Education, Evaluation Methods

Factors in Performance on Brief, Impromptu Essay Examinations. College Board Report No. 95-4.

Download full text

Breland, Hunter M.; And Others – 1995

Brief, impromptu essays written for the 1990 administration of the College Board's English Composition Test (ECT) were randomly sampled for four groups of examinees. These essays were subjected to further holistic ratings beyond those conducted for the ECT, and analytical ratings were also obtained. The holistic scores were correlated with the…

Descriptors: Cohesion (Written Composition), English, Essays, Evaluation Methods

Basic Precepts in Test Construction.

Download full text

Buser, Karen – 1996

Most seasoned test developers recognize the importance of thoughtful decision making when constructing a test. Unfortunately, many classroom achievement tests are created by novice test developed who have not received sufficient instruction in item writing (G. Gulliksen, 1986; R. J. Stiggins, 1991). The result is often a test that is poorly…

Descriptors: Achievement Tests, Decision Making, Educational Planning, Evaluation Methods

Tips for Improving Testing and Grading. Survival Skills for Scholars, Volume 4.

Ory, John C.; Ryan, Katherine E. – 1993

This book for college faculty provides a resource for developing, using, and grading classroom exams. The first chapter addresses ways to determine what content should be included on an exam. The second chapter identifies testing considerations such as number of exams, difficulty level of items, and test length. Chapters 3 and 4 provide guidelines…

Descriptors: Classroom Techniques, Codes of Ethics, Essay Tests, Evaluation Methods

The Role of Language Testing in Language Program Evaluation.

Download full text

Palmer, Adrian – 1991

A discussion of second language program evaluation focuses on the interpretability of test scores as a criterion in program evaluation. It looks at both test design and research design issues. First, eight method-comparison, program evaluation studies that compare acquisition-based and analysis/practice based methods are described. Acquisition…

Descriptors: Comparative Analysis, Criterion Referenced Tests, Evaluation Criteria, Evaluation Methods

Go Back and Check Your Work: Recommendations for Improving Florida's Accountability System

Peer reviewed

Direct link

Jones, Brett D.; Egley, Robert J. – ERS Spectrum, 2005

The purpose of this paper is to discuss Florida teachers' recommendations for improving the Florida Comprehensive Assessment Test (FCAT) and to compare their recommendations with those of Florida administrators. Although teachers' suggestions varied as to the types and extent of remedies needed to improve the FCAT, some common themes emerged. The…

Descriptors: Test Results, Core Curriculum, Student Evaluation, Accountability

Arjoon, Janelle A.	1
Bauer, Scott C.	1
Breland, Hunter M.	1
Brown, Richard S.	1
Buser, Karen	1
Cahn, Miriam F.	1
Coughlin, Ed	1
Dadey, Nathan	1
Davis-Becker, Susan L.	1
DePascale, Charles	1
Dorans, Neil J.	1
Egley, Robert J.	1
Gerrow, Jack D.	1
Jones, Brett D.	1
Lewis, Jennifer E.	1
Liu, Jinghua	1
Lyons, Susan	1
Ory, John C.	1
Palmer, Adrian	1
Ryan, Katherine E.	1
Sanders, Sara	1
Swanson, David B.	1
Wolkowitz, Amanda A.	1
Xu, Xiaoying	1
More ▼