ERIC - Search Results

Descriptor

Testing Programs	41
Test Reliability	24
State Programs	22
Test Construction	16
Educational Assessment	15
Elementary Secondary Education	14
Performance Based Assessment	13
Scoring	11
Test Validity	11
Reliability	9
Academic Achievement	8
Interrater Reliability	8
Standardized Tests	8
Achievement Tests	7
Testing Problems	7
Writing Evaluation	7
High Schools	6
Student Evaluation	6
Difficulty Level	5
Equated Scores	5
Essay Tests	5
Evaluation Methods	5
Scores	5
Test Items	5
Accountability	4
More ▼

Source

Educational Measurement:…	1
Online Submission	1

Publication Type

Speeches/Meeting Papers	41
Reports - Research	23
Reports - Evaluative	15
Reports - Descriptive	3
Opinion Papers	2
Tests/Questionnaires	2
Historical Materials	1
Information Analyses	1
Journal Articles	1
Numerical/Quantitative Data	1

Education Level

Grade 4	1
Grade 6	1
Grade 8	1

Audience

Researchers

Location

New York	3
Florida	2
Georgia	2
North Carolina	2
Hawaii	1
Ireland	1
Kentucky	1
Louisiana	1
Maine	1
Pennsylvania	1
United Kingdom (England)	1
United Kingdom (Wales)	1
More ▼

Laws, Policies, & Programs

Education Consolidation…

Assessments and Surveys

National Assessment of…	3
Comprehensive Tests of Basic…	2
California Achievement Tests	1
General Educational…	1
Iowa Tests of Basic Skills	1
Metropolitan Achievement Tests	1
North Carolina End of Course…	1
SRA Achievement Series	1
Sequential Tests of…	1
Teacher Performance…	1

What Works Clearinghouse Rating

Showing 1 to 15 of 41 results Save | Export

The Consistency of DIF/DTF across Different Test Administrations: A Multidimensional Perspective.

Download full text

Flowers, Claudia P.; Oshima, T. C. – 1994

This study was patterned after a previous study by Skaggs and Lissitz (1992) in which inconsistency of differential item functioning (DIF) was reported across test administrations. They suggested multidimensionality of test data as one possible reason for inconsistency. Therefore, in this study, DIF indices which were developed recently with a…

Descriptors: Ethnic Groups, Item Bias, Mathematics, Reliability

Using Traditional Psychometric Methodologies and the Rasch Model in Designing a Test.

Download full text

Crislip, Marian A.; Chin-Chance, Selvin – 2001

This paper discusses the use of two theories of item analysis and test construction, their strengths and weaknesses, and applications to the design of the Hawaii State Test of Essential Competencies (HSTEC). Traditional analyses of the data collected from the HSTEC field test were viewed from the perspectives of item difficulty levels and item…

Descriptors: Difficulty Level, Item Response Theory, Psychometrics, Reliability

Self-Scoring Accuracy of the Kuder General Interest Survey.

Download full text

Lampe, Richard E. – 1984

This study examines the accuracy of the self-scoring efforts of 306 eighth-graders on the Kuder General Interest Survey (GIS), and suggests possible methods to improve self-scoring accuracy. The GIS is widely used to assist junior high school students with their educational and vocational planning. After the administration of the test by English…

Descriptors: Interest Inventories, Junior High Schools, Profiles, Scoring

The Effects of Functional Level Testing on Five New Standardized Reading Achievement Tests.

Download full text

Easton, John Q.; Washington, Elois D. – 1982

The effects of students taking different levels of the same standardized achievement test were assessed by administering two levels of the same test to each student. The functional level of the test was taken by all students. The second level of testing was randomly assigned at the adjacent higher or lower level of the test. Functional level…

Descriptors: Elementary Education, Pilot Projects, Reading Achievement, Scores

Portfolio Assessment: A Theoretical Estimate of Score Reliability.

Peer reviewed

Reckase, Mark D. – Educational Measurement: Issues and Practice, 1995

An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)

Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models

Reading National Assessment.

Download full text

Jones, Lex – 1996

In England and Wales, a National Curriculum initiated in 1988 was designed to ensure that all schools provided a curriculum which represented different areas of knowledge. The past 20 years has increasingly seen more emphasis on the link between the financial amounts spent on education and subsequent return on this money. The impact of the…

Descriptors: British National Curriculum, Foreign Countries, Literacy, Performance Based Assessment

The Effect of Content Integration on the Construct Validity of Reading Performance Assessment.

Download full text

Yen, Shu Jing; Bene, Nancy; Huynh, Huynh – 2000

Content integration in performance assessment involves mixing different areas of knowledge in one assessment. In this type of testing situation, assessment tasks are designed to measure the ability of students to solve problems by applying their knowledge and skills in multiple content areas. This study examined the effect of integrated science…

Descriptors: Elementary Secondary Education, Integrated Activities, Performance Based Assessment, Reading Achievement

Overview of the Most Difficult Technical Issues on the VNT.

Download full text

Skaggs, Gary; Bourque, Mary Lyn – 1998

Political and legislative pressures have posed a number of measurement issues and challenges to the development of sound, valid voluntary national tests (VNTs). This paper focuses on what appear to be the most difficult technical issues related to the VNT proposed by President Clinton in 1997. Technical issues refer to psychometric issues, as…

Descriptors: Academic Achievement, Achievement Tests, Classification, Difficulty Level

Large-Scale Writing Assessment: Methods, Accommodations and Reliability.

Download full text

Coon, Anne C. – 1992

Every year approximately 1,300 first-year students at the Rochester Institute of Technology complete a 50-minute placement essay during summer and fall orientations. The essays are scored holistically, and the students are placed into one of three levels of an English composition course. At the end of the 10-week quarter of instruction, students…

Descriptors: Freshman Composition, Higher Education, Instructional Effectiveness, Program Descriptions

Differential Consequential Validity and the Stability of Inferences across Ethnicity and Community on New York State Large Scale Tests.

Download full text

DeMauro, Gerald E. – 2001

The consequences of large state testing are often uniformity of expectations for achievement. The largest impact of higher standards, then, are realized by traditionally disenfranchised student populations, particularly the least affluent who are most likely to bear the yoke of low expectation. This paper advances S. Messick's (1981) fundamental…

Descriptors: Academic Achievement, Achievement Tests, Disadvantaged Youth, Elementary Secondary Education

Army Job Training Development and Testing Practices Compared to the Instructional Systems Development Model.

Oxford-Carpenter, Rebecca L.; And Others – 1984

This paper presents an evaluation of Army job training development and testing practices, with a focus on Advanced Individual Testing. Information comes from intensive interviews with school instructors and from observations in the schools. Results indicate that some aspects of the Instructional Systems Development (ISD) model have been…

Descriptors: Adults, Criterion Referenced Tests, Instructional Development, Instructional Systems

Cautionary Observations on Reliability and Equating of Forms in High Stakes Performance Assessment: The Problem of Granularity.

Download full text

Cope, Ronald T. – 1995

This paper deals with the problems that arise in performance assessment from the granularity that results from having a small number of tasks or prompts and raters of responses to these tasks or prompts. Two problems are discussed in detail: (1) achieving a satisfactory degree of reliability; and (2) equating or adjusting for differences of…

Descriptors: Difficulty Level, Educational Assessment, Equated Scores, High Stakes Tests

Applying the APA/AERA/NCME "Standards": Evidence for the Validity and Reliability of Three Statewide Teaching Assessment Instruments.

Download full text

Rothenberg, Lori; Hessling, Peter A. – 1990

The statewide teaching performance assessment instruments being used in Georgia, North Carolina, and Florida were examined. Forty-one reliability and validity studies regarding the instruments in use in each state were collected from state departments and universities. Georgia uses the Georgia Teacher Performance Assessment Instrument. North…

Descriptors: Construct Validity, Educational Assessment, Elementary Secondary Education, Meta Analysis

Effects of Essay Order on Raters' Score Assignments in a Large-Scale Writing Assessment.

Ferrara, Steven F. – 1987

The necessity of controlling the order in which trained essay raters for a statewide writing assessment program receive student essays was studied. The underlying theoretical question concerns possible rater bias caused by raters reading long strings of essays of homogeneous quality; this problem is usually referred to as context effect or…

Descriptors: Context Effect, Essay Tests, Evaluators, Graduation Requirements

Problems of Articulation and Testing: Lessons from the 1920s.

Download full text

Barnwell, David Patrick – 1993

Language testing historians have tended to ignore a significant period in the evolution of language tests, the years 1883-1929. In the earliest years, testing focused on knowledge about, not of, the language and reflected the teaching of Latin and Greek more than that of living languages. Grammatical formalism and translation were emphasized, and…

Descriptors: Articulation (Education), Educational History, Language Tests, Second Language Instruction

Previous Page | Next Page »

Pages: 1 | 2 | 3

Falk, Beverly	2
Ferrara, Steven F.	2
Yen, Shu Jing	2
Algina, James	1
Allen, Nancy L.	1
Anderson, Lorin W.	1
Auchter, Joan Chikos	1
Barnwell, David Patrick	1
Bene, Nancy	1
Benoit, Joyce	1
Bourque, Mary Lyn	1
Brauchle, Paul E.	1
Braungart-Bloom, Diane S.	1
Chin-Chance, Selvin	1
Coon, Anne C.	1
Cope, Ronald T.	1
Crislip, Marian A.	1
Cromack, Theodore R.	1
DeMauro, Gerald E.	1
Easton, John Q.	1
Ellett, Chad D.	1
Flowers, Claudia P.	1
Friedman, Greg	1
Green, Donald Ross	1
More ▼