Flowers, Claudia P.; Oshima, T. C. – 1994
This study was patterned after a previous study by Skaggs and Lissitz (1992) in which inconsistency of differential item functioning (DIF) was reported across test administrations. They suggested multidimensionality of test data as one possible reason for inconsistency. Therefore, in this study, DIF indices which were developed recently with a…
Descriptors: Ethnic Groups, Item Bias, Mathematics, Reliability
Crislip, Marian A.; Chin-Chance, Selvin – 2001
This paper discusses the use of two theories of item analysis and test construction, their strengths and weaknesses, and applications to the design of the Hawaii State Test of Essential Competencies (HSTEC). Traditional analyses of the data collected from the HSTEC field test were viewed from the perspectives of item difficulty levels and item…
Descriptors: Difficulty Level, Item Response Theory, Psychometrics, Reliability
Lampe, Richard E. – 1984
This study examines the accuracy of the self-scoring efforts of 306 eighth-graders on the Kuder General Interest Survey (GIS), and suggests possible methods to improve self-scoring accuracy. The GIS is widely used to assist junior high school students with their educational and vocational planning. After the administration of the test by English…
Descriptors: Interest Inventories, Junior High Schools, Profiles, Scoring
Easton, John Q.; Washington, Elois D. – 1982
The effects of students taking different levels of the same standardized achievement test were assessed by administering two levels of the same test to each student. The functional level of the test was taken by all students. The second level of testing was randomly assigned at the adjacent higher or lower level of the test. Functional level…
Descriptors: Elementary Education, Pilot Projects, Reading Achievement, Scores
Peer reviewed
Reckase, Mark D. – Educational Measurement: Issues and Practice, 1995
An example application of portfolio assessment was developed and the model and estimates of reliability derived from the literature were then used to estimate the characteristics of an operational large-scale portfolio assessment program. Costs were estimated to put results in a realistic context. (SLD)
Descriptors: Cost Estimates, Educational Assessment, Educational Theories, Models
Jones, Lex – 1996
In England and Wales, a National Curriculum initiated in 1988 was designed to ensure that all schools provided a curriculum which represented different areas of knowledge. The past 20 years have seen increasing emphasis on the link between the amounts spent on education and the subsequent return on this money. The impact of the…
Descriptors: British National Curriculum, Foreign Countries, Literacy, Performance Based Assessment
Yen, Shu Jing; Bene, Nancy; Huynh, Huynh – 2000
Content integration in performance assessment involves mixing different areas of knowledge in one assessment. In this type of testing situation, assessment tasks are designed to measure the ability of students to solve problems by applying their knowledge and skills in multiple content areas. This study examined the effect of integrated science…
Descriptors: Elementary Secondary Education, Integrated Activities, Performance Based Assessment, Reading Achievement
Skaggs, Gary; Bourque, Mary Lyn – 1998
Political and legislative pressures have posed a number of measurement issues and challenges to the development of sound, valid voluntary national tests (VNTs). This paper focuses on what appear to be the most difficult technical issues related to the VNT proposed by President Clinton in 1997. Technical issues refer to psychometric issues, as…
Descriptors: Academic Achievement, Achievement Tests, Classification, Difficulty Level
Coon, Anne C. – 1992
Every year approximately 1,300 first-year students at the Rochester Institute of Technology complete a 50-minute placement essay during summer and fall orientations. The essays are scored holistically, and the students are placed into one of three levels of an English composition course. At the end of the 10-week quarter of instruction, students…
Descriptors: Freshman Composition, Higher Education, Instructional Effectiveness, Program Descriptions
DeMauro, Gerald E. – 2001
The consequence of large state testing is often uniformity of expectations for achievement. The largest impact of higher standards, then, is realized by traditionally disenfranchised student populations, particularly the least affluent, who are most likely to bear the yoke of low expectation. This paper advances S. Messick's (1981) fundamental…
Descriptors: Academic Achievement, Achievement Tests, Disadvantaged Youth, Elementary Secondary Education
Oxford-Carpenter, Rebecca L.; And Others – 1984
This paper presents an evaluation of Army job training development and testing practices, with a focus on Advanced Individual Testing. Information comes from intensive interviews with school instructors and from observations in the schools. Results indicate that some aspects of the Instructional Systems Development (ISD) model have been…
Descriptors: Adults, Criterion Referenced Tests, Instructional Development, Instructional Systems
Cope, Ronald T. – 1995
This paper deals with the problems that arise in performance assessment from the granularity that results from having a small number of tasks or prompts and raters of responses to these tasks or prompts. Two problems are discussed in detail: (1) achieving a satisfactory degree of reliability; and (2) equating or adjusting for differences of…
Descriptors: Difficulty Level, Educational Assessment, Equated Scores, High Stakes Tests
Rothenberg, Lori; Hessling, Peter A. – 1990
The statewide teaching performance assessment instruments being used in Georgia, North Carolina, and Florida were examined. Forty-one reliability and validity studies regarding the instruments in use in each state were collected from state departments and universities. Georgia uses the Georgia Teacher Performance Assessment Instrument. North…
Descriptors: Construct Validity, Educational Assessment, Elementary Secondary Education, Meta Analysis
Ferrara, Steven F. – 1987
The necessity of controlling the order in which trained essay raters for a statewide writing assessment program receive student essays was studied. The underlying theoretical question concerns possible rater bias caused by raters reading long strings of essays of homogeneous quality; this problem is usually referred to as context effect or…
Descriptors: Context Effect, Essay Tests, Evaluators, Graduation Requirements
Barnwell, David Patrick – 1993
Language testing historians have tended to ignore a significant period in the evolution of language tests, the years 1883-1929. In the earliest years, testing focused on knowledge about, not of, the language and reflected the teaching of Latin and Greek more than that of living languages. Grammatical formalism and translation were emphasized, and…
Descriptors: Articulation (Education), Educational History, Language Tests, Second Language Instruction