NotesFAQContact Us
Collection
Advanced
Search Tips
Showing all 14 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Liou, Gloria; Bonner, Cavan V.; Tay, Louis – International Journal of Testing, 2022
With the advent of big data and advances in technology, psychological assessments have become increasingly sophisticated and complex. Nevertheless, traditional psychometric issues concerning the validity, reliability, and measurement bias of such assessments remain fundamental in determining whether score inferences of human attributes are…
Descriptors: Psychometrics, Computer Assisted Testing, Adaptive Testing, Data
Peer reviewed Peer reviewed
Direct linkDirect link
Choi, Youn-Jeng; Asilkalkan, Abdullah – Measurement: Interdisciplinary Research and Perspectives, 2019
About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications…
Descriptors: Item Response Theory, Data Analysis, Computer Software, Test Bias
Rose, Janet S.; Huynh, Huynh – 1984
As part of a new teacher evaluation program initiated by the local school board, the Charleston County School District (South Carolina) adopted the Assessments of Performance in Teaching (APT) as a major evaluation tool to assess the teaching performance of annual contract teachers. Since evaluation procedures can ultimately lead to teacher…
Descriptors: Classroom Observation Techniques, Elementary Secondary Education, Evaluation Methods, Interrater Reliability
Doolittle, Allen E. – 1983
The stability of selected indices for detecting differential item performance (item bias), from one randomly equivalent sample to another, is addressed. Some recent research has criticized these indices as too unreliable for utility in measuring bias in achievement test items. Using data from a national testing of the ACT Assessment, however, this…
Descriptors: Black Students, Item Analysis, Racial Factors, Reliability
Hernandez, Arthur E.; Willson, Victor – 1984
Scores of two groups of White and Hispanic children at 11 age levels from 2.5 years to 12.5 years were assessed. The scores were drawn from the Kaufman Assessment Battery for Children (K-ABC), an individually administered assessment battery designed to measure intelligence and achievement and intended for minority group assessment. Reliability…
Descriptors: Achievement Tests, Elementary Education, Error of Measurement, Hispanic Americans
Loyd, Brenda – 1983
The chi-square procedure has been suggested as a viable index of test bias because it provides the best agreement with the three parameter item characteristic curve without the large sample requirement, computer complexity, and cost. This study examines the effect of using different numbers of ability intervals on the reliability of chi-square…
Descriptors: Academic Ability, Black Students, College Entrance Examinations, English
Florida State Dept. of Education, Tallahassee. – 1981
This bulletin describes the technical adequacy of the Florida Teacher Certification Examination and includes discussions on establishing test reliability, test validity, passing scores, and protecting the test from cultural or ethnic bias. Chapter I describes the development of the examination: (1) development, identification, and validation of…
Descriptors: Evaluation Methods, Grading, Higher Education, Minimum Competency Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Hall, John D.; Ashley, Donna M.; Bramlett, Ronald K.; Dielmann, Kim B.; Murphy, John J. – Journal of Applied School Psychology, 2005
This study examined effects of negative versus positive symptom formats on the assessment and subsequent classification of ADHD in children in public schools. Symptoms associated with the disorder based on the Diagnostic and Statistical Manual of Mental Disorders Fourth Edition (DSM-IV) were presented to parents and teachers of referred children…
Descriptors: Response Style (Tests), Attention Deficit Disorders, Classification, Hyperactivity
Nenty, H. Johnson – 1986
The Cattell Culture Fair Intelligence Test (CCFIT) was administered to a large sample of American, Nigerian, and Indian adolescents, and item data were examined for cultural bias. The CCFIT was designed to measure fluid intelligence, which is not influenced by cultural differences. Four different item analysis techniques were used to determine…
Descriptors: Construct Validity, Cross Cultural Studies, Cultural Influences, Culture Fair Tests
Simmons, Johnny O. – 1985
This study examined the need to develop reliable and valid procedures for screening large populations for possible speech and language problems and the use of the Fluharty Preschool Speech and Language Screening Test (FPSLST) as such a device. The test was administered to 260 preschool children, ages three to six. There were 166 Blacks and 94…
Descriptors: Black Students, Comprehension, Correlation, Difficulty Level
Peer reviewed Peer reviewed
Ackerman, Michael – History of Education Quarterly, 1995
Discusses the development and uses of various aptitude tests in higher education from the 1920s through the early 1960s. Although seen as a gateway to educational attainment for returning World War II veterans, intelligence testing faced criticism in the early 1960s as a restrictive practice. (MJP)
Descriptors: Educational Assessment, Educational Attainment, Educational History, Educational Mobility
Rengel, Elizabeth – 1986
The Ball Aptitude Battery (BAB) was examined for item bias in a sample of 577 high school students in which males and females, as well as three ethnic groups (Blacks, Whites, and Hispanics) were represented. The objectives of the investigation were: (1) to assess the level of interrater agreement for the judgmental method; (2) to find the level of…
Descriptors: Academic Aptitude, Aptitude Tests, Black Students, Culture Fair Tests
National Evaluation Systems, Inc., Amherst, MA. – 1991
Five papers presented at a symposium during the 1991 annual conference of the National Council on Measurement in Education explore the design, development, and implementation of the Texas Master Teacher Examination (TMTE) Program. Educational policymakers have begun to maintain that the professionalization of teaching can be substantially…
Descriptors: Educational Legislation, Educational Objectives, Educational Policy, Elementary Secondary Education
Macpherson, Colin R.; Rowley, Glenn L. – 1986
Teacher-made mastery tests were administered in a classroom-sized sample to study their decision consistency. Decision-consistency of criterion-referenced tests is usually defined in terms of the proportion of examinees who are classified in the same way after two test administrations. Single-administration estimates of decision consistency were…
Descriptors: Classroom Research, Comparative Testing, Criterion Referenced Tests, Cutting Scores