Publication Date
In 2025 | 0 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 2 |
Since 2016 (last 10 years) | 7 |
Since 2006 (last 20 years) | 15 |
Descriptor
Test Bias | 16 |
Test Content | 16 |
Test Items | 11 |
Computer Assisted Testing | 6 |
Test Format | 5 |
Test Validity | 5 |
Comparative Testing | 4 |
Content Analysis | 4 |
Educational Technology | 4 |
Evaluation Methods | 4 |
Mathematics Tests | 4 |
Author
Ackerman, Debra J. | 1 |
Allen, Nancy | 1 |
Ariel, Adelaide | 1 |
Arjoon, Janelle A. | 1 |
Bennett, Randy Elliott | 1 |
Chipman, Susan F. | 1 |
Coen, Thomas | 1 |
Dadey, Nathan | 1 |
Dara Bright | 1 |
DePascale, Charles | 1 |
Drake, Samuel | 1 |
Publication Type
Reports - Research | 16 |
Journal Articles | 14 |
Information Analyses | 1 |
Numerical/Quantitative Data | 1 |
Tests/Questionnaires | 1 |
Location
Canada | 1 |
China | 1 |
Colorado (Denver) | 1 |
Delaware | 1 |
Florida | 1 |
Hong Kong | 1 |
Illinois | 1 |
Maryland | 1 |
New York (New York) | 1 |
North Carolina | 1 |
North Carolina (Charlotte) | 1 |
Laws, Policies, & Programs
Every Student Succeeds Act… | 1 |
Assessments and Surveys
Graduate Record Examinations | 1 |
Law School Admission Test | 1 |
Program for International… | 1 |
Marjolein Muskens; Willem E. Frankenhuis; Lex Borghans – npj Science of Learning, 2024
In many countries, standardized math tests are important for achieving academic success. Here, we examine whether content of items, the story that explains a mathematical question, biases performance of low-SES students. In a large-scale cohort study of Trends in International Mathematics and Science Studies (TIMSS)--including data from 58…
Descriptors: Mathematics Tests, Standardized Tests, Test Items, Low Income Students
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments generated from a large test-item database maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity, it is important that all instances of an assessment that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Yiyun Fan; Kristin L. K. Koskey; Dara Bright; Gabriel Matney; Jonathan Bostic; Toni A. May; Gregory E. Stone – Educational Assessment, 2024
Advancement of testing of mathematical problem-solving skills calls for open-ended, realistic tasks particularly susceptible to bias, compromising the score validity and fairness of tests. Informed by universal design principles, this study framed 360 prototype items developed for the "Problem-solving Measures Grades 6-8 Computer Adaptive…
Descriptors: Access to Education, Mathematics Education, Problem Solving, Mathematics Tests
Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018
Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…
Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education
Huang, Xiaoting; Wilson, Mark; Wang, Lei – Educational Psychology, 2016
In recent years, large-scale international assessments have been increasingly used to evaluate and compare the quality of education across regions and countries. However, measurement variance between different versions of these assessments often poses threats to the validity of such cross-cultural comparisons. In this study, we investigated the…
Descriptors: Test Bias, International Assessment, Science Tests, Test Validity
Grand, James A.; Golubovich, Juliya; Ryan, Ann Marie; Schmitt, Neal – Organizational Behavior and Human Decision Processes, 2013
In organizational and educational practices, sensitivity reviews are commonly advocated techniques for reducing test bias and enhancing fairness. In the present paper, results from two studies are reported which investigate how effective individuals are at detecting problematic test content and the influence such content has on important testing…
Descriptors: Test Items, Test Content, Test Bias, Individual Differences
Ackerman, Debra J. – ETS Research Report Series, 2018
Kindergarten entry assessments (KEAs) have increasingly been incorporated into state education policies over the past 5 years, with much of this interest stemming from Race to the Top--Early Learning Challenge (RTT-ELC) awards, Enhanced Assessment Grants, and nationwide efforts to develop common K-12 state learning standards. Drawing on…
Descriptors: Screening Tests, Kindergarten, Test Validity, Test Reliability
Keller, Lisa A.; Keller, Robert R. – Applied Measurement in Education, 2015
Equating test forms is an essential activity in standardized testing, with increased importance with the accountability systems in existence through the mandate of Adequate Yearly Progress. It is through equating that scores from different test forms become comparable, which allows for the tracking of changes in the performance of students from…
Descriptors: Item Response Theory, Rating Scales, Standardized Tests, Scoring Rubrics
Arjoon, Janelle A.; Xu, Xiaoying; Lewis, Jennifer E. – Journal of Chemical Education, 2013
Many of the instruments developed for research use by the chemistry education community are relatively new. Because psychometric evidence dictates the validity of interpretations made from test scores, gathering and reporting validity and reliability evidence is of utmost importance. Therefore, the purpose of this study was to investigate what…
Descriptors: Science Instruction, Measurement Techniques, Psychometrics, Evidence
Gill, Brian; Shoji, Megan; Coen, Thomas; Place, Kate – Regional Educational Laboratory Mid-Atlantic, 2016
School districts and states across the Regional Educational Laboratory Mid-Atlantic Region and the country as a whole have been modifying their teacher evaluation systems to identify more effective and less effective teachers and provide better feedback to improve instructional practice. The new systems typically include components related to…
Descriptors: Predictive Validity, Test Bias, Test Content, School Districts
Ferne, Tracy; Rupp, Andre A. – Language Assessment Quarterly, 2007
This article reviews research on differential item functioning (DIF) in language testing conducted primarily between 1990 and 2005 with an eye toward providing methodological guidelines for developing, conducting, and disseminating research in this area. The article contains a synthesis of 27 studies with respect to five essential sets of…
Descriptors: Test Bias, Evaluation Research, Testing, Language Tests
van der Linden, Wim J.; Ariel, Adelaide; Veldkamp, Bernard P. – Journal of Educational and Behavioral Statistics, 2006
Test-item writing efforts typically result in item pools with an undesirable correlational structure between the content attributes of the items and their statistical information. If such pools are used in computerized adaptive testing (CAT), the algorithm may be forced to select items with less than optimal information, that violate the content…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Items, Item Banks
Gu, Lixiong; Drake, Samuel; Wolfe, Edward W. – Journal of Technology, Learning, and Assessment, 2006
This study seeks to determine whether item features are related to observed differences in item difficulty (DIF) between computer- and paper-based test delivery media. Examinees responded to 60 quantitative items similar to those found on the GRE general test in either a computer-based or paper-based medium. Thirty-eight percent of the items were…
Descriptors: Test Bias, Test Items, Educational Testing, Student Evaluation
Johnson, Martin; Green, Sylvia – Journal of Technology, Learning, and Assessment, 2006
The transition from paper-based to computer-based assessment raises a number of important issues about how mode might affect children's performance and question answering strategies. In this project 104 eleven-year-olds were given two sets of matched mathematics questions, one set on-line and the other on paper. Facility values were analyzed to…
Descriptors: Student Attitudes, Computer Assisted Testing, Program Effectiveness, Elementary School Students

Chipman, Susan F.; And Others – American Educational Research Journal, 1991
The effects of problem content on mathematics word problem performance were explored for 128 male and 128 female college students solving problems with masculine, feminine, and neutral (familiar and unfamiliar) cover stories. No effect of sex typing was found, and a small, but highly significant, effect was found for familiarity. (SLD)
Descriptors: College Students, Comparative Testing, Familiarity, Females