Publication Date
In 2025 | 0 |
Since 2024 | 4 |
Since 2021 (last 5 years) | 4 |
Since 2016 (last 10 years) | 13 |
Since 2006 (last 20 years) | 31 |
Descriptor
Source
Author
Liu, Kristin K. | 2 |
Thurlow, Martha L. | 2 |
Ackerman, Debra J. | 1 |
Allen, Nancy | 1 |
Amit Sevak | 1 |
Ariel, Adelaide | 1 |
Arjoon, Janelle A. | 1 |
Bennett, Randy Elliott | 1 |
Brown, Kevin | 1 |
Camilli, Gregory | 1 |
Cheng, Liying | 1 |
More ▼ |
Publication Type
Journal Articles | 21 |
Reports - Research | 15 |
Reports - Evaluative | 8 |
Reports - Descriptive | 4 |
Guides - General | 3 |
Tests/Questionnaires | 3 |
Information Analyses | 2 |
Guides - Non-Classroom | 1 |
Numerical/Quantitative Data | 1 |
Education Level
Audience
Location
China | 2 |
United States | 2 |
Canada | 1 |
Colorado (Denver) | 1 |
Delaware | 1 |
Florida | 1 |
Hong Kong | 1 |
Illinois | 1 |
Maryland | 1 |
New York (New York) | 1 |
North Carolina | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Every Student Succeeds Act… | 1 |
Individuals with Disabilities… | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Marjolein Muskens; Willem E. Frankenhuis; Lex Borghans – npj Science of Learning, 2024
In many countries, standardized math tests are important for achieving academic success. Here, we examine whether content of items, the story that explains a mathematical question, biases performance of low-SES students. In a large-scale cohort study of Trends in International Mathematics and Science Studies (TIMSS)--including data from 58…
Descriptors: Mathematics Tests, Standardized Tests, Test Items, Low Income Students
Vahid Aryadoust – Applied Linguistics, 2024
I analyzed a corpus of the international English language testing system (IELTS) comprising 256 listening sections (1996-2021). The primary objective of the study was to gain insights into the assumptions made by test designers regarding the real-life contexts that test-takers will encounter. Overall, 15 superordinate topic areas and 300 subtopics…
Descriptors: Dialects, Pronunciation, Commercialization, Second Language Learning
Parry, James R. – Online Submission, 2020
This paper presents research and provides a method to ensure that parallel assessments, that are generated from a large test-item database, maintain equitable difficulty and content coverage each time the assessment is presented. To maintain fairness and validity it is important that all instances of an assessment, that is intended to test the…
Descriptors: Culture Fair Tests, Difficulty Level, Test Items, Test Validity
Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international largescale assessments of cognitive and…
Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias
Yiyun Fan; Kristin L. K. Koskey; Dara Bright; Gabriel Matney; Jonathan Bostic; Toni A. May; Gregory E. Stone – Educational Assessment, 2024
Advancement of testing of mathematical problem-solving skills calls for open-ended, realistic tasks particularly susceptible to bias, compromising the score validity and fairness of tests. Informed by universal design principles, this study framed 360 prototype items developed for the "Problem-solving Measures Grades 6-8 Computer Adaptive…
Descriptors: Access to Education, Mathematics Education, Problem Solving, Mathematics Tests
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
Updated Assessment Principles and Guidelines for English Learners with Disabilities. NCEO Report 424
Liu, Kristin K.; Lazarus, Sheryl S.; Thurlow, Martha L.; Jarmin, Jaime; Ward, Jenna; Christensen, Laurene – National Center on Educational Outcomes, 2020
This report is an update of the assessment principles and guidelines for English language learners published in 2013 (Thurlow, Liu, Ward, & Christensen). That report, which was developed by the Improving the Validity of Assessment Results for English Language Learners with Disabilities (IVARED) project, presented essential principles of…
Descriptors: English Language Learners, Students with Disabilities, Student Evaluation, Evaluation Methods
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS) to provide guidance to states that are interested in including New Meridian content and would like to either keep reporting scores on the New Meridian Scale or use the New Meridian performance levels; that is, the state…
Descriptors: Testing, Standards, Comparative Analysis, Test Content
Dadey, Nathan; Lyons, Susan; DePascale, Charles – Applied Measurement in Education, 2018
Evidence of comparability is generally needed whenever there are variations in the conditions of an assessment administration, including variations introduced by the administration of an assessment on multiple digital devices (e.g., tablet, laptop, desktop). This article is meant to provide a comprehensive examination of issues relevant to the…
Descriptors: Evaluation Methods, Computer Assisted Testing, Educational Technology, Technology Uses in Education
Huang, Xiaoting; Wilson, Mark; Wang, Lei – Educational Psychology, 2016
In recent years, large-scale international assessments have been increasingly used to evaluate and compare the quality of education across regions and countries. However, measurement variance between different versions of these assessments often posts threats to the validity of such cross-cultural comparisons. In this study, we investigated the…
Descriptors: Test Bias, International Assessment, Science Tests, Test Validity
Grand, James A.; Golubovich, Juliya; Ryan, Ann Marie; Schmitt, Neal – Organizational Behavior and Human Decision Processes, 2013
In organizational and educational practices, sensitivity reviews are commonly advocated techniques for reducing test bias and enhancing fairness. In the present paper, results from two studies are reported which investigate how effective individuals are at detecting problematic test content and the influence such content has on important testing…
Descriptors: Test Items, Test Content, Test Bias, Individual Differences
Ackerman, Debra J. – ETS Research Report Series, 2018
Kindergarten entry assessments (KEAs) have increasingly been incorporated into state education policies over the past 5 years, with much of this interest stemming from Race to the Top--Early Learning Challenge (RTT-ELC) awards, Enhanced Assessment Grants, and nationwide efforts to develop common K-12 state learning standards. Drawing on…
Descriptors: Screening Tests, Kindergarten, Test Validity, Test Reliability
Keller, Lisa A.; Keller, Robert R. – Applied Measurement in Education, 2015
Equating test forms is an essential activity in standardized testing, with increased importance with the accountability systems in existence through the mandate of Adequate Yearly Progress. It is through equating that scores from different test forms become comparable, which allows for the tracking of changes in the performance of students from…
Descriptors: Item Response Theory, Rating Scales, Standardized Tests, Scoring Rubrics
Cormier, Damien C.; McGrew, Kevin S.; Evans, Jeffrey J. – Journal of Psychoeducational Assessment, 2011
The linguistic demand of spoken instructions on individually administered norm-referenced psychological and educational tests is of concern when examining individuals who have varying levels of language processing ability or varying cultural backgrounds. The authors present a new method for analyzing the level of verbosity, complexity, and total…
Descriptors: Intelligence Tests, Oral Language, Difficulty Level, Test Bias