Publication Date
In 2025 | 1 |
Since 2024 | 26 |
Since 2021 (last 5 years) | 75 |
Since 2016 (last 10 years) | 199 |
Since 2006 (last 20 years) | 410 |
Descriptor
Test Content | 820 |
Test Construction | 283 |
Test Items | 262 |
Test Validity | 186 |
Foreign Countries | 167 |
Test Format | 156 |
Student Evaluation | 137 |
Test Reliability | 134 |
Elementary Secondary Education | 125 |
Testing | 110 |
Standardized Tests | 105 |
More ▼ |
Source
Author
Sireci, Stephen G. | 9 |
Kitao, Kenji | 4 |
Kitao, S. Kathleen | 4 |
Papageorgiou, Spiros | 4 |
Thurlow, Martha L. | 4 |
Winnick, Joseph P. | 4 |
van der Linden, Wim J. | 4 |
Chang, Hua-Hua | 3 |
Donovan, Jenny | 3 |
Ewing, Maureen | 3 |
Hau, Kit-Tai | 3 |
More ▼ |
Publication Type
Education Level
Audience
Teachers | 68 |
Practitioners | 59 |
Administrators | 20 |
Students | 15 |
Policymakers | 9 |
Researchers | 7 |
Parents | 6 |
Counselors | 3 |
Community | 2 |
Support Staff | 1 |
Location
Australia | 18 |
California | 15 |
Canada | 14 |
China | 12 |
United States | 12 |
Massachusetts | 9 |
Europe | 8 |
Georgia | 8 |
Japan | 8 |
Rhode Island | 8 |
Turkey | 8 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
Alan Shaw – PASAA: Journal of Language Teaching and Learning in Thailand, 2023
Although the TOEFL iBT Listening test is sometimes used for other purposes, it was designed primarily for use as a college entrance examination. Item difficulty in TOEFL iBT Listening tests is the product of interactions between two sets of complex relationships: 1) relationships among numerous item characteristics themselves, and 2) relationships…
Descriptors: English (Second Language), Second Language Instruction, Listening Skills, Language Tests
Burn, Helen Elizabeth; Thrill, Chauntee; Wood, J. Luke; Zamani-Gallaher, Eboni; Mesa, Vilma – Community College Journal of Research and Practice, 2023
This article describes the content validation of the Transitioning Learners to Calculus in Community Colleges Institutional Self-Assessment Tool. The instrument comprises five content areas, each with an associated set of items representing practices to promote the success of underrepresented racially minoritized (URM) students as they transition…
Descriptors: Calculus, Community College Students, Test Validity, Self Evaluation (Individuals)
Kate Toft; Catherine Best; Jayne Donaldson – International Journal of Language & Communication Disorders, 2024
Background: The MD Anderson Dysphagia Inventory (MDADI) is a widely used patient-reported outcome measure (PROM) which assesses dysphagia-related quality of life (QoL) in head and neck cancer (HNC). Despite its common use in HNC research and clinical practice, few of its psychometric properties have been reappraised since its inception. The aim of…
Descriptors: Cancer, Foreign Countries, Physicians, Outcomes of Treatment
Wise, Steven L. – Applied Measurement in Education, 2020
In achievement testing there is typically a practical requirement that the set of items administered should be representative of some target content domain. This is accomplished by establishing test blueprints specifying the content constraints to be followed when selecting the items for a test. Sometimes, however, students give disengaged…
Descriptors: Test Items, Test Content, Achievement Tests, Guessing (Tests)
BijanKhan, Mahmood; ShayesteFar, Parvaneh; Mohebbi, Hassan – Language Testing in Asia, 2023
Drawing on a growing body of research on the interface between corpus linguistics and second/foreign language testing and assessment, we adopted "Peykare," a large-scale, annotated, Persian written language resource to evaluate the content (i.e., coverage and typicality) and construct validity of a Persian language proficiency test…
Descriptors: Indo European Languages, Language Tests, Test Construction, Test Validity
Do-Hong Kim; Chuang Wang; Thi Nhu Ngoc Truong – Language Teaching Research, 2024
Researchers and practitioners in the field of second language acquisition have come to realize the importance of non-cognitive skills such as self-efficacy and self-regulation in students' learning of a second language. However, there has been limited systematic research on such measures in the second language context and the validity and…
Descriptors: Psychometrics, Test Content, Self Efficacy, English Language Learners
Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024
The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…
Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction
Wellberg, Sarah – Assessment in Education: Principles, Policy & Practice, 2023
Classroom assessment research in the United States has shifted away from the examination of teacher-made tests, but such tests are still widely used and have an enormous impact on students' educational experiences. Given the major shifts in educational policy in the United States, including the widespread adoption of the Common Core State…
Descriptors: Teacher Made Tests, Mathematics Tests, Common Core State Standards, Test Items
Stevens, Scott P.; Palocsay, Susan W.; Novoa, Luis J. – INFORMS Transactions on Education, 2023
Test writing is a fundamental component of teaching. With increasing pressure to teach larger groups of students, conduct formal assessment of learning outcomes, and offer online and hybrid classes, there is a need for alternatives to constructed response problem-solving test questions. We believe that appropriate use of multiple-choice (MC)…
Descriptors: Multiple Choice Tests, Introductory Courses, Test Construction, Content Validity
Tugçe Duran; Musa Dikmenli – Journal of Education in Science, Environment and Health, 2024
This study aimed to comprehensively examine the articles in which multi-tier concept diagnostic tests, which are among the alternative assessment methods frequently used in recent years to identify misconceptions, were used in biology education between 2000 and 2022. For this purpose, systematic review steps were followed and summarized in the…
Descriptors: Diagnostic Tests, Biology, Misconceptions, Science Education
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
Firdissa J. Aga – Intersection: A Journal at the Intersection of Assessment and Learning, 2024
The study investigated hurdles to the quality of student learning assessment by examining issues related to assessment procedures and practices, learners and learning, learning resources and test constructs, and test admin and feedback. Quantitative and qualitative data were collected from two Ethiopian universities using two types of…
Descriptors: Foreign Countries, College Faculty, College Students, Test Construction
Timothy S. Faith – Teaching and Learning Excellence through Scholarship, 2024
This study compared traditional methods of college-level instruction, including lecture and class discussion followed by assessment via course content exams, with a variety of other instructional techniques. The intent was to evaluate whether more contemporary instructional techniques are significantly correlated with improved average exam scores…
Descriptors: Community College Students, Business Administration Education, Teaching Methods, Alternative Assessment
Vahid Aryadoust – Applied Linguistics, 2024
I analyzed a corpus of the international English language testing system (IELTS) comprising 256 listening sections (1996-2021). The primary objective of the study was to gain insights into the assumptions made by test designers regarding the real-life contexts that test-takers will encounter. Overall, 15 superordinate topic areas and 300 subtopics…
Descriptors: Dialects, Pronunciation, Commercialization, Second Language Learning
Russell, Michael; Moncaleano, Sebastian – Practical Assessment, Research & Evaluation, 2020
Although both content alignment and standard-setting procedures rely on content-expert panel judgements, only the latter employs discussion among panel members. This study employed a modified form of the Webb methodology to examine content alignment for twelve tests administered as part of the Massachusetts Comprehensive Assessment System (MCAS).…
Descriptors: Test Content, Test Items, Discussion, Test Validity