Anne Traynor; Sara C. Christopherson – Applied Measurement in Education, 2024
Combining methods from earlier content validity and more contemporary content alignment studies may allow a more complete evaluation of the meaning of test scores than if either set of methods is used on its own. This article distinguishes item relevance indices in the content validity literature from test representativeness indices in the…
Descriptors: Test Validity, Test Items, Achievement Tests, Test Construction
Sarah K. Cowan; Michael Hout; Stuart Perrett – Sociological Methods & Research, 2024
Long-running surveys need a systematic way to reflect social change and to keep items relevant to respondents, especially when they ask about controversial subjects, or they threaten the items' validity. We propose a protocol for updating measures that preserves content and construct validity. First, substantive experts articulate the current and…
Descriptors: Surveys, Public Opinion, Social Attitudes, Pregnancy
Alan Shaw – PASAA: Journal of Language Teaching and Learning in Thailand, 2023
Although the TOEFL iBT Listening test is sometimes used for other purposes, it was designed primarily for use as a college entrance examination. Item difficulty in TOEFL iBT Listening tests is the product of interactions between two sets of complex relationships: 1) relationships among numerous item characteristics themselves, and 2) relationships…
Descriptors: English (Second Language), Second Language Instruction, Listening Skills, Language Tests
Agus Santoso; Heri Retnawati; Timbul Pardede; Ibnu Rafi; Munaya Nikma Rosyada; Gulzhaina K. Kassymova; Xu Wenxin – Practical Assessment, Research & Evaluation, 2024
The test blueprint is important in test development, where it guides the test item writer in creating test items according to the desired objectives and specifications or characteristics (so-called a priori item characteristics), such as the level of item difficulty in the category and the distribution of items based on their difficulty level.…
Descriptors: Foreign Countries, Undergraduate Students, Business English, Test Construction
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
Luo, Xiao; Wang, Xinrui – International Journal of Testing, 2019
This study introduced dynamic multistage testing (dy-MST) as an improvement to existing adaptive testing methods. dy-MST combines the advantages of computerized adaptive testing (CAT) and computerized adaptive multistage testing (ca-MST) to create a highly efficient and regulated adaptive testing method. In the test construction phase, multistage…
Descriptors: Adaptive Testing, Computer Assisted Testing, Test Construction, Psychometrics
Mohammed Ambusaidi – ProQuest LLC, 2022
There is an increased demand on nursing faculty to provide quality teaching and assessment. Nursing faculty are required to ensure accurate assessment of learning through testing and outcome measurement that are critical elements of the evaluation process. Likewise, nursing faculty should implement a logical evaluation system. However, the…
Descriptors: Nursing Education, College Faculty, Test Construction, Test Validity
Sinharay, Sandip – Educational Measurement: Issues and Practice, 2018
The choice of anchor tests is crucial in applications of the nonequivalent groups with anchor test design of equating. Sinharay and Holland (2006, 2007) suggested "miditests," which are anchor tests that are content-representative and have the same mean item difficulty as the total test but have a smaller spread of item difficulties.…
Descriptors: Test Content, Difficulty Level, Test Items, Test Construction
Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…
Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)
Butz, Amanda R.; Branchaw, Janet L. – CBE - Life Sciences Education, 2020
Expanding the scope of previous undergraduate research assessment tools, the "Entering Research" Learning Assessment (ERLA) measures undergraduate and graduate research trainee learning gains in the seven areas of trainee development in the evidence-based "Entering Research" conceptual framework: Research Comprehension and…
Descriptors: Undergraduate Students, Graduate Students, College Students, Student Research
National Assessment Governing Board, 2019
Since 1973, the National Assessment of Educational Progress (NAEP) has gathered information about student achievement in mathematics. The NAEP assessment in mathematics has two components that differ in purpose. One assessment measures long-term trends in achievement among 9-, 13-, and 17-year-old students by using the same basic design each time.…
Descriptors: National Competency Tests, Mathematics Achievement, Grade 4, Grade 8
Harlacher, Jason – Regional Educational Laboratory Central, 2016
Educators have many decisions to make and it's important that they have the right data to inform those decisions and access to questionnaires that can gather that data. This guide, developed by REL Central and based on work done through separate projects with the Wyoming Office of Public Instruction and the Nebraska Department of Education,…
Descriptors: Questionnaires, Test Construction, Student Surveys, Teacher Surveys
Updated Assessment Principles and Guidelines for English Learners with Disabilities. NCEO Report 424
Liu, Kristin K.; Lazarus, Sheryl S.; Thurlow, Martha L.; Jarmin, Jaime; Ward, Jenna; Christensen, Laurene – National Center on Educational Outcomes, 2020
This report is an update of the assessment principles and guidelines for English language learners published in 2013 (Thurlow, Liu, Ward, & Christensen). That report, which was developed by the Improving the Validity of Assessment Results for English Language Learners with Disabilities (IVARED) project, presented essential principles of…
Descriptors: English Language Learners, Students with Disabilities, Student Evaluation, Evaluation Methods