Publication Date
In 2025 | 0 |
Since 2024 | 6 |
Since 2021 (last 5 years) | 21 |
Since 2016 (last 10 years) | 54 |
Since 2006 (last 20 years) | 126 |
Descriptor
Test Construction | 283 |
Test Content | 283 |
Test Items | 118 |
Test Validity | 76 |
Student Evaluation | 54 |
Elementary Secondary Education | 50 |
Test Format | 50 |
Test Reliability | 49 |
Achievement Tests | 39 |
Evaluation Methods | 39 |
Test Use | 38 |
Author
Kitao, Kenji | 4 |
Kitao, S. Kathleen | 4 |
Sireci, Stephen G. | 4 |
Winnick, Joseph P. | 4 |
Chang, Hua-Hua | 3 |
Ewing, Maureen | 3 |
Hau, Kit-Tai | 3 |
Leung, Chi-Keung | 3 |
Short, Francis X. | 3 |
Thurlow, Martha L. | 3 |
van der Linden, Wim J. | 3 |
Audience
Teachers | 32 |
Practitioners | 27 |
Administrators | 8 |
Students | 7 |
Parents | 4 |
Policymakers | 4 |
Researchers | 2 |
Community | 1 |
Counselors | 1 |
Location
Georgia | 8 |
Illinois | 3 |
United States | 3 |
Australia | 2 |
Germany | 2 |
Iran | 2 |
Japan | 2 |
Kentucky | 2 |
Louisiana | 2 |
Netherlands | 2 |
New Mexico | 2 |
Laws, Policies, & Programs
Every Student Succeeds Act… | 3 |
No Child Left Behind Act 2001 | 3 |
Individuals with Disabilities… | 2 |
Kentucky Education Reform Act… | 1 |
Rehabilitation Act 1973… | 1 |
Shilo, Gila – Educational Research Quarterly, 2015
The purpose of the study was to examine the quality of open test questions directed to high school and college students. One thousand five hundred examination questions from various fields of study were examined using criteria based on the writing centers' directions and guidelines. The 273 questions that did not fulfill the criteria were analyzed…
Descriptors: Questioning Techniques, Questionnaires, Test Construction, High School Students
Peter, Johannes; Leichner, Nikolas; Mayer, Anne-Kathrin; Krampen, Günter – Psychology Learning and Teaching, 2015
This paper reports the development of a fixed-choice test for the assessment of basic knowledge in psychology, for use with undergraduate as well as graduate students. Test content is selected based on a core concepts approach and includes a sample of concepts which are indexed most frequently in common introductory psychology textbooks. In a…
Descriptors: Tests, Psychology, Knowledge Level, Scores
Keller, Lisa A.; Keller, Robert R. – Applied Measurement in Education, 2015
Equating test forms is an essential activity in standardized testing, with increased importance with the accountability systems in existence through the mandate of Adequate Yearly Progress. It is through equating that scores from different test forms become comparable, which allows for the tracking of changes in the performance of students from…
Descriptors: Item Response Theory, Rating Scales, Standardized Tests, Scoring Rubrics
Towns, Marcy H. – Journal of Chemical Education, 2014
Chemistry faculty members are highly skilled in obtaining, analyzing, and interpreting physical measurements, but often they are less skilled in measuring student learning. This work provides guidance for chemistry faculty from the research literature on multiple-choice item development in chemistry. Areas covered include content, stem, and…
Descriptors: Multiple Choice Tests, Test Construction, Psychometrics, Test Items
Turner, Ronné Patrick – New England Journal of Higher Education, 2014
As an institution that receives close to 50,000 applications for the 2,800 spaces for the first-year entering class, Northeastern University took special interest in the College Board's March 5 announcement on the SAT redesign. In this article, associate vice president of enrollment and dean of admissions at Northeastern, Ronné Turner, describes…
Descriptors: College Entrance Examinations, Test Construction, Universities, Deans
Herman, Joan; Linn, Robert – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2013
Two consortia, the Smarter Balanced Assessment Consortium (Smarter Balanced) and the Partnership for Assessment of Readiness for College and Careers (PARCC), are currently developing comprehensive, technology-based assessment systems to measure students' attainment of the Common Core State Standards (CCSS). The consequences of the consortia…
Descriptors: Consortia, Student Evaluation, Educational Testing, Academic Standards
Mileff, Milo – Bulgarian Comparative Education Society, 2013
In the present paper and the discussion that follows, the author presents aspects of test construction and a careful description of instructional objectives. Constructing tests involves several stages such as describing language objectives, selecting appropriate test task, devising and assembling test tasks, and devising a scoring system for…
Descriptors: Behavioral Objectives, Test Construction, Norm Referenced Tests, Criterion Referenced Tests
Hsiao, Chien-Hua; Wu, Ying-Tien; Lin, Chung-Yen; Wong, Terrence William; Fu, Hsieh-Hai; Yeh, Ting-Kuang; Chang, Chung-Yen – Learning Environments Research, 2014
This study aimed to develop an instrument, named the inquiry-based laboratory classroom environment instrument (ILEI), for assessing senior high-school science students' preferred and perceived laboratory environment. A total of 262 second-year students, from a senior-high school in Taiwan, were recruited for this study. Four stages were included…
Descriptors: Test Construction, Science Laboratories, Inquiry, Science Instruction
Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi – International Journal of Evaluation and Research in Education, 2016
High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…
Descriptors: Item Response Theory, Test Items, Difficulty Level, Statistical Analysis
National Assessment Governing Board, 2014
Since 1973, the National Assessment of Educational Progress (NAEP) has gathered information about student achievement in mathematics. Results of these periodic assessments, produced in print and web-based formats, provide valuable information to a wide variety of audiences. They inform citizens about the nature of students' comprehension of the…
Descriptors: National Competency Tests, Mathematics Achievement, Mathematics Skills, Grade 4
Arjoon, Janelle A.; Xu, Xiaoying; Lewis, Jennifer E. – Journal of Chemical Education, 2013
Many of the instruments developed for research use by the chemistry education community are relatively new. Because psychometric evidence dictates the validity of interpretations made from test scores, gathering and reporting validity and reliability evidence is of utmost importance. Therefore, the purpose of this study was to investigate what…
Descriptors: Science Instruction, Measurement Techniques, Psychometrics, Evidence
Bermundo, Cesar B.; Bermundo, Alex B.; Ballester, Rex C. – Australian Association for Research in Education (NJ1), 2012
iBank is a project that uses software to create an item bank that stores quality questions, generates tests, and prints exams. The items come from analyzed teacher-constructed test questions, which provide the basis for discussing test results, by determining why a test item is or is not discriminating between the better and poorer students, and by…
Descriptors: Test Items, Computer Software, Test Results, Test Construction
van der Linden, Wim J.; Diao, Qi – Journal of Educational Measurement, 2011
In automated test assembly (ATA), the methodology of mixed-integer programming is used to select test items from an item bank to meet the specifications for a desired test form and optimize its measurement accuracy. The same methodology can be used to automate the formatting of the set of selected items into the actual test form. Three different…
Descriptors: Test Items, Test Format, Test Construction, Item Banks
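As a rough illustration of the item-selection idea summarized in the abstract above, the sketch below picks a fixed-length test form from a tiny item bank so that test information at a target ability level is maximized while content-coverage minimums are met. It is not the authors' method: it replaces a mixed-integer programming solver with exhaustive search (workable only for very small banks), assumes a two-parameter logistic (2PL) item model, and uses a hypothetical bank and hypothetical names (`item_information`, `assemble_test`, `min_per_content`).

```python
from itertools import combinations
import math


def item_information(a, b, theta):
    """Fisher information of a 2PL item (discrimination a, difficulty b) at ability theta."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)


def assemble_test(bank, length, theta, min_per_content):
    """Exhaustively search all forms of the given length.

    Returns (item ids, total information) for the feasible form with the
    highest information at theta, or (None, -1.0) if no form is feasible.
    A real ATA system would hand the same 0-1 decision problem to a
    mixed-integer programming solver instead of enumerating combinations.
    """
    best_ids, best_info = None, -1.0
    for combo in combinations(bank, length):
        # Count how many selected items fall in each content area.
        counts = {}
        for item in combo:
            counts[item["content"]] = counts.get(item["content"], 0) + 1
        # Skip forms that violate any content-coverage minimum.
        if any(counts.get(area, 0) < need for area, need in min_per_content.items()):
            continue
        info = sum(item_information(i["a"], i["b"], theta) for i in combo)
        if info > best_info:
            best_ids, best_info = [i["id"] for i in combo], info
    return best_ids, best_info


# Hypothetical five-item bank; parameter values are made up for illustration.
bank = [
    {"id": 1, "a": 1.0, "b": 0.0, "content": "algebra"},
    {"id": 2, "a": 2.0, "b": 0.0, "content": "algebra"},
    {"id": 3, "a": 1.5, "b": 0.0, "content": "geometry"},
    {"id": 4, "a": 0.5, "b": 0.0, "content": "geometry"},
    {"id": 5, "a": 1.8, "b": 1.0, "content": "algebra"},
]

selected, info = assemble_test(
    bank, length=3, theta=0.0, min_per_content={"algebra": 1, "geometry": 2}
)
print(selected, info)
```

With the geometry minimum set to 2, the weakly discriminating item 4 is forced into the form even though item 5 carries more information, which shows how test specifications constrain the optimization.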
Breakstone, Joel – Theory and Research in Social Education, 2014
This article considers the design process for new formative history assessments. Over the course of 3 years, my colleagues from the Stanford History Education Group and I designed, piloted, and revised dozens of "History Assessments of Thinking" (HATs). As we created HATs, we sought to gather information about their cognitive validity,…
Descriptors: History Instruction, Formative Evaluation, Tests, Correlation
Porter, Andrew; Polikoff, Morgan S.; Barghaus, Katherine M.; Yang, Rui – Educational Researcher, 2013
We describe an innovative automated test construction algorithm for building aligned achievement tests. By incorporating the algorithm into the test construction process, along with other test construction procedures for building reliable and unbiased assessments, the result is much more valid tests than result from current test construction…
Descriptors: Achievement Tests, Automation, Test Construction, Alignment (Education)