Publication Date
In 2025 | 0 |
Since 2024 | 0 |
Since 2021 (last 5 years) | 1 |
Since 2016 (last 10 years) | 2 |
Since 2006 (last 20 years) | 11 |
Descriptor
Item Analysis | 16 |
Test Construction | 16 |
Test Content | 16 |
Test Items | 14 |
Foreign Countries | 6 |
Psychometrics | 6 |
Difficulty Level | 5 |
Item Response Theory | 5 |
Test Validity | 4 |
Construct Validity | 3 |
Evaluation Criteria | 3 |
More ▼ |
Source
Author
Publication Type
Journal Articles | 10 |
Reports - Research | 7 |
Reports - Evaluative | 5 |
Reports - Descriptive | 3 |
Numerical/Quantitative Data | 2 |
Speeches/Meeting Papers | 2 |
Dissertations/Theses -… | 1 |
Tests/Questionnaires | 1 |
Education Level
Higher Education | 6 |
Postsecondary Education | 5 |
Elementary Secondary Education | 4 |
Secondary Education | 2 |
Adult Education | 1 |
Elementary Education | 1 |
Grade 6 | 1 |
High Schools | 1 |
Audience
Researchers | 1 |
Teachers | 1 |
Location
Australia | 1 |
Canada | 1 |
Japan | 1 |
Mississippi | 1 |
Nigeria | 1 |
Singapore | 1 |
South Korea | 1 |
Laws, Policies, & Programs
Assessments and Surveys
Bender Visual Motor Gestalt… | 1 |
Goodenough Harris Drawing Test | 1 |
Graduate Record Examinations | 1 |
International English… | 1 |
Program for International… | 1 |
Test of English as a Foreign… | 1 |
What Works Clearinghouse Rating
Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023
Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…
Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)
Shilo, Gila – Educational Research Quarterly, 2015
The purpose of the study was to examine the quality of open test questions directed to high school and college students. One thousand five hundred examination questions from various fields of study were examined using criteria based on the writing centers directions and guidelines. The 273 questions that did not fulfill the criteria were analyzed…
Descriptors: Questioning Techniques, Questionnaires, Test Construction, High School Students
Keller, Lisa A.; Keller, Robert R. – Applied Measurement in Education, 2015
Equating test forms is an essential activity in standardized testing, with increased importance with the accountability systems in existence through the mandate of Adequate Yearly Progress. It is through equating that scores from different test forms become comparable, which allows for the tracking of changes in the performance of students from…
Descriptors: Item Response Theory, Rating Scales, Standardized Tests, Scoring Rubrics
Towns, Marcy H. – Journal of Chemical Education, 2014
Chemistry faculty members are highly skilled in obtaining, analyzing, and interpreting physical measurements, but often they are less skilled in measuring student learning. This work provides guidance for chemistry faculty from the research literature on multiple-choice item development in chemistry. Areas covered include content, stem, and…
Descriptors: Multiple Choice Tests, Test Construction, Psychometrics, Test Items
Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi – International Journal of Evaluation and Research in Education, 2016
High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…
Descriptors: Item Response Theory, Test Items, Difficulty Level, Statistical Analysis
Choi, Bo Young; Park, Heerak; Nam, Suk Kyung; Lee, Jayoung; Cho, Daeyeon; Lee, Sang Min – Career Development Quarterly, 2011
The purpose of this study was to develop a Korean College Stress Inventory (KCSI), which is designed to measure Korean college students' experiences and symptoms of career stress. Even though there have been numerous scales related to career issues, few scales measure the career stress construct and its dimensions. Factor structure, internal…
Descriptors: College Students, Factor Structure, Psychometrics, Stress Variables
Jacobsen, Jared; Ackermann, Richard; Eguez, Jane; Ganguli, Debalina; Rickard, Patricia; Taylor, Linda – Journal of Applied Testing Technology, 2011
A computer adaptive test (CAT) is a delivery methodology that serves the larger goals of the assessment system in which it is embedded. A thorough analysis of the assessment system for which a CAT is being designed is critical to ensure that the delivery platform is appropriate and addresses all relevant complexities. As such, a CAT engine must be…
Descriptors: Delivery Systems, Testing Programs, Computer Assisted Testing, Foreign Countries
Kumazawa, Takaaki – ProQuest LLC, 2011
Although classroom assessment is one of the most frequent practices carried out by teachers in all educational programs, limited research has been conducted to investigate the dependability and validity of criterion-referenced tests (CRTs). The main purpose of this study is to develop a criterion-referenced test for first-year Japanese university…
Descriptors: Criterion Referenced Tests, Test Construction, Test Validity, English (Second Language)
National Assessment Governing Board, 2007
This paper presents the assessment and item specifications for the National Assessment of Educational Progress (NAEP) 2009 mathematics assessment. "Chapter Two" contains descriptions of the five major content areas of mathematics (Number Properties and Operations, Measurement, Geometry, Data Analysis, Statistics, and Probability, and…
Descriptors: Test Items, Mathematics Achievement, Mathematics Tests, National Competency Tests
Niemi, David; Wang, Jia; Wang, Haiwen; Vallone, Julia; Griffin, Noelle – National Center for Research on Evaluation, Standards, and Student Testing (CRESST), 2007
There are usually many testing activities going on in a school, with different tests serving different purposes, thus organization and planning are key in creating an efficient system in assessing the most important educational objectives. In the ideal case, an assessment system will be able to inform on student learning, instruction and…
Descriptors: School Administration, Educational Objectives, Administration, Public Schools
Sireci, Stephen G. – 1995
The purpose of this paper is to clarify the seemingly discrepant views of test theorists and test developers about terminology related to the evaluation of test content. The origin and evolution of the concept of content validity are traced, and the concept is reformulated in a way that emphasizes the notion that content domain definition,…
Descriptors: Construct Validity, Content Validity, Definitions, Item Analysis
Budescu, David V.; And Others – 1994
Modified Parallel Analysis (MPA) is a heuristic method for assessing "approximate unidimensionality" of item pools. It compares the second eigenvalue of the observed correlation matrix with the corresponding eigenvalue extracted from a "parallel" matrix generated by a unidimensional and locally independent model. Revised…
Descriptors: Equations (Mathematics), Heuristics, Item Analysis, Item Banks
Pommerich, Mary – Journal of Technology, Learning, and Assessment, 2004
As testing moves from paper-and-pencil administration toward computerized administration, how to present tests on a computer screen becomes an important concern. Of particular concern are tests that contain necessary information that cannot be displayed on screen all at once for an item. Ideally, the method of presentation should not interfere…
Descriptors: Test Content, Computer Assisted Testing, Multiple Choice Tests, Computer Interfaces

Wilgosh, L.; And Others – Canadian Journal of Special Education, 1990
Item analysis data were collected for the Bender Visual Motor Gestalt Test and Goodenough-Harris Drawing Test, from urban and rural Alberta (Canada) youngsters and Inuit youngsters from the Northwest Territories (Canada). Both tests were inadequate in individual item difficulty levels, suggesting the necessity of revising scoring systems and…
Descriptors: Cultural Context, Difficulty Level, Elementary Education, Eskimos
Graf, Edith Aurora; Peterson, Stephen; Steffen, Manfred; Lawless, René – ETS Research Report Series, 2005
We describe the item modeling development and evaluation process as applied to a quantitative assessment with high-stakes outcomes. In addition to expediting the item-creation process, a model-based approach may reduce pretesting costs, if the difficulty and discrimination of model-generated items may be predicted to a predefined level of…
Descriptors: Psychometrics, Accuracy, Item Analysis, High Stakes Tests
Previous Page | Next Page »
Pages: 1 | 2