Publication Date
In 2025 | 0 |
Since 2024 | 16 |
Since 2021 (last 5 years) | 105 |
Since 2016 (last 10 years) | 258 |
Since 2006 (last 20 years) | 385 |
Descriptor
Foreign Countries | 462 |
Test Items | 462 |
Item Response Theory | 385 |
Difficulty Level | 136 |
Test Construction | 102 |
Mathematics Tests | 88 |
Achievement Tests | 82 |
Test Validity | 80 |
Secondary School Students | 79 |
Test Reliability | 78 |
Item Analysis | 77 |
More ▼ |
Source
Author
Bulut, Okan | 7 |
Kelderman, Henk | 7 |
Baghaei, Purya | 6 |
Janssen, Rianne | 6 |
Meijer, Rob R. | 6 |
Wang, Wen-Chung | 6 |
Glas, Cees A. W. | 5 |
Hartig, Johannes | 5 |
Khorramdel, Lale | 5 |
Yamamoto, Kentaro | 5 |
Andrich, David | 4 |
More ▼ |
Publication Type
Education Level
Audience
Researchers | 4 |
Practitioners | 2 |
Teachers | 1 |
Location
Turkey | 41 |
Germany | 30 |
Canada | 24 |
Netherlands | 23 |
Indonesia | 22 |
Taiwan | 22 |
Australia | 21 |
United States | 20 |
China | 16 |
Iran | 16 |
United Kingdom (England) | 16 |
More ▼ |
Laws, Policies, & Programs
Individuals with Disabilities… | 1 |
No Child Left Behind Act 2001 | 1 |
Assessments and Surveys
What Works Clearinghouse Rating
Maria Bolsinova; Jesper Tijmstra; Leslie Rutkowski; David Rutkowski – Journal of Educational and Behavioral Statistics, 2024
Profile analysis is one of the main tools for studying whether differential item functioning can be related to specific features of test items. While relevant, profile analysis in its current form has two restrictions that limit its usefulness in practice: It assumes that all test items have equal discrimination parameters, and it does not test…
Descriptors: Test Items, Item Analysis, Generalizability Theory, Achievement Tests
Ibrahim Kasujja; Hugo Melgar-Quinonez; Joweria Nambooze – SAGE Open, 2023
Background: School feeding programs' evaluation requires the measurement of food insecurity, a more objective indicator, within school in low-income countries. The Global Child Nutrition Foundation (GCNF) uses subjective indicators to report school feeding coverage rates across many countries that participate in the global survey of school meal…
Descriptors: Hunger, Food, Program Effectiveness, Psychometrics
Kaya Uyanik, Gulden; Demirtas Tolaman, Tugba; Gur Erdogan, Duygu – International Journal of Assessment Tools in Education, 2021
This paper aims to examine and assess the questions included in the "Turkish Common Exam" for sixth graders held in the first semester of 2018 which is one of the common exams carried out by The Measurement and Evaluation Centers, in terms of question structure, quality and taxonomic value. To this end, the test questions were examined…
Descriptors: Foreign Countries, Grade 6, Standardized Tests, Test Items
Meike Akveld; George Kinnear – International Journal of Mathematical Education in Science and Technology, 2024
Many universities use diagnostic tests to assess incoming students' preparedness for mathematics courses. Diagnostic test results can help students to identify topics where they need more practice and give lecturers a summary of strengths and weaknesses in their class. We demonstrate a process that can be used to make improvements to a mathematics…
Descriptors: Mathematics Tests, Diagnostic Tests, Test Items, Item Analysis
Kunz, Tanja; Meitinger, Katharina – Field Methods, 2022
Although list-style open-ended questions generally help us gain deeper insights into respondents' thoughts, opinions, and behaviors, the quality of responses is often compromised. We tested a dynamic and a follow-up design to motivate respondents to give higher quality responses than with a static design, but without overburdening them. Our…
Descriptors: Online Surveys, Item Response Theory, Test Items, Test Format
Katharina Meitinger; Tanja Kunz – Sociological Methods & Research, 2024
Previous research reveals that the visual design of open-ended questions should match the response task so that respondents can infer the expected response format. Based on a web survey including specific probes in a list-style open-ended question format, we experimentally tested the effects of varying numbers of answer boxes on several indicators…
Descriptors: Visual Aids, Design, Cognitive Processes, Test Items
Avsar, Asiye Sengül – Participatory Educational Research, 2022
It is necessary to supply proof regarding the construct validity of the scales. Especially, when new scales are developed the construct validity is researched by the Exploratory Factor Analysis (EFA). Generally, factor extraction is performed via the Principal Component Analysis (PCA) which is not exactly factor analysis and the Principal Axis…
Descriptors: Factor Analysis, Automation, Construct Validity, Item Response Theory
Andrés Christiansen; Rianne Janssen – Educational Assessment, Evaluation and Accountability, 2024
In international large-scale assessments, students may not be compelled to answer every test item: a student can decide to skip a seemingly difficult item or may drop out before the end of the test is reached. The way these missing responses are treated will affect the estimation of the item difficulty and student ability, and ultimately affect…
Descriptors: Test Items, Item Response Theory, Grade 4, International Assessment
Oluwaseyi Aina Gbolade Opesemowo – Research in Social Sciences and Technology, 2023
Local Item Dependence (LID) is a desecration of Local Item Independence (LII) which can lead to overestimating or underestimating a candidate's ability in mathematics items and create validity problems. The study investigated the intra and inter-LID of mathematics items. The study made use of ex-post facto research. The population encompassed all…
Descriptors: Foreign Countries, Secondary School Students, Item Response Theory, Test Items
Balbuena, Sherwin – International Journal of Assessment Tools in Education, 2023
Depression is a latent characteristic that is measured through self-reported or clinician-mediated instruments such as scales and inventories. The precision of depression estimates largely depends on the validity of the items used and on the truthfulness of people responding to these items. The existing methodology in instrumentation based on a…
Descriptors: Depression (Psychology), Test Items, Test Validity, Test Reliability
Diyorjon Abdullaev; Djuraeva Laylo Shukhratovna; Jamoldinova Odinaxon Rasulovna; Jumanazarov Umid Umirzakovich; Olga V. Staroverova – International Journal of Language Testing, 2024
Local item dependence (LID) refers to the situation where responses to items in a test or questionnaire are influenced by responses to other items in the test. This could be due to shared prompts, item content similarity, and deficiencies in item construction. LID due to a shared prompt is highly probable in cloze tests where items are nested…
Descriptors: Undergraduate Students, Foreign Countries, English (Second Language), Second Language Learning
Umi Laili Yuhana; Eko Mulyanto Yuniarno; Wenny Rahayu; Eric Pardede – Education and Information Technologies, 2024
In an online learning environment, it is important to establish a suitable assessment approach that can be adapted on the fly to accommodate the varying learning paces of students. At the same time, it is essential that assessment criteria remain compliant with the expected learning outcomes of the relevant education standard which predominantly…
Descriptors: Adaptive Testing, Electronic Learning, Elementary School Students, Student Evaluation
Chidubem Deborah Adamu – Research in Social Sciences and Technology, 2024
The Fourth Industrial Revolution Chemistry Teachers Effectiveness Scale (4IRCTES) was evaluated in secondary schools in Southwest Nigeria to determine its item discrimination, item parameters, and model-data fit. This study utilised a descriptive survey research design and included 4,986 Chemistry teachers in the southwestern region of Nigeria,…
Descriptors: Secondary School Teachers, Science Teachers, Chemistry, Foreign Countries
Sari, Halil Ibrahim; Karaman, Mehmet Akif – International Journal of Assessment Tools in Education, 2018
The current study shows the applications of both classical test theory (CTT) and item response theory (IRT) to the psychology data. The study discusses item level analyses of General Mattering Scale produced by the two theories as well as strengths and weaknesses of both measurement approaches. The survey consisted of a total of five Likert-type…
Descriptors: Measures (Individuals), Test Theory, Item Response Theory, Likert Scales
Qiwei He – International Journal of Assessment Tools in Education, 2023
Collaborative problem solving (CPS) is inherently an interactive, conjoint, dual-strand process that considers how a student reasons about a problem as well as how s/he interacts with others to regulate social processes and exchange information (OECD, 2013). Measuring CPS skills presents a challenge for obtaining consistent, accurate, and reliable…
Descriptors: Cooperative Learning, Problem Solving, Test Items, International Assessment