Publication Date
In 2025 | 0 |
Since 2024 | 1 |
Since 2021 (last 5 years) | 7 |
Since 2016 (last 10 years) | 28 |
Since 2006 (last 20 years) | 35 |
Descriptor
Test Items | 47 |
Item Response Theory | 31 |
Foreign Countries | 17 |
Test Construction | 13 |
Test Validity | 11 |
Difficulty Level | 10 |
Test Reliability | 10 |
Statistical Analysis | 8 |
Item Analysis | 7 |
Psychometrics | 7 |
Comparative Analysis | 6 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Practitioners | 2 |
Researchers | 2 |
Location
Florida | 4 |
Turkey | 3 |
California | 2 |
United States | 2 |
Canada | 1 |
China | 1 |
Colorado | 1 |
Europe | 1 |
Georgia | 1 |
Germany | 1 |
Illinois | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 2 |
Digit Span Test | 1 |
International Association for… | 1 |
Trends in International… | 1 |
Wechsler Intelligence Scale… | 1 |
What Works Clearinghouse Rating
Trantham, Pamela S.; Sikorski, Jonathon; de Ayala, R. J.; Doll, Beth – Educational Assessment, Evaluation and Accountability, 2022
There is an extensive need for school systems to reliably assess the data literacy and data use skills of their educators. To address this need, the current study seeks to refine the NU Data Knowledge Scale (NUDKS) for assessing teacher data literacy for classroom data. A data-based decision-making framework provides the theoretical underpinnings…
Descriptors: Item Response Theory, Information Literacy, Data Use, Knowledge Level
Balbuena, Sherwin – International Journal of Assessment Tools in Education, 2023
Depression is a latent characteristic that is measured through self-reported or clinician-mediated instruments such as scales and inventories. The precision of depression estimates largely depends on the validity of the items used and on the truthfulness of people responding to these items. The existing methodology in instrumentation based on a…
Descriptors: Depression (Psychology), Test Items, Test Validity, Test Reliability
Sari, Halil Ibrahim; Karaman, Mehmet Akif – International Journal of Assessment Tools in Education, 2018
The current study shows the applications of both classical test theory (CTT) and item response theory (IRT) to the psychology data. The study discusses item level analyses of General Mattering Scale produced by the two theories as well as strengths and weaknesses of both measurement approaches. The survey consisted of a total of five Likert-type…
Descriptors: Measures (Individuals), Test Theory, Item Response Theory, Likert Scales
Betts, Joe; Muntean, William; Kim, Doyoung; Jorion, Natalie; Dickison, Philip – Journal of Applied Testing Technology, 2019
Clinical judgment has become an increasingly important aspect of modern health service professionals. To ensure public safety, licensure exams must go beyond assessing only knowledge and skills when evaluating entry-level professions to evaluating clinical judgment. This importance necessitates licensure and certification examinations in these…
Descriptors: Decision Making, Licensing Examinations (Professions), Certification, Nursing Education
Shan Lin; Jian Wang – Journal of Baltic Science Education, 2024
Scientific thinking constitutes a vital component of scientific competencies, crucial for citizens to adapt to the evolving societal landscape. To cultivate students' scientific thinking, teachers should possess an adequate professional knowledge foundation, which encompasses pedagogical content knowledge (PCK). Assessing teachers' PCK of…
Descriptors: Secondary School Teachers, Teacher Attitudes, Biology, Pedagogical Content Knowledge
Fukuzawa, Sherry; deBraga, Michael – Journal of Curriculum and Teaching, 2019
Graded Response Method (GRM) is an alternative to multiple-choice testing where students rank options according to their relevance to the question. GRM requires discrimination and inference between statements and is a cost-effective critical thinking assessment in large courses where open-ended answers are not feasible. This study examined…
Descriptors: Alternative Assessment, Multiple Choice Tests, Test Items, Test Format
Sayin, Ayfer; Sata, Mehmet – International Journal of Assessment Tools in Education, 2022
The aim of the present study was to examine Turkish teacher candidates' competency levels in writing different types of test items by utilizing Rasch analysis. In addition, the effect of the expertise of the raters scoring the items written by the teacher candidates was examined within the scope of the study. 84 Turkish teacher candidates…
Descriptors: Foreign Countries, Item Response Theory, Evaluators, Expertise
Sutiarso, Sugeng; Rosidin, Undang; Sulistiawan, Aan – European Journal of Educational Research, 2022
This research is a developmental research aiming at developing a good mathematical test instrument using polytomous responses based on classical and modern theories. This research design uses the Plomp model, which consists of five stages, (1) preliminary investigation, (2) design, (3) realization/construction, (4) revision, and (5) implementation…
Descriptors: Mathematics Instruction, Mathematics Tests, Item Response Theory, Test Items
Winchip, Emily; Stevenson, Howard; Milner, Alison – Educational Review, 2019
As the Global Education Reform Movement (GERM) spreads, key questions that attempt to identify both the nature and the increasing scope and scale of this phenomenon become empirically significant. The concern of this article is to highlight some of the complexities of measuring one key element of the GERM: the privatisation of public education…
Descriptors: Privatization, Foreign Countries, Item Response Theory, Probability
Schoen, Robert C.; Liu, Sicong; Yang, Xiaotong; Paek, Insu – Grantee Submission, 2017
The Early Fractions Test is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test is to serve as a student pretest covariate and a test of baseline equivalence in the larger study. In this report, we discuss our…
Descriptors: Mathematics Achievement, Fractions, Mathematics Tests, Grade 3
Deliu, Gabriela; Miron, Cristina; Opariuc-Dan, Cristian – Journal of Baltic Science Education, 2019
The aim of this research is to study the merits and complementarity of Construct Mapping and Categorical Principal Components Analysis as two approaches that explore the dimensionality of multiple-choice items in achievement tests. Data from the two forms of the Romanian National Assessment Tests on Science were used to explore the dimensionality…
Descriptors: Multiple Choice Tests, Test Items, Achievement Tests, Science Tests
Wang, Shuyan – Language Learning and Development, 2023
Relatively late mastery of scalar implicatures has been suggested to correlate with children's immature processing capacities, such as their limited working memory. Yet, many studies that tested for a link between children's working memory and their computation of scalar implicatures have failed to find any correlation. One possible reason is that…
Descriptors: Language Processing, Mandarin Chinese, English, Short Term Memory
Timofte, Roxana S.; Siminiciuc, Laura – Acta Didactica Napocensia, 2018
The scope this article was to develop an instrument to measure Chemistry students' ability regarding 'physical bonding' and to validate it. A number of 24 items were developed by mapping items to cognitive levels described by the Marzano taxonomy. A number of N=73 students were evaluated. Four items exhibited a MNSQ >1.3 and were eliminated…
Descriptors: Item Response Theory, Test Construction, Science Tests, Taxonomy
Mendoza, Arturo; Martínez, Joaquín – International Journal of Language Testing, 2023
Language placement tests (LPTs) are used to assess students' proficiency in a progressive manner in the target language. Based on their performance, students are assigned to stepped language courses. These tests are usually considered low stakes because they do not have significant consequences in students' lives, which is perhaps the reason why…
Descriptors: Language Tests, English (Second Language), Second Language Learning, Second Language Instruction
Bolt, Daniel M.; Wang, Yang Caroline; Meyer, Robert H.; Pier, Libby – Policy Analysis for California Education, PACE, 2019
We illustrate the application of mixture IRT [item response theory] models to evaluate the possibility of respondent confusion due to the negative wording of certain items on a social-emotional learning (SEL) assessment. Using actual student self-report ratings on four social-emotional learning scales collected from students in grades 3-12 from…
Descriptors: Item Response Theory, Rating Scales, Test Items, Social Development