Publication Date
In 2025 | 1 |
Since 2024 | 12 |
Since 2021 (last 5 years) | 42 |
Since 2016 (last 10 years) | 115 |
Descriptor
Test Bias | 115 |
Test Reliability | 100 |
Test Validity | 80 |
Test Items | 41 |
Item Response Theory | 34 |
Test Construction | 34 |
Psychometrics | 29 |
Foreign Countries | 28 |
Scores | 28 |
Scoring | 20 |
Student Evaluation | 20 |
Location
Texas | 4 |
Turkey | 4 |
Canada | 3 |
China | 3 |
Florida | 3 |
New Mexico | 3 |
Singapore | 3 |
Australia | 2 |
Illinois | 2 |
Indonesia | 2 |
Netherlands | 2 |
Laws, Policies, & Programs
Every Student Succeeds Act… | 4 |
Individuals with Disabilities… | 3 |
Rehabilitation Act 1973… | 3 |
Elementary and Secondary… | 1 |
Elementary and Secondary… | 1 |
Stefanie A. Wind; Yuan Ge – Measurement: Interdisciplinary Research and Perspectives, 2024
Mixed-format assessments made up of multiple-choice (MC) items and constructed response (CR) items that are scored using rater judgments include unique psychometric considerations. When these item types are combined to estimate examinee achievement, information about the psychometric quality of each component can depend on that of the other. For…
Descriptors: Interrater Reliability, Test Bias, Multiple Choice Tests, Responses
William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024
Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity), and regularization has shown promising results for identifying DIF among…
Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement
Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…
Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction
Using Differential Item Functioning to Test for Interrater Reliability in Constructed Response Items
Walker, Cindy M.; Göçer Sahin, Sakine – Educational and Psychological Measurement, 2020
The purpose of this study was to investigate a new way of evaluating interrater reliability that can allow one to determine if two raters differ with respect to their rating on a polytomous rating scale or constructed response item. Specifically, differential item functioning (DIF) analyses were used to assess interrater reliability and compared…
Descriptors: Test Bias, Interrater Reliability, Responses, Correlation
Leighton, Jacqueline P.; Lehman, Blair – Educational Measurement: Issues and Practice, 2020
In this digital ITEMS module, Dr. Jacqueline Leighton and Dr. Blair Lehman review differences between think-aloud interviews to measure problem-solving processes and cognitive labs to measure comprehension processes. Learners are introduced to historical, theoretical, and procedural differences between these methods and how to use and analyze…
Descriptors: Protocol Analysis, Interviews, Problem Solving, Cognitive Processes
Menold, Natalja – Field Methods, 2023
While numerical bipolar rating scales may evoke positivity bias, little is known about the corresponding bias in verbal bipolar rating scales. The choice of verbalization of the middle category may lead to response bias, particularly if it is not in line with the scale polarity. Unipolar and bipolar seven-category rating scales in which the…
Descriptors: Rating Scales, Test Bias, Verbal Tests, Responses
Hung-Yu Huang – Educational and Psychological Measurement, 2025
The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…
Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability
Tien-Ling Hu; Dubravka Svetina Valdivia – Research in Higher Education, 2024
Undergraduate research, recognized as one of the High-Impact Practices (HIPs), has demonstrated a positive association with diverse student learning outcomes. Understanding the pivotal quality factors essential for its efficacy is important for enhancing student success. This study evaluates the psychometric properties of survey items employed to…
Descriptors: Undergraduate Students, Student Research, Student Experience, Psychometrics
Gübes, Nese Öztürk – Participatory Educational Research, 2021
The aim of this study is to show how a many-facet Rasch measurement model (MFRM) can be used for quality control whilst monitoring a musical aptitude examination. The data used in this study were gathered from a musical aptitude examination administered in the 2019-2020 academic year to select teacher candidates for a music education department…
Descriptors: Foreign Countries, Music Education, Teacher Education Programs, Preservice Teacher Education
Xiao, Leifeng; Hau, Kit-Tai – Applied Measurement in Education, 2023
We compared coefficient alpha with five alternatives (omega total, omega RT, omega h, GLB, and coefficient H) in two simulation studies. Results showed for unidimensional scales, (a) all indices except omega h performed similarly well for most conditions; (b) alpha is still good; (c) GLB and coefficient H overestimated reliability with small…
Descriptors: Test Theory, Test Reliability, Factor Analysis, Test Length
Nicholas W. Affrunti; Eric Rossen – National Association of School Psychologists, 2023
In this data brief, we examine the scores and pass rates for the Praxis School Psychologist tests (both Praxis 5402 and the newer version, Praxis 5403) by racial-ethnic group and gender for the period September 1, 2022 to August 31, 2023. The Praxis School Psychologist tests are the most often used external assessment of competency by school…
Descriptors: School Psychology, School Psychologists, Counselor Certification, Test Bias
Catherine Mata; Katharine Meyer; Lindsay Page – Annenberg Institute for School Reform at Brown University, 2024
This article examines the risk of crossover contamination in individual-level randomization, a common concern in experimental research, in the context of a large-enrollment college course. While individual-level randomization is more efficient for assessing program effectiveness, it also increases the potential for control group students to cross…
Descriptors: Chemistry, Science Instruction, Undergraduate Students, Large Group Instruction
Muslihin, Heri Yusuf; Suryana, Dodi; Ahman; Suherman, Uman; Dahlan, Tina Hayati – International Journal of Instruction, 2022
Self-determination can help students think and act positively and make realistic choices, so that they can make decisions responsibly. This study aimed to develop and validate a questionnaire to measure student self-determination. The study was conducted in 2019 and involved 406 university students as participants…
Descriptors: Test Validity, Test Reliability, Item Response Theory, Questionnaires
Maïano, Christophe; Morin, Alexandre J. S.; Gagnon, Cynthia; Olivier, Elizabeth; Tracey, Danielle; Craven, Rhonda G.; Bouchard, Stéphane – Journal of Autism and Developmental Disorders, 2023
The objective of the study was to validate adapted versions of the Glasgow Anxiety Scale for people with Intellectual Disabilities (GAS-ID) simultaneously developed in English and French. A sample of 361 youth with mild to moderate intellectual disability (ID) (M = 15.78 years) from Australia (English-speaking) and Canada (French-speaking)…
Descriptors: Intellectual Disability, Anxiety, French, English
van Rensburg, Clarisse; Mostert, Karina – Journal of Student Affairs in Africa, 2023
Student well-being has gradually become a topic of interest in higher education, and the accurate, valid, and reliable measurement of well-being constructs is crucial in the South African context. This study examined item bias and configural, metric and scalar invariance of the Satisfaction with Life Scale (SWLS) for South African first-year…
Descriptors: Life Satisfaction, Measures (Individuals), Foreign Countries, College Freshmen