Stefanie A. Wind; Yuan Ge – Measurement: Interdisciplinary Research and Perspectives, 2024
Mixed-format assessments made up of multiple-choice (MC) items and constructed response (CR) items that are scored using rater judgments include unique psychometric considerations. When these item types are combined to estimate examinee achievement, information about the psychometric quality of each component can depend on that of the other. For…
Descriptors: Interrater Reliability, Test Bias, Multiple Choice Tests, Responses
William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024
Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity), and regularization has shown promising results for identifying DIF among…
Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement
Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024
This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…
Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction
Hung-Yu Huang – Educational and Psychological Measurement, 2025
The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…
Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability
Tien-Ling Hu; Dubravka Svetina Valdivia – Research in Higher Education, 2024
Undergraduate research, recognized as one of the High-Impact Practices (HIPs), has demonstrated a positive association with diverse student learning outcomes. Understanding the pivotal quality factors essential for its efficacy is important for enhancing student success. This study evaluates the psychometric properties of survey items employed to…
Descriptors: Undergraduate Students, Student Research, Student Experience, Psychometrics
Catherine Mata; Katharine Meyer; Lindsay Page – Annenberg Institute for School Reform at Brown University, 2024
This article examines the risk of crossover contamination in individual-level randomization, a common concern in experimental research, in the context of a large-enrollment college course. While individual-level randomization is more efficient for assessing program effectiveness, it also increases the potential for control group students to cross…
Descriptors: Chemistry, Science Instruction, Undergraduate Students, Large Group Instruction
Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024
Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…
Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria
Zhong Jian Chee; Anke M. Scheeren; Marieke de Vries – Autism: The International Journal of Research and Practice, 2024
Despite several psychometric advantages over the 50-item Autism Spectrum Quotient, an instrument used to measure autistic traits, the abridged AQ-28 and its cross-cultural validity have not been examined as extensively. Therefore, this study aimed to examine the factor structure and measurement invariance of the AQ-28 in 818 Dutch (M[subscript…
Descriptors: Autism Spectrum Disorders, Questionnaires, Factor Structure, Factor Analysis
Emrah Higde; Ahmet Volkan Yüzüak; Zekiye Merve Öcal; Hilal Aktamis – Journal of Baltic Science Education, 2024
The Many-Facet Rasch model is frequently used to analyse and minimize disparities in rater (judge) severity in performance evaluations, in which raters assign scores to test-takers' performances. The aim of the present study was to analyse science teacher candidates' laboratory activities by using the Many-Facet Rasch model.…
Descriptors: Science Laboratories, Learning Activities, Science Process Skills, Student Attitudes
Rosario A. Marroquín-Flores; Rose Marie Tijerina; Mason Tedeschi; Sofia Banjara; Redmon Warmsley; Luke McFather; Zianna Casas; Lisa B. Limeri – CBE - Life Sciences Education, 2024
Students who hold minoritized identities are underrepresented in science, technology, engineering, and math (STEM) fields. Educational institutions often apply a deficit lens to understanding disproportionate outcomes between minoritized students and those from the cultural majority. Community Cultural Wealth (CCW) is an asset-based framework that…
Descriptors: Undergraduate Students, Minority Group Students, Low Income Students, STEM Education
Medjy Pierre-Louis – ProQuest LLC, 2024
School systems across the United States increasingly use performance-based assessments (PBAs) as alternatives to traditional standardized tests, like the SAT, to make post-secondary and workforce readiness (PWR) determinations. However, very little research has been conducted to validate such alternative assessments as valid indicators of a…
Descriptors: High School Students, Rural Schools, Performance Based Assessment, Student Evaluation
Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024
Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…
Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias