ERIC - Search Results

Showing all 12 results

Detecting Rater Bias in Mixed-Format Assessments

Peer reviewed

Stefanie A. Wind; Yuan Ge – Measurement: Interdisciplinary Research and Perspectives, 2024

Mixed-format assessments made up of multiple-choice (MC) items and constructed response (CR) items that are scored using rater judgments include unique psychometric considerations. When these item types are combined to estimate examinee achievement, information about the psychometric quality of each component can depend on that of the other. For…

Descriptors: Interrater Reliability, Test Bias, Multiple Choice Tests, Responses

Using Regularization to Identify Measurement Bias across Multiple Background Characteristics: A Penalized Expectation-Maximization Algorithm

Peer reviewed

William C. M. Belzak; Daniel J. Bauer – Journal of Educational and Behavioral Statistics, 2024

Testing for differential item functioning (DIF) has undergone rapid statistical developments recently. Moderated nonlinear factor analysis (MNLFA) allows for simultaneous testing of DIF among multiple categorical and continuous covariates (e.g., sex, age, ethnicity, etc.), and regularization has shown promising results for identifying DIF among…

Descriptors: Test Bias, Algorithms, Factor Analysis, Error of Measurement

Detecting Differential Item Functioning among Multiple Groups Using IRT Residual DIF Framework

Peer reviewed

Hwanggyu Lim; Danqi Zhu; Edison M. Choe; Kyung T. Han – Journal of Educational Measurement, 2024

This study presents a generalized version of the residual differential item functioning (RDIF) detection framework in item response theory, named GRDIF, to analyze differential item functioning (DIF) in multiple groups. The GRDIF framework retains the advantages of the original RDIF framework, such as computational efficiency and ease of…

Descriptors: Item Response Theory, Test Bias, Test Reliability, Test Construction

Exploring the Influence of Response Styles on Continuous Scale Assessments: Insights from a Novel Modeling Approach

Peer reviewed

Hung-Yu Huang – Educational and Psychological Measurement, 2025

The use of discrete categorical formats to assess psychological traits has a long-standing tradition that is deeply embedded in item response theory models. The increasing prevalence and endorsement of computer- or web-based testing has led to greater focus on continuous response formats, which offer numerous advantages in both respondent…

Descriptors: Response Style (Tests), Psychological Characteristics, Item Response Theory, Test Reliability

Assessing the Psychometric Properties of Quality Experience in Undergraduate Research Using Item Response Theory

Peer reviewed

Tien-Ling Hu; Dubravka Svetina Valdivia – Research in Higher Education, 2024

Undergraduate research, recognized as one of the High-Impact Practices (HIPs), has demonstrated a positive association with diverse student learning outcomes. Understanding the pivotal quality factors essential for its efficacy is important for enhancing student success. This study evaluates the psychometric properties of survey items employed to…

Descriptors: Undergraduate Students, Student Research, Student Experience, Psychometrics

Examining the Relationship between Randomization Strategies and Control Group Crossover in Higher Education Interventions. EdWorkingPaper No. 24-1083

Catherine Mata; Katharine Meyer; Lindsay Page – Annenberg Institute for School Reform at Brown University, 2024

This article examines the risk of crossover contamination in individual-level randomization, a common concern in experimental research, in the context of a large-enrollment college course. While individual-level randomization is more efficient for assessing program effectiveness, it also increases the potential for control group students to cross…

Descriptors: Chemistry, Science Instruction, Undergraduate Students, Large Group Instruction

Programme Evaluation in Action: Theory to Practice from an Asian Educational Context

Peer reviewed

Ser Ming Mark Lee; Wei Cheng Liu – Asia Pacific Journal of Education, 2024

Programme evaluation has developed tremendously over the past 50 years, with a proliferation of evaluation research, an increase in the institutionalization of evaluation, and growth in the professionalization of evaluation. However, existing research and developments are still largely in North America, Europe, Australia, and New Zealand, with…

Descriptors: Foreign Countries, Evaluation Research, Evaluation Methods, Evaluation Criteria

The Factor Structure and Measurement Invariance of the Autism Spectrum Quotient-28: A Cross-Cultural Comparison between Malaysia and the Netherlands

Peer reviewed

Zhong Jian Chee; Anke M. Scheeren; Marieke de Vries – Autism: The International Journal of Research and Practice, 2024

Despite several psychometric advantages over the 50-item Autism Spectrum Quotient, an instrument used to measure autistic traits, the abridged AQ-28 and its cross-cultural validity have not been examined as extensively. Therefore, this study aimed to examine the factor structure and measurement invariance of the AQ-28 in 818 Dutch (M[subscript…

Descriptors: Autism Spectrum Disorders, Questionnaires, Factor Structure, Factor Analysis

A Many-Facet Rasch Measurement Approach to Analyze the Prepared Science Laboratory Activities Based on Science Process Skills and Views of Pre-Service Science Teachers

Peer reviewed

Emrah Higde; Ahmet Volkan Yüzüak; Zekiye Merve Öcal; Hilal Aktamis – Journal of Baltic Science Education, 2024

The Many-Facet Rasch model is frequently used to analyse and minimize disparities in rater (judge) severity in performance evaluations, in which raters assign scores to test-takers' performances. The aim of the present study was to analyse science teacher candidates' laboratory activities by using the Many-Facet Rasch model.…

Descriptors: Science Laboratories, Learning Activities, Science Process Skills, Student Attitudes

Using a QuantCrit Approach to Develop and Collect Evidence of Validity for a Measure of Community Cultural Wealth

Peer reviewed

Rosario A. Marroquín-Flores; Rose Marie Tijerina; Mason Tedeschi; Sofia Banjara; Redmon Warmsley; Luke McFather; Zianna Casas; Lisa B. Limeri – CBE - Life Sciences Education, 2024

Students who hold minoritized identities are underrepresented in science, technology, engineering, and math (STEM) fields. Educational institutions often apply a deficit lens to understanding disproportionate outcomes between minoritized students and those from the cultural majority. Community Cultural Wealth (CCW) is an asset-based framework that…

Descriptors: Undergraduate Students, Minority Group Students, Low Income Students, STEM Education

An Alternative to SAT?: An Investigation into the Validity of a High School Capstone Project as an Assessment of Post-Secondary Readiness

Medjy Pierre-Louis – ProQuest LLC, 2024

School systems across the United States increasingly use performance-based assessments (PBAs) as alternatives to traditional standardized tests, like the SAT, to make post-secondary and workforce readiness (PWR) determinations. However, very little research has been conducted to validate such alternative assessments as valid indicators of a…

Descriptors: High School Students, Rural Schools, Performance Based Assessment, Student Evaluation

Charting the Future of Assessments. Full Report

Patrick C. Kyllonen; Amit Sevak; Teresa Ober; Ikkyu Choi; Jesse Sparks; Daniel Fishtein – ETS Research Institute, 2024

Assessment refers to a broad array of approaches for measuring or evaluating a person's (or group of persons') skills, behaviors, dispositions, or other attributes. Assessments range from standardized tests used in admissions, employee selection, licensure examinations, and domestic and international large-scale assessments of cognitive and…

Descriptors: Performance Based Assessment, Evaluation Criteria, Evaluation Methods, Test Bias
