ERIC Number: EJ1418764
Record Type: Journal
Publication Date: 2024
Pages: 37
Abstractor: As Provided
ISBN: N/A
ISSN: ISSN-1360-2357
EISSN: EISSN-1573-7608
Available Date: N/A
An Empirical Assessment of SMOTE Variants Techniques and Interpretation Methods in Improving the Accuracy and the Interpretability of Student Performance Models
Hayat Sahlaoui; El Arbi Abdellaoui Alaoui; Said Agoujil; Anand Nayyar
Education and Information Technologies, v29 n5 p5447-5483 2024
Predicting student performance using educational data is a significant area of machine learning research. However, class imbalance in datasets and the challenge of developing interpretable models can hinder accuracy. This study compares different variations of the Synthetic Minority Oversampling Technique (SMOTE) combined with classification algorithms to create prediction models. The results show that SMOTE with Edited Nearest Neighbors is superior, and the balanced random forest classifier performs better when using SMOTE-ENN, achieving 96% accuracy, precision, and F-value. Smote also has faster execution time. For model interpretability, combining Local Interpretable Model-agnostic Explanations (LIME) and SHapley Additive exPlanations (SHAP) provides deeper insights. LIME is suitable for single-prediction interpretation, while SHAP is better for overall model interpretation. This research offers guidelines to mitigate data imbalance and improve fairness in education through data-driven innovations like early warning systems. It also educates academics on explainability approaches to facilitate wider use of machine learning methods.
Descriptors: Sampling, Classification, Algorithms, Prediction, Models, Accuracy, Artificial Intelligence, Data, Academic Achievement, Data Interpretation
Springer. Available from: Springer Nature. One New York Plaza, Suite 4600, New York, NY 10004. Tel: 800-777-4643; Tel: 212-460-1500; Fax: 212-460-1700; e-mail: customerservice@springernature.com; Web site: https://bibliotheek.ehb.be:2123/
Publication Type: Journal Articles; Reports - Research
Education Level: N/A
Audience: N/A
Language: English
Sponsor: N/A
Authoring Institution: N/A
Grant or Contract Numbers: N/A
Author Affiliations: N/A