Improve Imbalanced Multiclass Classification Based on Modified SMOTE and Feature Selection for Student Grade Prediction

Improve Imbalanced Multiclass Classification Based on Modified SMOTE and Feature Selection for Student Grade Prediction

DOI: 10.4018/978-1-7998-8686-0.ch014
OnDemand:
(Individual Chapters)
Available
$37.50
No Current Special Offers
TOTAL SAVINGS: $37.50

Abstract

In higher education institutions (HEI), the ability to predict student grades as an early warning system is one of the important areas that gained attention to improve educational outcomes. Over the years, machine learning techniques have facilitated and successfully addressed student grade prediction for identifying the potentially weak students in a particular course. However, dealing with an imbalanced multiclass classification dataset is challenging due to biased results towards predicting the minority class. Therefore, this chapter proposes a method that can increase the classification performance by using a modified synthetic minority oversampling technique and feature selection (MSMOTE-FS). The experiments tested the proposed method's effectiveness by utilizing four oversampling techniques and six standard classification algorithms. This finding indicated that the proposed method gives promising results to improve the accuracy in multiclass classification of student grade prediction.
Chapter Preview
Top

Developing prediction models using machine learning for student grade prediction has been increased to determine student success. Previous researchers have given the contribution in predicting student grade prediction in many areas of the case study. Zhang et al. (X. Zhang et al., 2018) introduced Synthetic Minority over-sampling Technique (SMOTE) and feature selection to reduce the overfitting for student grade prediction using 538 of student data. They compared the performance using four traditional classifiers, including Naïve Bayes (NB), Decision Tree (DT), Multilayer Perceptron (MLP), and Support Vector Machine (SVM). The results showed that MLP was found the most effective with an accuracy of 65.90% and 64.02% for training and testing, respectively.

Complete Chapter List

Search this Book:
Reset