Research Article Open Access

Hybrid Soft Voting Ensemble of XGBoost and DNN for At-Risk Student Performance Prediction

Eugene Wan1, Po Chan Chiu1, Mohammad bin Hossin1, Hamizan Sharbini1, King Kuok Kuok2, Noor Hazlini Borhan1 and Chih How Bong1
  • 1 Faculty of Computer Science and Information Technology, Universiti Malaysia Sarawak, 94300 Kota Samarahan, Malaysia
  • 2 Faculty of Engineering, Computing and Science, Swinburne University of Technology Sarawak Campus, 93350 Kuching, Malaysia

Abstract

Early identification of at-risk students in higher education is important for timely academic intervention, yet conventional prediction methods often struggle with data imbalance and limited model precision. This study proposes a hybrid soft voting ensemble model that integrates Extreme Gradient Boosting (XGBoost) and Deep Neural Network (DNN) to enhance multi-class student grade prediction (A-F classification) and at-risk student identification. This proposed approach is evaluated using two datasets: a publicly available Kaggle Student Performance Dataset and a real-world dataset collected from a Database Concept and Design course at Universiti Malaysia Sarawak (UNIMAS). Both datasets undergo comprehensive pre-processing, including class imbalance handling using SMOTE and feature normalization using StandardScaler. Comparative evaluations were conducted against baseline models, including KNN, SVM, XGBoost and DNN, with all models optimised via hyperparameter tuning. Experimental results demonstrate that the proposed hybrid ensemble model outperforms the baseline models, achieving an accuracy of 77.37% and a macro F1-score of 74.50% on Dataset 1, and an accuracy of 74.13% with a macro F1-score of 81.53% on Dataset 2. The ensemble specifically demonstrates better sensitivity in detecting minority "at-risk" categories (Grades F and D). This study highlights the effectiveness of hybrid ensemble learning in improving predictive performance and supporting data-driven educational decision-making for early intervention in higher education.

Journal of Computer Science
Volume 22 No. 5, 2026, 1620-1635

DOI: https://doi.org/10.3844/jcssp.2026.1620.1635

Submitted On: 13 January 2026 Published On: 28 May 2026

How to Cite: Wan, E., Chiu, P. C., Hossin, M. B., Sharbini, H., Kuok, K. K., Borhan, N. H. & Bong, C. H. (2026). Hybrid Soft Voting Ensemble of XGBoost and DNN for At-Risk Student Performance Prediction. Journal of Computer Science, 22(5), 1620-1635. https://doi.org/10.3844/jcssp.2026.1620.1635

  • 40 Views
  • 10 Downloads
  • 0 Citations

Download

Keywords

  • At-Risk Student Performance Prediction
  • Machine Learning
  • Predictive Analytics
  • Hybrid Soft Voting Ensemble