Research Article Open Access

Uncertainty-Aware Ensemble Models for Improved Defect Detection in Noisy Data

Madhavi Perla1, Gadi Lava Raju2, A Radha Krishna3, E. Sree Devi4, Bechoo Lal5, Aruna Bhaskar K5 and Solleti Phani Kumar5
  • 1 Department of Computer Science and Engineering, AI&ML, GMR Institute of Technology (GMRIT), Rajam, Andhra Pradesh, India
  • 2 Department of Computer Science and Engineering, Aditya University, Aditya Nagar, Surampalem, Andhra Pradesh, India
  • 3 CSE (AI and ML) Department, Pragati Engineering College, Surampalem, Andhra Pradesh, India
  • 4 Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation (KLEF), K L University, Guntur, Andhra Pradesh, India
  • 5 Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation (KLEF), K L University, Guntur, Andhra Pradesh, India

Abstract

Software defect prediction plays a crucial role in ensuring software quality and reliability, especially as modern systems become more complex and data rich. This study introduces an uncertainty-aware ensemble learning framework aimed at improving defect classification performance in noisy and imbalanced datasets, particularly those from the PROMISE and NASA KC1 repositories. The proposed model integrates multiple classifiers in a multi-learner ensemble structure to enhance generalization, improve true positive rates, and address the limitations of conventional single-model approaches. Key techniques include chi-square-based feature selection, ensemble pruning to avoid overfitting, and neural network-based classification through Extreme Learning Machines (ELMs). The methodology emphasizes the use of both homogeneous and heterogeneous ensembles, with training and prediction phases structured to handle data sparsity, high dimensionality, and class imbalance. Runtime experiments using decision trees, Naïve Bayes, and cost-sensitive learning demonstrated superior results for the ensemble model compared to traditional classifiers. Evaluation metrics such as accuracy, F-measure (0.9729), recall (0.7143), true positive rate (0.9857), and ROC AUC further validated the ensemble’s predictive robustness. Experimental results on the KC1 dataset showed that the proposed model outperformed baseline models in both accuracy and area under the ROC curve. Advanced data balancing techniques, including under-sampling, over-sampling, and active learning, were employed to improve the model’s ability to identify minority class instances. These findings suggest that uncertainty-aware ensemble approaches are effective tools for improving defect detection, particularly in noisy and imbalanced environments.

Journal of Computer Science
Volume 22 No. 4, 2026, 1396-1405

DOI: https://doi.org/10.3844/jcssp.2026.1396.1405

Submitted On: 31 March 2025 Published On: 27 April 2026

How to Cite: Perla, M., Raju, G. L., Krishna, A. R., Devi, E. S., Lal, B., K, A. B. & Kumar, S. P. (2026). Uncertainty-Aware Ensemble Models for Improved Defect Detection in Noisy Data. Journal of Computer Science, 22(4), 1396-1405. https://doi.org/10.3844/jcssp.2026.1396.1405

  • 29 Views
  • 15 Downloads
  • 0 Citations

Download

Keywords

  • Software Defect Prediction
  • Ensemble Learning
  • Promise Dataset
  • Neural Networks Class Imbalance
  • ROC Curve
  • Extreme Learning Machine (ELM)
  • Naïve Bayes
  • Feature Selection
  • Software Quality