Research Article Open Access

Diagnosis of Hepatitis Disease with Logistic Regression and Artificial Neural Networks

Alaa M. Elsayad1, Ahmed M. Nassef2 and Mujahed Al-Dhaifallah3
  • 1 Prince Sattam Bin Abdulaziz University, Saudi Arabia
  • 2 Tanta University, Egypt
  • 3 King Fahd University of Petroleum & Minerals, Saudi Arabia

Abstract

Hepatitis C refers to the inflammatory state of the liver caused by viruses, bacteria, fungi, and exposure to toxins such as alcohol and self-immunity. The diagnosis requires investigating many laboratory tests and comparing the results to those of the former patients with the same conditions. This study presents the results of our experiments to build a hybrid system that combines both neural networks and logistic regression for the diagnosing of the hepatitis dataset using clinical and laboratory test results. The first experiment compared the performances of Multilayer Perceptual Neural Networks (MLPNN) and Radial Basis Function Neural Network (RBFNN) versus the conventional and stepwise Logistic Regression (LR) algorithms, where the results demonstrated the ability of neural networks to deliver better performance than LR models. In the second experiment, the features selected by backward and forward LR models have been evaluated for the improvement of the performances of MLPNN and RBFNN models. The hepatitis dataset was downloaded from the machine-learning repository by the University of California at Ervine. Missing values have been imputed with a separate Classification and Regression Tree (C&RT) for each attribute. Classification models have been evaluated in terms of statistical accuracy, specificity, sensitivity, F1-score and the Area Under the Receiver Operating Characteristic Curve (AUCROC). Experimental results showed that the performances of neural network models have been improved when employing stepwise LR models to select only the predictive attributes. The hybrid system which combined both backward stepwise LR for attribute selection and MLPNN for classification has outperformed other systems in the diagnosis of the hepatitis dataset with 0.973 AUCROC for the training subset and 0.886 for the test one.

Journal of Computer Science
Volume 16 No. 3, 2020, 364-377

DOI: https://doi.org/10.3844/jcssp.2020.364.377

Submitted On: 11 December 2019 Published On: 27 March 2020

How to Cite: Elsayad, A. M., Nassef, A. M. & Al-Dhaifallah, M. (2020). Diagnosis of Hepatitis Disease with Logistic Regression and Artificial Neural Networks. Journal of Computer Science, 16(3), 364-377. https://doi.org/10.3844/jcssp.2020.364.377

  • 2,928 Views
  • 1,135 Downloads
  • 6 Citations

Download

Keywords

  • Hepatitis Dataset
  • Stepwise Logistic Regression
  • Attribute Selection
  • Multilayer Perceptron Neural Networks
  • Radial Basis Function Neural Networks