Evaluation of Classification Models for Predicting Mortality Rate Using Thyroid Cancer Data

Norah Saleh Alghamdi

doi:10.3844/jcssp.2019.131.142

Research Article Open Access

Evaluation of Classification Models for Predicting Mortality Rate Using Thyroid Cancer Data

Norah Saleh Alghamdi¹

¹ Princess Nourah bint Abdulrahman University, Saudi Arabia

Abstract

Machine Learning (ML) can potentially enhance predictions in real-life domains. This study presents an evaluation and comparison of different ML methods which can be applied on thyroid cancer dataset, called Prostate, Lung, Colorectal and Ovarian (PLCO), of approximately 155,000 participants with thyroid cancer occurrence and mortality incidence. The ML models are explored for predicting mortality rates of patients with thyroid cancer. These models include the Logistic Regression model (LR), K-Neighbors model (KN), Support Vector Classifier (SVC), Gaussian Naïve Bayes (GNB), decision tree classifier (DT), Random Forest classifier (RF), ada boost classifier (AdaB) and Gradient Boosting classifier (GB). The results reveal that AdaB and GB classifiers have the best performance among the models. The results also show that different predictive models can significantly differ with others in terms of their performance evaluated by various metrics. This study shows that the chosen parameters for classifiers will affect their performance; therefore, it is important to explore and evaluate them before final implementation.

Journal of Computer Science

Volume 15 No. 1, 2019, 131-142

DOI: https://doi.org/10.3844/jcssp.2019.131.142

Submitted On: 11 October 2018 Published On: 21 January 2019

How to Cite: Alghamdi, N. S. (2019). Evaluation of Classification Models for Predicting Mortality Rate Using Thyroid Cancer Data. Journal of Computer Science, 15(1), 131-142. https://doi.org/10.3844/jcssp.2019.131.142

Copyright: © 2019 Norah Saleh Alghamdi. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

5,074 Views
2,361 Downloads
1 Citations

Download

Keywords

Machine Learning
Classification
Thyroid Cancer
Data Mining
Predictive Model
Unsupervised Learning Algorithm
Supervised Algorithm