Research Article Open Access

An Interval Type-2 Fuzzy Association Rule Mining Approach to Pattern Discovery in Breast Cancer Dataset

Olufunke Oladipupo1, Oluwole Olajide1, Stephen Adubi1, Jelili Oyelade1 and Zacchaeus Omogbadegun2
  • 1 Covenant University, Nigeria
  • 2 Chrisland University, Nigeria
Journal of Computer Science
Volume 17 No. 3, 2021, 330-348

DOI: https://doi.org/10.3844/jcssp.2021.330.348

Submitted On: 17 September 2020 Published On: 6 April 2021

How to Cite: Oladipupo, O., Olajide, O., Adubi, S., Oyelade, J. & Omogbadegun, Z. (2021). An Interval Type-2 Fuzzy Association Rule Mining Approach to Pattern Discovery in Breast Cancer Dataset. Journal of Computer Science, 17(3), 330-348. https://doi.org/10.3844/jcssp.2021.330.348

Abstract

In the literature, several methods explored to analyze breast cancer dataset have failed to sufficiently handle quantitative attribute sharp boundary problem to resolve inter and intra uncertainties in breast cancer dataset analysis. In this study an Interval Type-2 fuzzy association rule mining approach is proposed for pattern discovery in breast cancer dataset. In the first part of this analysis, the interval Type-2 fuzzification of the breast cancer dataset is carried out using Hao and Mendel approach. In the second part, FP-growth algorithm is adopted for associative pattern discovery from the fuzzified dataset from the first part. To define the intuitive words for breast cancer determinant factors and expert data interval, thirty (30) medical experts from specialized hospitals were consulted through questionnaire poling method. To establish the adequacy of the linguistic word defined by the expert, Jaccard similarity measure is used. This analysis is able to discover associative rules with minimum number of symptoms at confidence values as high as 91%. It also identifies High Bare Nuclei and High Uniformity of Cell Shape as strong determinant factors for diagnosing breast cancer. The proposed approach performed better in terms of rules generated when compared with traditional quantitative association rule mining. It is able to eliminate redundant rules which reduce the number of generated rules by 39.5% and memory usage by 22.6%. The discovered rules are viable in building a comprehensive and compact expert driven knowledge-base for breast cancer decision support or expert system.

  • 208 Views
  • 102 Downloads
  • 0 Citations

Download

Keywords

  • Data Mining
  • Breast Cancer
  • Interval Type-2 Fuzzy Association Rule Mining