Stroke Risk Assessment with Classical ML Methods

Mohammad Aref Alshraideh; Najwan Alshraideh; Abedalrahman Alshraideh; Yara Alkayed; Yasmin Al Trabsheh; Heba Alshraideh; Bahaaldeen Alshraideh

doi:10.3844/jcssp.2025.134.145

Research Article Open Access

Stroke Risk Assessment with Classical ML Methods

Mohammad Aref Alshraideh¹, Najwan Alshraideh², Abedalrahman Alshraideh³, Yara Alkayed², Yasmin Al Trabsheh⁴, Heba Alshraideh² and Bahaaldeen Alshraideh⁵

¹ Department of Artificial Intelligence, the University of Jordan, Jordan
² Department of Internal Medicine, the University of Jordan, Jordan
³ Internal Medicine, East Midlands Deanery, NHS, England, United Kingdom
⁴ Clinical Attache, United Lincolnshire Hospitals, NHS, England, United Kingdom
⁵ Department of Special Surgery, Division of Urology, the University of Jordan, Jordan

Abstract

Stroke, often caused by a disruption in the supply of essential oxygen, blood, and nutrients to the brain, represents a significant global health challenge. Due to limited resources, developing countries like Ethiopia face unique obstacles in identifying and treating strokes. This study explores the potential of Machine Learning (ML) techniques to predict stroke risk and facilitate early detection and intervention. By doing so, it aims to reduce the burdens of disability, mortality, and healthcare costs associated with strokes. In this research, we utilized four machine learning models: Stochastic Gradient Descent (SGD), Support Vector Machine (SVM), K-Nearest Neighbor (KNN), and Random Forest Classifier (RFC). These models were developed using a dataset from Kaggle, which contained information from 5,110 individuals and various attributes related to each person. Our methodology followed a systematic approach that included data understanding, preparation, experimentation, rectifying inconsistencies, removing duplicates, and resolving errors within the dataset. The ML models were created and rigorously assessed within the Anaconda Python programming environment, with performance evaluation conducted through Confusion Matrix analysis. Our findings revealed that the Random Forest Decision Tree classifier outperformed the others, boasting an accuracy rate of 99.3%. The support vector machine was closely behind at 96.63%, while the k-nearest neighbor and stochastic gradient descent achieved acceptable accuracy. Consequently, we recommend the utilization of the random forest decision tree classifier for further stroke risk prediction endeavors.

Journal of Computer Science

Volume 21 No. 1, 2025, 134-145

DOI: https://doi.org/10.3844/jcssp.2025.134.145

Submitted On: 19 August 2024 Published On: 18 December 2024

How to Cite: Alshraideh, M. A., Alshraideh, N., Alshraideh, A., Alkayed, Y., Al Trabsheh, Y., Alshraideh, H. & Alshraideh, B. (2025). Stroke Risk Assessment with Classical ML Methods. Journal of Computer Science, 21(1), 134-145. https://doi.org/10.3844/jcssp.2025.134.145

Copyright: © 2025 Mohammad Aref Alshraideh, Najwan Alshraideh, Abedalrahman Alshraideh, Yara Alkayed, Yasmin Al Trabsheh, Heba Alshraideh and Bahaaldeen Alshraideh. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

3,014 Views
1,444 Downloads
4 Citations

Download

Keywords

Stroke Risk
Stroke Risk Prediction
Random Forest
SGD
SVM