Performance Analysis and Comparison of Cerebral Stroke Prediction Models on Imbalanced Datasets

98

Views

0

Downloads

Phankokkruad, Manop and Wacharawichanant, Sirirat (2022) Performance Analysis and Comparison of Cerebral Stroke Prediction Models on Imbalanced Datasets In: 7th IEEE/ACIS International Conference on Big Data, Cloud Computing, and Data Science Engineering (BCD 2022), August 4-6 , 2022, Danang, Vietnam. (In Press)

Abstract

A cerebral stroke is an interrupt blood flow to the brain leading cause of death. A number of risk factors increase the risk of stroke occurence because of lifestyle. Machine learning is effective techniques can be applied in prediction of stroke. The different kind of algorithms give the various accuracy and performance in the prediction. This study has proposed the four machine learning algorithm for classifiers to predict of cerebral stroke. The proposed model with various classifier has considered the risk factors such as age, hypertension, heart disease, average glucose level, BMI, and smoking status as feature attributes to predict cerebral stroke. This study conducted on two stroke datasets, and improve the imbalanced of between classes by using SMOTE. The result shows that XGBoost provided the highest accuracy of around 98.08% and 96.73% by comparing to the other machine learning algorithms. In addition, this study evaluates the models by analyzing the statistical parameters include accuracy, precision, sensitivity, F1 score, and AUC. The evaluation reveals that the XGBoost, Random Forest, AdaBoost and KNN classifier achieved the average AUC value of 0.851, 0.868, 0.670 and 0.851, respectively. All models provided the high confidence values, whereas the model with XGBoost classifier gave the highest performance.

Item Type:

Conference or Workshop Item (Paper)

Subjects:

Subjects > Computer Science > Artificial Intelligence

Deposited by:

Manop Phankokkruad

Date Deposited:

2022-07-04 11:39:55

Last Modified:

2023-01-12 22:42:24

Impact and Interest:

Statistics