Cost-Sensitive Extreme Gradient Boosting for Imbalanced Classification of Breast Cancer Diagnosis

315

Views

0

Downloads

Phankokkruad, Manop (2020) Cost-Sensitive Extreme Gradient Boosting for Imbalanced Classification of Breast Cancer Diagnosis In: 2020 10th IEEE International Conference on Control System, Computing and Engineering (ICCSCE), 2020-08-21, Penang, Malaysia.

Abstract

The clinical information can enhance the doctors for predicting and diagnosing the diseases also making the right decisions. Breast cancer is the most dangerous disease, early diagnosis can improve a chance of survival and can support clinical treatment. Detecting breast cancer takes a lot of time and it is hard to classification. However, the problem of the classification occurs when there is an unequal distribution of classes the dataset. This is caused by the low performance in the traditional machine learning models. For this reason, this work proposed the cost-sensitive XGBoost model, which is an improved version of the XGBoost model in conjunction with cost-sensitive learning. The models were applied to classify the four breast cancer datasets that contained the imbalanced data. In the experiment, this work determined the best parameters on each dataset by the hyperparameters optimization technique before configuring the models. The results indicated that the cost-sensitive XGBoost model had been skillful, and could improve classification accuracy in four datasets. In addition, this work evaluated the model performance by accuracy, ROC AUC, and k- Fold cross-validation to ensure that the new models is accurate.

Item Type:

Conference or Workshop Item (Paper)

Identification Number (DOI):

Deposited by:

ระบบ อัตโนมัติ

Date Deposited:

2021-09-09 23:53:49

Last Modified:

2021-09-30 06:08:41

Impact and Interest:

Statistics