TY - JOUR
T1 - Influence-Balanced XGBoost
T2 - Improving XGBoost for Imbalanced Data Using Influence Functions
AU - Sutou, Akiyoshi
AU - Wang, Jinfang
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2024
Y1 - 2024
N2 - Decision tree boosting algorithms, such as XGBoost, have demonstrated superior predictive performance on tabular data for supervised learning compared to neural networks. However, recent studies on loss functions for imbalanced data have primarily focused on deep learning. The goal of this study is to improve the XGBoost algorithm for better performance on unbalanced data. To this end, Influence-balanced loss (IBL), originally introduced in deep learning, was applied to enhance the performance of the XGBoost algorithm. As a side effect, the proposed method was also found to perform well on datasets prone to over-specialization. Furthermore, we conducted a comparison between the proposed method and conventional techniques using 38 publicly available datasets. Our method outperforms other methods in terms of F1-score and Matthews correlation coefficient.
AB - Decision tree boosting algorithms, such as XGBoost, have demonstrated superior predictive performance on tabular data for supervised learning compared to neural networks. However, recent studies on loss functions for imbalanced data have primarily focused on deep learning. The goal of this study is to improve the XGBoost algorithm for better performance on unbalanced data. To this end, Influence-balanced loss (IBL), originally introduced in deep learning, was applied to enhance the performance of the XGBoost algorithm. As a side effect, the proposed method was also found to perform well on datasets prone to over-specialization. Furthermore, we conducted a comparison between the proposed method and conventional techniques using 38 publicly available datasets. Our method outperforms other methods in terms of F1-score and Matthews correlation coefficient.
KW - Imbalanced data
KW - influence-balanced loss
KW - over-specialization
KW - XGBoost
UR - http://www.scopus.com/inward/record.url?scp=85213028690&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85213028690&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2024.3520159
DO - 10.1109/ACCESS.2024.3520159
M3 - Article
AN - SCOPUS:85213028690
SN - 2169-3536
VL - 12
SP - 193473
EP - 193486
JO - IEEE Access
JF - IEEE Access
ER -