TY - JOUR
T1 - Quasi-linear SVM classifier with segmented local offsets for imbalanced data classification
AU - Liang, Peifeng
AU - Zheng, Feng
AU - Li, Weite
AU - Hu, Jinglu
N1 - Publisher Copyright:
© 2018 Institute of Electrical Engineers of Japan. Published by John Wiley & Sons, Inc.
PY - 2019/2/1
Y1 - 2019/2/1
N2 - Within-class imbalance problems often occur in imbalanced data classification, which worsen the imbalance distribution problem and increase the learning concept complexity. However, most existing methods for the imbalanced data classification focus on rectifying the between-class imbalance problem, which is insufficient and inappropriate in many different scenarios. This paper proposes a simple yet effective support vector machine (SVM) classifier with local offset adjustment for imbalance classification problems. First, a geometry-based partitioning method is modified for imbalanced datasets to divide the input space into multiple linearly separable partitions along the potential separation boundary. Then an F-score-based method is applied to obtain local offsets optimized on each local cluster. Finally, by constructing a quasi-linear kernel based on the partitioning information, a quasi-linear SVM classifier with local offsets is constructed for the imbalanced datasets. Simulation results on different real-world datasets show that the proposed method is effective for imbalanced data classifications.
AB - Within-class imbalance problems often occur in imbalanced data classification, which worsen the imbalance distribution problem and increase the learning concept complexity. However, most existing methods for the imbalanced data classification focus on rectifying the between-class imbalance problem, which is insufficient and inappropriate in many different scenarios. This paper proposes a simple yet effective support vector machine (SVM) classifier with local offset adjustment for imbalance classification problems. First, a geometry-based partitioning method is modified for imbalanced datasets to divide the input space into multiple linearly separable partitions along the potential separation boundary. Then an F-score-based method is applied to obtain local offsets optimized on each local cluster. Finally, by constructing a quasi-linear kernel based on the partitioning information, a quasi-linear SVM classifier with local offsets is constructed for the imbalanced datasets. Simulation results on different real-world datasets show that the proposed method is effective for imbalanced data classifications.
KW - imbalanced data classification
KW - kernel composition
KW - local linear partition
KW - local offset method
KW - support vector machine
KW - within-class imbalances
UR - http://www.scopus.com/inward/record.url?scp=85054555903&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054555903&partnerID=8YFLogxK
U2 - 10.1002/tee.22808
DO - 10.1002/tee.22808
M3 - Article
AN - SCOPUS:85054555903
SN - 1931-4973
VL - 14
SP - 289
EP - 296
JO - IEEJ Transactions on Electrical and Electronic Engineering
JF - IEEJ Transactions on Electrical and Electronic Engineering
IS - 2
ER -