TY - GEN
T1 - An Effective Feature Selection Scheme for Healthcare Data Classification Using Binary Particle Swarm Optimization
AU - Chen, Yiyuan
AU - Wang, Yufeng
AU - Cao, Liang
AU - Jin, Qun
N1 - Funding Information:
This work was supported by the JiangSu Educational Bureau Project under Grant 14KJA510004, State Key Laboratory of Novel Software Technology under grant KFKT2017B14.
PY - 2018/12/26
Y1 - 2018/12/26
N2 - Feature selection (FS) is one of fundamental data processing techniques in various machine learning algorithms, especially for classification of healthcare data. However, it is a challenging issue due to the large search space. This paper proposed a confidence based and cost effective feature selection method using binary particle swarm optimization, CCFS. First, CCFS improves search effectiveness by developing a new updating mechanism, in which confidence of each feature is explicitly considered, including the correlation between feature and categories, and historically selected frequency of each feature. Second, the classification accuracy, the feature reduction ratio, and the feature cost are comprehensively incorporated into the design of the fitness function. The proposed method has been verified in UCI cancer classification dataset (Lung Cancer). The experimental result shows the effectiveness of the proposed method, in terms of accuracy and feature selection cost.
AB - Feature selection (FS) is one of fundamental data processing techniques in various machine learning algorithms, especially for classification of healthcare data. However, it is a challenging issue due to the large search space. This paper proposed a confidence based and cost effective feature selection method using binary particle swarm optimization, CCFS. First, CCFS improves search effectiveness by developing a new updating mechanism, in which confidence of each feature is explicitly considered, including the correlation between feature and categories, and historically selected frequency of each feature. Second, the classification accuracy, the feature reduction ratio, and the feature cost are comprehensively incorporated into the design of the fitness function. The proposed method has been verified in UCI cancer classification dataset (Lung Cancer). The experimental result shows the effectiveness of the proposed method, in terms of accuracy and feature selection cost.
KW - Binary Particle Swarm Optimization
KW - Feature selection
KW - Healthcare data classification
KW - Swarm intelligence
UR - http://www.scopus.com/inward/record.url?scp=85061313170&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85061313170&partnerID=8YFLogxK
U2 - 10.1109/ITME.2018.00160
DO - 10.1109/ITME.2018.00160
M3 - Conference contribution
AN - SCOPUS:85061313170
T3 - Proceedings - 9th International Conference on Information Technology in Medicine and Education, ITME 2018
SP - 703
EP - 707
BT - Proceedings - 9th International Conference on Information Technology in Medicine and Education, ITME 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th International Conference on Information Technology in Medicine and Education, ITME 2018
Y2 - 19 October 2018 through 21 October 2018
ER -