TY - JOUR
T1 - Feature subset selection
T2 - A correlation-based SVM filter approach
AU - Li, Boyang
AU - Wang, Qiangwei
AU - Hu, Jinglu
PY - 2011/3
Y1 - 2011/3
N2 - The central criterion of feature selection is that good feature sets contain features that are highly correlated with the output, yet uncorrelated with each other. Based on this criterion, we address the problem of feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering is proposed to group features into some clusters based on the correlation between two features. As a result, a feature is highly correlated to any other feature in the same cluster but uncorrelated to the features in other clusters. From each cluster, we select a feature as the delegate based on its influence quantities on the output. The influence quantities are measured by the feature sensitivity in the SVM. The proposed approach can identify relevant features and eliminate redundancy among them effectively. The effectiveness of the proposed approach is demonstrated through comparisons with other methods using real-world data with different dimensions.
AB - The central criterion of feature selection is that good feature sets contain features that are highly correlated with the output, yet uncorrelated with each other. Based on this criterion, we address the problem of feature selection through correlation-based feature clustering and support vector machine (SVM) based feature ranking. Correlation-based clustering is proposed to group features into some clusters based on the correlation between two features. As a result, a feature is highly correlated to any other feature in the same cluster but uncorrelated to the features in other clusters. From each cluster, we select a feature as the delegate based on its influence quantities on the output. The influence quantities are measured by the feature sensitivity in the SVM. The proposed approach can identify relevant features and eliminate redundancy among them effectively. The effectiveness of the proposed approach is demonstrated through comparisons with other methods using real-world data with different dimensions.
KW - Correlation-based clustering
KW - Feature ranking
KW - Feature selection
KW - Support vector machine
UR - http://www.scopus.com/inward/record.url?scp=79551667272&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79551667272&partnerID=8YFLogxK
U2 - 10.1002/tee.20641
DO - 10.1002/tee.20641
M3 - Article
AN - SCOPUS:79551667272
SN - 1931-4973
VL - 6
SP - 173
EP - 179
JO - IEEJ Transactions on Electrical and Electronic Engineering
JF - IEEJ Transactions on Electrical and Electronic Engineering
IS - 2
ER -