TY - JOUR
T1 - Entropy-based sliced inverse regression
AU - Hino, Hideitsu
AU - Wakayama, Keigo
AU - Murata, Noboru
N1 - Funding Information:
The authors would like to express their special thanks to the editor, the associate editor and reviewers whose comments led to valuable improvements of the manuscript. The authors are grateful to S. Weisberg and L. Scrucca for providing implementation of SIR, IRE, SAVE, DR, and MSIR. Part of this work was supported by “Development of Large Scale Energy Storage System with High-safety and Cost-competitiveness” from NEDO, Japan.
PY - 2013
Y1 - 2013
N2 - Abstract The importance of dimension reduction has been increasing according to the growth of the size of available data in many fields. An appropriate dimension reduction method of raw data helps to reduce computational time and to expose the intrinsic structure of complex data. Sliced inverse regression is a well-known dimension reduction method for regression, which assumes an elliptical distribution for the explanatory variable, and ingeniously reduces the problem of dimension reduction to a simple eigenvalue problem. Sliced inverse regression is based on the strong assumptions on the data distribution and the form of regression function, and there are a number of methods to relax or remove these assumptions to extend the applicability of the inverse regression method. However, each method is known to have its drawbacks either theoretically or empirically. To alleviate drawbacks in the existing methods, a dimension reduction method for regression based on the notion of conditional entropy minimization is proposed. Using entropy as a measure of dispersion of data, a low dimensional subspace is estimated without assuming any specific distribution nor any regression function. The proposed method is shown to perform comparable or superior to the conventional methods through experiments using artificial and real-world datasets.
AB - Abstract The importance of dimension reduction has been increasing according to the growth of the size of available data in many fields. An appropriate dimension reduction method of raw data helps to reduce computational time and to expose the intrinsic structure of complex data. Sliced inverse regression is a well-known dimension reduction method for regression, which assumes an elliptical distribution for the explanatory variable, and ingeniously reduces the problem of dimension reduction to a simple eigenvalue problem. Sliced inverse regression is based on the strong assumptions on the data distribution and the form of regression function, and there are a number of methods to relax or remove these assumptions to extend the applicability of the inverse regression method. However, each method is known to have its drawbacks either theoretically or empirically. To alleviate drawbacks in the existing methods, a dimension reduction method for regression based on the notion of conditional entropy minimization is proposed. Using entropy as a measure of dispersion of data, a low dimensional subspace is estimated without assuming any specific distribution nor any regression function. The proposed method is shown to perform comparable or superior to the conventional methods through experiments using artificial and real-world datasets.
KW - Keywords Sliced inverse regression Dimension reduction Entropy
UR - http://www.scopus.com/inward/record.url?scp=84879054518&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84879054518&partnerID=8YFLogxK
U2 - 10.1016/j.csda.2013.05.017
DO - 10.1016/j.csda.2013.05.017
M3 - Article
AN - SCOPUS:84879054518
SN - 0167-9473
VL - 67
SP - 105
EP - 114
JO - Computational Statistics and Data Analysis
JF - Computational Statistics and Data Analysis
ER -