TY - GEN
T1 - A Consideration on Efficient Detection Method of Anormal Responses in High-dimensional Questionnaire Data
AU - Kurosawa, Kosuke
AU - Suganuma, Mutsumi
AU - Kameyama, Wataru
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - We have been studying to detect anormal responses in high-dimensional questionnaire data, that may affect the overall analysis results and are to be removed in the preprocess, more efficiently. In this paper, we apply principal component analysis (PCA) and multiple correspondence analysis (MCA) as dimension reduction methods, and x-means and gaussian mixture model (GMM) as clustering algorithms to the high-dimensional questionnaire data. Then, we examine the combinations of these methods for detecting anormal responses that are significantly far from the cluster centers or the distribution centers. Also, we employ principal component pursuit (PCP), where the absolute value sum for each response in the sparse matrix is used as anormal score to directly detect anormal responses. As a result, we find both of MCA+x-means and PCP achieve to detect reasonable anormal responses with shorter execution time.
AB - We have been studying to detect anormal responses in high-dimensional questionnaire data, that may affect the overall analysis results and are to be removed in the preprocess, more efficiently. In this paper, we apply principal component analysis (PCA) and multiple correspondence analysis (MCA) as dimension reduction methods, and x-means and gaussian mixture model (GMM) as clustering algorithms to the high-dimensional questionnaire data. Then, we examine the combinations of these methods for detecting anormal responses that are significantly far from the cluster centers or the distribution centers. Also, we employ principal component pursuit (PCP), where the absolute value sum for each response in the sparse matrix is used as anormal score to directly detect anormal responses. As a result, we find both of MCA+x-means and PCP achieve to detect reasonable anormal responses with shorter execution time.
KW - Anomaly Detection
KW - Dimension Reduction
KW - Multiple Component Analysis
KW - Principal Component Pursuit
KW - Questionnaire Data
UR - http://www.scopus.com/inward/record.url?scp=85147245782&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85147245782&partnerID=8YFLogxK
U2 - 10.1109/GCCE56475.2022.10014170
DO - 10.1109/GCCE56475.2022.10014170
M3 - Conference contribution
AN - SCOPUS:85147245782
T3 - GCCE 2022 - 2022 IEEE 11th Global Conference on Consumer Electronics
SP - 917
EP - 918
BT - GCCE 2022 - 2022 IEEE 11th Global Conference on Consumer Electronics
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 11th IEEE Global Conference on Consumer Electronics, GCCE 2022
Y2 - 18 October 2022 through 21 October 2022
ER -