TY - CONF
T1 - Readable contrast mining method for heterogeneous bipartite networks with class label
AU - Nishiguchi, Mao
AU - Shirai, Yasuyuki
AU - Morita, Hiroyuki
AU - Goto, Yusuke
N1 - Funding Information:
This work was supported by JSPS KAKENHI Grant Numbers 17K00441 and 18K01891.
Publisher Copyright:
© Proceedings of the 24th Pacific Asia Conference on Information Systems: Information Systems (IS) for the Future, PACIS 2020. All rights reserved.
PY - 2020
Y1 - 2020
N2 - Recent IoT trends have been driving usage of multi-source data. In general, since the multi-source data have different nature, we need to make heterogeneous networks to represent them simultaneously. This paper presents a novel mining method to discover factors that characterize differences between classes for heterogeneous bipartite networks. In order to find the differences from such the large-scale networks efficiently, it is important to use distributed representations that preserve first-order and second-order proximity between vertices of the networks. We propose an effective representation method for heterogeneous bipartite networks with class label. And we propose a readable visualization method for revealing the factors on the embedding space. From computational experiments using real-world data which include the multimedia access logs and the results of questionnaire for the users, we show that the proposed method can acquire distributed representations with higher explanatory power than existing methods, and can discover important factors.
AB - Recent IoT trends have been driving usage of multi-source data. In general, since the multi-source data have different nature, we need to make heterogeneous networks to represent them simultaneously. This paper presents a novel mining method to discover factors that characterize differences between classes for heterogeneous bipartite networks. In order to find the differences from such the large-scale networks efficiently, it is important to use distributed representations that preserve first-order and second-order proximity between vertices of the networks. We propose an effective representation method for heterogeneous bipartite networks with class label. And we propose a readable visualization method for revealing the factors on the embedding space. From computational experiments using real-world data which include the multimedia access logs and the results of questionnaire for the users, we show that the proposed method can acquire distributed representations with higher explanatory power than existing methods, and can discover important factors.
KW - Contrast data mining
KW - Heterogeneous bipartite network embedding
KW - Multi-source data analysis
UR - http://www.scopus.com/inward/record.url?scp=85089126874&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85089126874&partnerID=8YFLogxK
M3 - Paper
AN - SCOPUS:85089126874
T2 - 24th Pacific Asia Conference on Information Systems: Information Systems (IS) for the Future, PACIS 2020
Y2 - 20 June 2020 through 24 June 2020
ER -