TY - GEN
T1 - Wasserstein k-means with sparse simplex projection
AU - Fukunaga, Takumi
AU - Kasai, Hiroyuki
N1 - Funding Information:
ACKNOWLEDGMENT H. Kasai was partially supported by JSPS KAKENHI Grant Numbers JP16K00031 and JP17H01732, and by Support Center for Advanced Telecomm. Technology Research (SCAT).
Publisher Copyright:
© 2020 IEEE
PY - 2020
Y1 - 2020
N2 - This paper presents a proposal of a faster Wasserstein k-means algorithm for histogram data by reducing Wasserstein distance computations and exploiting sparse simplex projection. We shrink data samples, centroids, and the ground cost matrix, which leads to considerable reduction of the computations used to solve optimal transport problems without loss of clustering quality. Furthermore, we dynamically reduced the computational complexity by removing lower-valued data samples and harnessing sparse simplex projection while keeping the degradation of clustering quality lower. We designate this proposed algorithm as sparse simplex projection based Wasserstein k-means, or SSPW k-means. Numerical evaluations conducted with comparison to results obtained using Wasserstein k-means algorithm demonstrate the effectiveness of the proposed SSPW k-means for real-world datasets.
AB - This paper presents a proposal of a faster Wasserstein k-means algorithm for histogram data by reducing Wasserstein distance computations and exploiting sparse simplex projection. We shrink data samples, centroids, and the ground cost matrix, which leads to considerable reduction of the computations used to solve optimal transport problems without loss of clustering quality. Furthermore, we dynamically reduced the computational complexity by removing lower-valued data samples and harnessing sparse simplex projection while keeping the degradation of clustering quality lower. We designate this proposed algorithm as sparse simplex projection based Wasserstein k-means, or SSPW k-means. Numerical evaluations conducted with comparison to results obtained using Wasserstein k-means algorithm demonstrate the effectiveness of the proposed SSPW k-means for real-world datasets.
UR - http://www.scopus.com/inward/record.url?scp=85098612986&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85098612986&partnerID=8YFLogxK
U2 - 10.1109/ICPR48806.2021.9412131
DO - 10.1109/ICPR48806.2021.9412131
M3 - Conference contribution
AN - SCOPUS:85098612986
T3 - Proceedings - International Conference on Pattern Recognition
SP - 1627
EP - 1634
BT - Proceedings of ICPR 2020 - 25th International Conference on Pattern Recognition
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th International Conference on Pattern Recognition, ICPR 2020
Y2 - 10 January 2021 through 15 January 2021
ER -