Abstract
Principal points for binary distributions can be defined based on Flury's (1990) principal points. However, finding principal points for binary distributions directly is difficult. In this article, a method for approximating principal points for binary distributions is proposed by formulating the task as an uncapacitated location problem. Moreover, it is shown that the problem of finding principal points can be solved with the aid of submodular functions, which leads to a solution whose value is at least (1 - 1/e) times the optimal value.
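The abstract does not spell out the algorithm, so the following is only a minimal sketch of how such a bound typically arises, not the article's method. It assumes that candidate points are restricted to {0,1}^p, that distance is Hamming distance (which equals squared Euclidean distance for binary vectors), and that the selection is made by the standard greedy rule for maximizing a monotone submodular function under a cardinality constraint, for which the (1 - 1/e) guarantee is the classical result. The function name `greedy_principal_points` and its arguments are hypothetical.

```python
from itertools import product


def hamming(a, b):
    """Number of coordinates in which two binary tuples differ."""
    return sum(x != y for x, y in zip(a, b))


def greedy_principal_points(support, probs, k):
    """Greedily choose k points from {0,1}^p (assumed candidate set).

    support: list of binary tuples with positive probability
    probs:   matching list of probabilities
    Returns the chosen points and the expected Hamming distance
    from a random draw to its nearest chosen point.
    """
    p = len(support[0])
    candidates = list(product((0, 1), repeat=p))
    # Benefit of serving x from c is p - hamming(x, c) >= 0, so the total
    # benefit is a facility-location function: monotone and submodular.
    best = [0] * len(support)  # benefit of the empty set is 0
    chosen = []
    for _ in range(k):
        def gain(c):
            # Marginal increase in expected benefit if c is added.
            return sum(
                w * max(0, (p - hamming(x, c)) - b)
                for x, w, b in zip(support, probs, best)
            )

        c_star = max((c for c in candidates if c not in chosen), key=gain)
        chosen.append(c_star)
        best = [max(b, p - hamming(x, c_star)) for x, b in zip(support, best)]
    expected_distance = sum(w * (p - b) for w, b in zip(probs, best))
    return chosen, expected_distance
```

Note that under these assumptions the (1 - 1/e) factor applies to the expected benefit being maximized, not directly to the expected distance. Enumerating all 2^p candidates is feasible only for small p; restricting candidates to the observed support is one possible alternative.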
Original language | English |
---|---|
Pages (from-to) | 2291-2309 |
Number of pages | 19 |
Journal | Communications in Statistics - Theory and Methods |
Volume | 44 |
Issue number | 11 |
Publication status | Published - 2015 Jun 3 |
Keywords
- Clustering
- Data analysis
- Multivariate binary distribution
- Uncapacitated location problem
ASJC Scopus subject areas
- Statistics and Probability