TY - JOUR
T1 - A sampling-based speaker clustering using utterance-oriented Dirichlet process mixture model and its evaluation on large-scale data
AU - Tawara, Naohiro
AU - Ogawa, Tetsuji
AU - Watanabe, Shinji
AU - Nakamura, Atsushi
AU - Kobayashi, Tetsunori
N1 - Publisher Copyright: © 2015 The Authors.
PY - 2015/10/28
Y1 - 2015/10/28
AB - An infinite mixture model is applied to model-based speaker clustering with sampling-based optimization, making it possible to estimate the number of speakers. For this purpose, a non-parametric Bayesian modeling framework is implemented with Markov chain Monte Carlo and incorporated into the utterance-oriented speaker model. The proposed model is called the utterance-oriented Dirichlet process mixture model (UO-DPMM). The paper demonstrates that UO-DPMM can be applied successfully to large-scale data and outperforms conventional hierarchical agglomerative clustering, especially for large numbers of utterances.
KW - Gibbs sampling
KW - Non-parametric Bayesian model
KW - Sampling approach
KW - Speaker clustering
KW - Utterance-oriented Dirichlet process mixture model
UR - http://www.scopus.com/inward/record.url?scp=84949294383&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84949294383&partnerID=8YFLogxK
U2 - 10.1017/ATSIP.2015.19
DO - 10.1017/ATSIP.2015.19
M3 - Article
AN - SCOPUS:84949294383
SN - 2048-7703
VL - 4
JO - APSIPA Transactions on Signal and Information Processing
JF - APSIPA Transactions on Signal and Information Processing
ER -