TY - GEN
T1 - Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering
AU - Tawara, Naohiro
AU - Ogawa, Tetsuji
AU - Watanabe, Shinji
AU - Kobayashi, Tetsunori
PY - 2012
Y1 - 2012
N2 - This study aims to verify effective optimization methods for estimating parametric, fully Bayesian models in speech processing. For that purpose, we investigate the impact of the difference in optimization methods for the multi-scale Gaussian mixture model, which is suitable for speaker clustering, on the clustering accuracy. The Markov chain Monte Carlo (MCMC)-based method was compared with the variational Bayesian method in the speaker clustering experiment; with a small amount of data, the MCMC-based method was more effective; with large scale data (more than one million samples), the difference between these methods in terms of the clustering accuracy decreased and the MCMC-based method was computationally efficient.
AB - This study aims to verify effective optimization methods for estimating parametric, fully Bayesian models in speech processing. For that purpose, we investigate the impact of the difference in optimization methods for the multi-scale Gaussian mixture model, which is suitable for speaker clustering, on the clustering accuracy. The Markov chain Monte Carlo (MCMC)-based method was compared with the variational Bayesian method in the speaker clustering experiment; with a small amount of data, the MCMC-based method was more effective; with large scale data (more than one million samples), the difference between these methods in terms of the clustering accuracy decreased and the MCMC-based method was computationally efficient.
KW - Gibbs sampling
KW - Speaker clustering
KW - multi-scale Gaussian mixture model
KW - variational Bayesian method
UR - http://www.scopus.com/inward/record.url?scp=84867626020&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84867626020&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2012.6289105
DO - 10.1109/ICASSP.2012.6289105
M3 - Conference contribution
AN - SCOPUS:84867626020
SN - 9781467300469
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 5253
EP - 5256
BT - 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012 - Proceedings
T2 - 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012
Y2 - 25 March 2012 through 30 March 2012
ER -