TY - JOUR
T1 - Bayesian extension of MUSIC for sound source localization and tracking
AU - Otsuka, Takuma
AU - Nakadai, Kazuhiro
AU - Ogata, Tetsuya
AU - Okuno, Hiroshi G.
PY - 2011/12/1
Y1 - 2011/12/1
N2 - This paper presents a Bayesian extension of MUSIC-based sound source localization (SSL) and tracking method. SSL is important for distant speech enhancement and simultaneous speech separation for improving speech recognition, as well as for auditory scene analysis by mobile robots. One of the drawbacks of existing SSL methods is the necessity of careful parameter tunings, e.g., the sound source detection threshold depending on the reverberation time and the number of sources. Our contribution consists of (1) automatic parameter estimation in the variational Bayesian framework and (2) tracking of sound sources with reliability. Experimental results demonstrate our method robustly tracks multiple sound sources in a reverberant environment with RT20 = 840 (ms).
AB - This paper presents a Bayesian extension of MUSIC-based sound source localization (SSL) and tracking method. SSL is important for distant speech enhancement and simultaneous speech separation for improving speech recognition, as well as for auditory scene analysis by mobile robots. One of the drawbacks of existing SSL methods is the necessity of careful parameter tunings, e.g., the sound source detection threshold depending on the reverberation time and the number of sources. Our contribution consists of (1) automatic parameter estimation in the variational Bayesian framework and (2) tracking of sound sources with reliability. Experimental results demonstrate our method robustly tracks multiple sound sources in a reverberant environment with RT20 = 840 (ms).
KW - MUSIC algorithm
KW - Particle filter
KW - Simultaneous sound source localization
KW - Variational Bayes
UR - http://www.scopus.com/inward/record.url?scp=84865790519&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84865790519&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:84865790519
SN - 2308-457X
SP - 3109
EP - 3112
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
Y2 - 27 August 2011 through 31 August 2011
ER -