Bayesian extension of MUSIC for sound source localization and tracking

Takuma Otsuka*, Kazuhiro Nakadai, Tetsuya Ogata, Hiroshi G. Okuno

*Corresponding author for this work

Research output: Conference article, peer-reviewed

9 citations (Scopus)

Abstract

This paper presents a Bayesian extension of a MUSIC-based sound source localization (SSL) and tracking method. SSL is important for distant speech enhancement and simultaneous speech separation for improving speech recognition, as well as for auditory scene analysis by mobile robots. One of the drawbacks of existing SSL methods is the need for careful parameter tuning, e.g., of the sound source detection threshold, which depends on the reverberation time and the number of sources. Our contribution consists of (1) automatic parameter estimation in the variational Bayesian framework and (2) tracking of sound sources with reliability. Experimental results demonstrate that our method robustly tracks multiple sound sources in a reverberant environment with RT20 = 840 ms.

Original language: English
Pages (from-to): 3109-3112
Number of pages: 4
Journal: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Publication status: Published - 1 Dec 2011
Externally published: Yes
Event: 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011 - Florence, Italy
Duration: 27 Aug 2011 to 31 Aug 2011

ASJC Scopus subject areas

  • Language and Linguistics
  • Human-Computer Interaction
  • Signal Processing
  • Software
  • Modeling and Simulation
