Real-time sound source localization and separation for robot audition

Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano

Research output: Conference contribution

81 Citations (Scopus)

Abstract

Robot audition in the real world should cope with environment noises and reverberation and motor noises caused by the robot's own movements. This paper presents the active direction-pass filter (ADPF) to separate sounds originating from the specified direction with a pair of microphones. The ADPF is implemented by hierarchical integration of visual and auditory processing with hypothetical reasoning on interaural phase difference (IPD) and interaural intensity difference (IID) for each subband. In creating hypotheses, the reference data of IPD and IID is calculated by the auditory epipolar geometry on demand. Since the performance of the ADPF depends on the direction, the ADPF controls the direction by motor movement. The human tracking and sound source separation based on the ADPF is implemented on an upper-torso humanoid and runs in real-time with 4 PCs connected over Gigabit ethernet. The signal-to-noise ratio (SNR) of each sound separated by the ADPF from a mixture of two speeches with the same loudness is improved to about 10 dB from 0 dB.
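The IPD-matching step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a free-field, far-field two-microphone model in place of the paper's auditory epipolar geometry, it omits IID and the visual integration, and the names `expected_ipd`, `wrap_phase`, `direction_pass_mask`, `MIC_DISTANCE` and the tolerance parameter are hypothetical. Azimuth is taken as positive toward the left microphone.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at room temperature
MIC_DISTANCE = 0.18     # m, hypothetical spacing between the two microphones


def expected_ipd(freq_hz, azimuth_rad):
    """IPD in radians predicted by a simplified free-field, far-field model.

    This stands in for the reference data the paper derives from
    auditory epipolar geometry; it is an assumption, not the paper's formula.
    """
    delay = MIC_DISTANCE * math.sin(azimuth_rad) / SPEED_OF_SOUND
    return 2.0 * math.pi * freq_hz * delay


def wrap_phase(phi):
    """Wrap an angle in radians into the range [-pi, pi]."""
    return math.atan2(math.sin(phi), math.cos(phi))


def direction_pass_mask(left_spec, right_spec, freqs_hz, azimuth_rad, tol_rad):
    """Return a 1/0 mask per subband.

    A subband is kept (1) when its measured IPD, phase(left) - phase(right),
    lies within tol_rad of the IPD expected for the target azimuth.
    """
    mask = []
    for left, right, freq in zip(left_spec, right_spec, freqs_hz):
        measured = cmath.phase(left * right.conjugate())
        error = wrap_phase(measured - expected_ipd(freq, azimuth_rad))
        mask.append(1 if abs(error) <= tol_rad else 0)
    return mask
```

Multiplying one channel's spectrum by the mask and inverting the transform would yield the separated signal for that direction. At higher frequencies the phase difference wraps and becomes ambiguous, which is one reason the abstract's method also draws on IID.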

Original language: English
Title of host publication: 7th International Conference on Spoken Language Processing, ICSLP 2002
Publisher: International Speech Communication Association
Pages: 193-196
Number of pages: 4
Publication status: Published - 2002
Externally published: Yes
Event: 7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States
Duration: 16 Sep 2002 to 20 Sep 2002

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
