Auditory fovea based speech enchancement and its application to human-robot dialog system

Kazuhiro Nakadai, Hiroshi G. Okuno, Hiroaki Kitano

研究成果: Conference contribution

抄録

This paper presents an active direction-pass filter (ADPF) that separates sound from a specified direction by using a pair of microphones. Its application to front-end processing for speech recognition is also reported. The ADPF improves sound source separation by accurate sound direction obtained by multi-modal integration and active motor control that keeps the robot facing to a sound source, because the resolution of the center direction is much higher than that of peripherals, indicating similar property of visual fovea. In order to recognize separated sound streams, a Hidden Markov Model (HMM) based automatic speech recognition is built with multiple acoustic models trained by the output of the ADPF under various conditions. The experimental results by a preliminary dialog system prove that it works well even when two speakers speak simultaneously.

本文言語English
ホスト出版物のタイトル7th International Conference on Spoken Language Processing, ICSLP 2002
出版社International Speech Communication Association
ページ1817-1820
ページ数4
出版ステータスPublished - 2002
外部発表はい
イベント7th International Conference on Spoken Language Processing, ICSLP 2002 - Denver, United States
継続期間: 2002 9月 162002 9月 20

Other

Other7th International Conference on Spoken Language Processing, ICSLP 2002
国/地域United States
CityDenver
Period02/9/1602/9/20

ASJC Scopus subject areas

  • 言語および言語学
  • 言語学および言語

フィンガープリント

「Auditory fovea based speech enchancement and its application to human-robot dialog system」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル