Computational Auditory Scene Analysis and Its Application to Robot Audition

Hiroshi G. Okuno*, Tetsuya Ogata, Kazunori Komatani, Kazuhiro Nakadai

*Corresponding author for this work

Research output: Conference contribution

15 Citations (Scopus)

Abstract

We are engaged in research on computational auditory scene analysis to attain sophisticated robot-human (or computer-human) interaction by recognizing auditory awareness. The objective of our research is to understand an arbitrary sound mixture, including non-speech sounds and music as well as voiced speech, obtained by a robot's ears (or microphones embedded in the robot). The main issues are sound source localization, separation, and recognition at the signal processing level, and signal-to-symbol transformation at the interface to the symbol processing level. The latter is critical in developmental communication, and we are developing an automatic onomatopoeia recognition system. This paper gives an overview of our activities in robot audition, in particular the active direction-pass filter (ADPF), which separates sounds originating from a specific direction by integrating sound source localization and visual processing. ADPF is implemented on three kinds of robots and is demonstrated separating and recognizing three simultaneous speech signals with a pair of microphones.

Original language: English
Title of host publication: Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004
Editors: T. Ibaraki, T. Inui, K. Tanaka
Pages: 73-80
Number of pages: 8
DOI
Publication status: Published - 2004 Dec 27
Externally published: Yes
Event: Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004 - Kyoto, Japan
Duration: 2004 Mar 1 → 2004 Mar 2

Publication series

Name: Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004

Conference

Conference: Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004
Country/Territory: Japan
City: Kyoto
Period: 04/3/1 → 04/3/2

ASJC Scopus subject areas

  • General Engineering
