TY - GEN
T1 - Computational Auditory Scene Analysis and Its Application to Robot Audition
AU - Okuno, Hiroshi G.
AU - Ogata, Tetsuya
AU - Komatani, Kazunori
AU - Nakadai, Kazuhiro
PY - 2004/12/27
Y1 - 2004/12/27
N2 - We are engaged in research on computational auditory scene analysis to attain sophisticated robot (computer) human interaction by recognizing auditory awareness. The objective of our research is the understanding of an arbitrary sound mixture including non-speech sounds and music as well as voiced speech, obtained by robot's ears (or microphones embedded in the robot). The main issues are sound source localization, separation, and recognition at signal processing levels, and signal-to-symbol transformation at the interface level to symbol processing levels. The latter is critical in developmental communication and we are developing an automatic onomatopoeia recognition system. This paper overviews our activities in robot audition, in particular, active direction-pass filter (ADPF) that separates sounds originating from a specific direction by integrating sound source localization and visual processing. ADPF is implemented on three kinds of robots and demonstrates separating and recognizing three simultaneous speeches with a pair of microphones.
AB - We are engaged in research on computational auditory scene analysis to attain sophisticated robot (computer) human interaction by recognizing auditory awareness. The objective of our research is the understanding of an arbitrary sound mixture including non-speech sounds and music as well as voiced speech, obtained by robot's ears (or microphones embedded in the robot). The main issues are sound source localization, separation, and recognition at signal processing levels, and signal-to-symbol transformation at the interface level to symbol processing levels. The latter is critical in developmental communication and we are developing an automatic onomatopoeia recognition system. This paper overviews our activities in robot audition, in particular, active direction-pass filter (ADPF) that separates sounds originating from a specific direction by integrating sound source localization and visual processing. ADPF is implemented on three kinds of robots and demonstrates separating and recognizing three simultaneous speeches with a pair of microphones.
UR - http://www.scopus.com/inward/record.url?scp=10444249505&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=10444249505&partnerID=8YFLogxK
U2 - 10.1109/ICKS.2004.1313411
DO - 10.1109/ICKS.2004.1313411
M3 - Conference contribution
AN - SCOPUS:10444249505
SN - 0769521509
SN - 9780769521503
T3 - Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004
SP - 73
EP - 80
BT - Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004
A2 - Ibaraki, T.
A2 - Inui, T.
A2 - Tanaka, K.
T2 - Proceedings - International Conference on Informatics Research for Development of Knowledge Society Infrastructure, ICKS 2004
Y2 - 1 March 2004 through 2 March 2004
ER -