TY - GEN
T1 - Exploiting known sound source signals to improve ICA-based robot audition in speech separation and recognition
AU - Takeda, Ryu
AU - Nakadai, Kazuhiro
AU - Komatani, Kazunori
AU - Ogata, Tetsuya
AU - Okuno, Hiroshi G.
PY - 2007/12/1
Y1 - 2007/12/1
N2 - This paper describes a new semi-blind source separation (semi-BSS) technique with independent component analysis (ICA) for enhancing a target source of interest and for suppressing other known interference sources. The semi-BSS technique is necessary for double-talk free robot audition systems in order to utilize known sound source signals such as self speech, music, or TV-sound, through a line-in or ubiquitous network. Unlike the conventional semi-BSS with ICA, we use the time-frequency domain convolution model to describe the reflection of the sound and a new mixing process of sounds for ICA. In other words, we consider that reflected sounds during some delay time are different from the original. ICA then separates the reflections as other interference sources. The model enables us to eliminate the frame size limitations of the frequency-domain ICA, and ICA can separate the known sources under a highly reverberative environment. Experimental results show that our method outperformed the conventional semi-BSS using ICA under simulated normal and highly reverberative environments.
AB - This paper describes a new semi-blind source separation (semi-BSS) technique with independent component analysis (ICA) for enhancing a target source of interest and for suppressing other known interference sources. The semi-BSS technique is necessary for double-talk free robot audition systems in order to utilize known sound source signals such as self speech, music, or TV-sound, through a line-in or ubiquitous network. Unlike the conventional semi-BSS with ICA, we use the time-frequency domain convolution model to describe the reflection of the sound and a new mixing process of sounds for ICA. In other words, we consider that reflected sounds during some delay time are different from the original. ICA then separates the reflections as other interference sources. The model enables us to eliminate the frame size limitations of the frequency-domain ICA, and ICA can separate the known sources under a highly reverberative environment. Experimental results show that our method outperformed the conventional semi-BSS using ICA under simulated normal and highly reverberative environments.
UR - http://www.scopus.com/inward/record.url?scp=51349166253&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51349166253&partnerID=8YFLogxK
U2 - 10.1109/IROS.2007.4399297
DO - 10.1109/IROS.2007.4399297
M3 - Conference contribution
AN - SCOPUS:51349166253
SN - 1424409128
SN - 9781424409129
T3 - IEEE International Conference on Intelligent Robots and Systems
SP - 1757
EP - 1762
BT - Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007
T2 - 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2007
Y2 - 29 October 2007 through 2 November 2007
ER -