TY - JOUR
T1 - Fundamental limitation of frequency domain blind source separation for convolutive mixture of speech
AU - Araki, Shoko
AU - Makino, Shoji
AU - Nishikawa, Tsuyoki
AU - Saruwatari, Hiroshi
PY - 2001
Y1 - 2001
N2 - Despite several recent proposals to achieve Blind Source Separation (BSS) for realistic acoustic signal, separation performance is still not enough. In particular, when the length of impulse response is long, performance is highly limited. In this paper, we show it is useless to be constrained by the condition, P ≪ T, where T is the frame size of FFT and P is the length of room impulse response. From our experiments, a frame size of 256 or 512 (32 or 64 ms at a sampling frequency of 8 kHz) is best even for the long room reverberation of TR = 150 and 300 ms. We also clarified the reason for poor performance of BSS in long reverberant environment, finding that separation is achieved chiefly for the sound from the direction of jammer because BSS cannot calculate the inverse of the room transfer function both for the target and jammer signals.
AB - Despite several recent proposals to achieve Blind Source Separation (BSS) for realistic acoustic signal, separation performance is still not enough. In particular, when the length of impulse response is long, performance is highly limited. In this paper, we show it is useless to be constrained by the condition, P ≪ T, where T is the frame size of FFT and P is the length of room impulse response. From our experiments, a frame size of 256 or 512 (32 or 64 ms at a sampling frequency of 8 kHz) is best even for the long room reverberation of TR = 150 and 300 ms. We also clarified the reason for poor performance of BSS in long reverberant environment, finding that separation is achieved chiefly for the sound from the direction of jammer because BSS cannot calculate the inverse of the room transfer function both for the target and jammer signals.
UR - http://www.scopus.com/inward/record.url?scp=0034848298&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0034848298&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2001.940212
DO - 10.1109/ICASSP.2001.940212
M3 - Article
AN - SCOPUS:0034848298
SN - 0736-7791
VL - 5
SP - 2737
EP - 2740
JO - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
JF - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
ER -