TY - JOUR
T1 - A method for solving the permutation problem of frequency-domain BSS using reference signal
AU - Isa, Takashi
AU - Sekiya, Toshiyuki
AU - Ogawa, Tetsuji
AU - Kobayashi, Tetsunori
PY - 2006
Y1 - 2006
N2 - This paper presents a method for solving the permutation problem. This is a problem specific to frequency domain blind source separation within the framework of independent component analysis. Towards this problem, we propose a method which uses reference signals. For each frequency bin, the permutation alignment is fixed by calculating correlation coefficients between the reference signal and the separated signal. Reference signals are obtained as signals corresponding to each individual original sources. The reference signals are chosen or obtained subjectively, and do not need to be separated well. For example, the conventional beamforming technique gives suitable reference signals. To show the effectiveness of this method, we conducted a experiment of continuous speech recognition in a real room. The experimental results of double talk recognition with 20K vocabulary show that the proposed method is effective to achieve 20% error reduction rate compared with the established DOA-based approach.
AB - This paper presents a method for solving the permutation problem. This is a problem specific to frequency domain blind source separation within the framework of independent component analysis. Towards this problem, we propose a method which uses reference signals. For each frequency bin, the permutation alignment is fixed by calculating correlation coefficients between the reference signal and the separated signal. Reference signals are obtained as signals corresponding to each individual original sources. The reference signals are chosen or obtained subjectively, and do not need to be separated well. For example, the conventional beamforming technique gives suitable reference signals. To show the effectiveness of this method, we conducted a experiment of continuous speech recognition in a real room. The experimental results of double talk recognition with 20K vocabulary show that the proposed method is effective to achieve 20% error reduction rate compared with the established DOA-based approach.
UR - http://www.scopus.com/inward/record.url?scp=84862630399&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84862630399&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:84862630399
SN - 2219-5491
JO - European Signal Processing Conference
JF - European Signal Processing Conference
T2 - 14th European Signal Processing Conference, EUSIPCO 2006
Y2 - 4 September 2006 through 8 September 2006
ER -