TY - JOUR
T1 - Source separation using multiple directivity patterns produced by ICA-based BSS
AU - Isa, Takashi
AU - Sekiya, Toshiyuki
AU - Ogawa, Tetsuji
AU - Kobayashi, Tetsunori
PY - 2006
Y1 - 2006
N2 - In this paper, we propose a multistage source separation method that combines blind source separation (BSS) based on independent component analysis (ICA) with segregation using multiple directivity patterns (SMDP), which was introduced in our previous paper. We obtain the directivity patterns needed in SMDP by ICA-based BSS. In SMDP, simultaneous equations for the amplitudes of the sound sources are generated from these multiple directivities. The solution of these equations gives good disturbance estimates. We apply spectral subtraction using these disturbance estimates to enhance the speech of the target source. We conducted experiments in a real room under the source-number-given condition, in which no a priori information about the sound sources or the room acoustics is available. The experimental results of double-talk recognition show that the proposed technique reduces the error rate by 30% compared to frequency-domain BSS.
AB - In this paper, we propose a multistage source separation method that combines blind source separation (BSS) based on independent component analysis (ICA) with segregation using multiple directivity patterns (SMDP), which was introduced in our previous paper. We obtain the directivity patterns needed in SMDP by ICA-based BSS. In SMDP, simultaneous equations for the amplitudes of the sound sources are generated from these multiple directivities. The solution of these equations gives good disturbance estimates. We apply spectral subtraction using these disturbance estimates to enhance the speech of the target source. We conducted experiments in a real room under the source-number-given condition, in which no a priori information about the sound sources or the room acoustics is available. The experimental results of double-talk recognition show that the proposed technique reduces the error rate by 30% compared to frequency-domain BSS.
UR - http://www.scopus.com/inward/record.url?scp=84862589822&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84862589822&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:84862589822
SN - 2219-5491
JO - European Signal Processing Conference
JF - European Signal Processing Conference
T2 - 14th European Signal Processing Conference, EUSIPCO 2006
Y2 - 4 September 2006 through 8 September 2006
ER -