TY - JOUR
T1 - Spatial filter calibration based on minimization of modified LSD
AU - Tanaka, Nobuaki
AU - Ogawa, Tetsuji
AU - Kobayashi, Tetsunori
PY - 2011/12/1
Y1 - 2011/12/1
N2 - A new sound source separation method has been developed that is robust against individual variability in microphones and acoustic lines. A specific area containing a target sound source is enhanced by a spatial filter constructed with time-frequency masking. However, such spatial filters are likely to be distorted by individual variability in microphone characteristics and acoustic lines. To solve this problem, calibration of the spatial filters' shapes was attempted using a modified log-spectral distance (MLSD) minimization criterion, which uses utterances made by each individual (i.e., a sound source) at the desired positions. The effectiveness of this spatial filter calibration was verified in speech recognition experiments: MLSD-based calibration yielded fewer word errors than either no calibration or calibration using other criteria.
KW - Modified LSD
KW - Sound source separation
KW - Spatial filter calibration
KW - Time-frequency masking
UR - http://www.scopus.com/inward/record.url?scp=84865714069&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84865714069&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:84865714069
SN - 2308-457X
SP - 1761
EP - 1764
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 12th Annual Conference of the International Speech Communication Association, INTERSPEECH 2011
Y2 - 27 August 2011 through 31 August 2011
ER -