TY - GEN
T1 - Template-based Spectral Estimation using microphone array for speech recognition
AU - Tamura, Satoshi
AU - Hishikawa, Eriko
AU - Taguchi, Wataru
AU - Hayamizu, Satoru
PY - 2010
Y1 - 2010
N2 - This paper proposes a Template-based Spectral Estimation (TSE) method for noise reduction of microphone array processing aiming at speech recognition enhancement. In the proposed method, a noise template in a complex plane is calculated for each frequency bin using non-speech audio signals observed at microphones. Then for every noise-overlapped speech signals, a speech signal can be reformed by applying the template and the gradient descent method. Experiments were conducted to evaluate not only performance of noise reduction but also improvement of speech recognition. Then NRR 16.7dB improvement was achieved by combining TSE and Spectral Subtraction (SS) methods. For speech recognition, 44% relative recognition error reduction was obtained comparing with the conventional SS method.
AB - This paper proposes a Template-based Spectral Estimation (TSE) method for noise reduction of microphone array processing aiming at speech recognition enhancement. In the proposed method, a noise template in a complex plane is calculated for each frequency bin using non-speech audio signals observed at microphones. Then for every noise-overlapped speech signals, a speech signal can be reformed by applying the template and the gradient descent method. Experiments were conducted to evaluate not only performance of noise reduction but also improvement of speech recognition. Then NRR 16.7dB improvement was achieved by combining TSE and Spectral Subtraction (SS) methods. For speech recognition, 44% relative recognition error reduction was obtained comparing with the conventional SS method.
KW - Microphone array
KW - Noise reduction
KW - Spectral sub-truction
KW - Speech recognition enhancement
UR - http://www.scopus.com/inward/record.url?scp=79959832425&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79959832425&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:79959832425
T3 - Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
SP - 2050
EP - 2053
BT - Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010
PB - International Speech Communication Association
ER -