TY - GEN
T1 - Deep recurrent de-noising auto-encoder and blind de-reverberation for reverberated speech recognition
AU - Weninger, Felix
AU - Watanabe, Shinji
AU - Tachioka, Yuuki
AU - Schuller, Bjorn
PY - 2014
Y1 - 2014
N2 - This paper describes our joint efforts to provide robust automatic speech recognition (ASR) for reverberated environments, such as in hands-free human-machine interaction. We investigate blind feature space de-reverberation and deep recurrent de-noising auto-encoders (DAE) in an early fusion scheme. Results on the 2014 REVERB Challenge development set indicate that the DAE front-end provides complementary performance gains to multi-condition training, feature transformations, and model adaptation. The proposed ASR system achieves word error rates of 17.62 % and 36.6 % on simulated and real data, which is a significant improvement over the Challenge baseline (25.16 and 47.2 %).
AB - This paper describes our joint efforts to provide robust automatic speech recognition (ASR) for reverberated environments, such as in hands-free human-machine interaction. We investigate blind feature space de-reverberation and deep recurrent de-noising auto-encoders (DAE) in an early fusion scheme. Results on the 2014 REVERB Challenge development set indicate that the DAE front-end provides complementary performance gains to multi-condition training, feature transformations, and model adaptation. The proposed ASR system achieves word error rates of 17.62 % and 36.6 % on simulated and real data, which is a significant improvement over the Challenge baseline (25.16 and 47.2 %).
KW - De-reverberation
KW - automatic speech recognition
KW - feature enhancement
KW - recurrent neural networks
UR - http://www.scopus.com/inward/record.url?scp=84905216003&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84905216003&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2014.6854478
DO - 10.1109/ICASSP.2014.6854478
M3 - Conference contribution
AN - SCOPUS:84905216003
SN - 9781479928927
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 4623
EP - 4627
BT - 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014
Y2 - 4 May 2014 through 9 May 2014
ER -