TY - JOUR
T1 - Data selection by sequence summarizing neural network in mismatch condition training
AU - Žmolíková, Kateřina
AU - Karafiát, Martin
AU - Veselý, Karel
AU - Delcroix, Marc
AU - Watanabe, Shinji
AU - Burget, Lukáš
AU - Černocký, Jan
N1 - Publisher Copyright: Copyright © 2016 ISCA.
PY - 2016
Y1 - 2016
AB - Data augmentation is a simple and efficient technique to improve the robustness of a speech recognizer when deployed in mismatched training-test conditions. Our paper proposes a new approach for selecting data with respect to similarity of acoustic conditions. The similarity is computed based on a sequence summarizing neural network which extracts vectors containing acoustic summary (e.g. noise and reverberation characteristics) of an utterance. Several configurations of this network and different methods of selecting data using these "summary-vectors" were explored. The results are reported on a mismatched condition using AMI training set with the proposed data selection and CHiME3 test set.
KW - Automatic speech recognition
KW - Data augmentation
KW - Data selection
KW - Mismatch training condition
KW - Sequence summarization
UR - http://www.scopus.com/inward/record.url?scp=84994382229&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84994382229&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2016-741
DO - 10.21437/Interspeech.2016-741
M3 - Conference article
AN - SCOPUS:84994382229
SN - 2308-457X
VL - 08-12-September-2016
SP - 2354
EP - 2358
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016
Y2 - 8 September 2016 through 12 September 2016
ER -