TY - JOUR
T1 - The fifth 'CHiME' speech separation and recognition challenge
T2 - 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018
AU - Barker, Jon
AU - Watanabe, Shinji
AU - Vincent, Emmanuel
AU - Trmal, Jan
N1 - Funding Information:
We would like to thank Google for funding the full data collection and annotation, Microsoft Research for providing Kinects, and Microsoft India for sponsoring the 5th ‘CHiME’ Workshop. E. Vincent acknowledges support from the French National Research Agency in the framework of the project VOCADOM “Robust voice command adapted to the user and to the context for AAL” (ANR-16-CE33-0006).
Publisher Copyright:
© 2018 International Speech Communication Association. All rights reserved.
PY - 2018
Y1 - 2018
N2 - The CHiME challenge series aims to advance robust automatic speech recognition (ASR) technology by promoting research at the interface of speech and language processing, signal processing, and machine learning. This paper introduces the 5th CHiME Challenge, which considers the task of distant multi-microphone conversational ASR in real home environments. Speech material was elicited using a dinner party scenario with efforts taken to capture data that is representative of natural conversational speech and recorded by 6 Kinect microphone arrays and 4 binaural microphone pairs. The challenge features a single-array track and a multiple-array track and, for each track, distinct rankings will be produced for systems focusing on robustness with respect to distant-microphone capture vs. systems attempting to address all aspects of the task including conversational language modeling. We discuss the rationale for the challenge and provide a detailed description of the data collection procedure, the task, and the baseline systems for array synchronization, speech enhancement, and conventional and end-to-end ASR.
AB - The CHiME challenge series aims to advance robust automatic speech recognition (ASR) technology by promoting research at the interface of speech and language processing, signal processing, and machine learning. This paper introduces the 5th CHiME Challenge, which considers the task of distant multi-microphone conversational ASR in real home environments. Speech material was elicited using a dinner party scenario with efforts taken to capture data that is representative of natural conversational speech and recorded by 6 Kinect microphone arrays and 4 binaural microphone pairs. The challenge features a single-array track and a multiple-array track and, for each track, distinct rankings will be produced for systems focusing on robustness with respect to distant-microphone capture vs. systems attempting to address all aspects of the task including conversational language modeling. We discuss the rationale for the challenge and provide a detailed description of the data collection procedure, the task, and the baseline systems for array synchronization, speech enhancement, and conventional and end-to-end ASR.
KW - 'CHiME' challenge
KW - Conversational speech
KW - Microphone array
KW - Noise
KW - Reverberation
KW - Robust ASR
UR - http://www.scopus.com/inward/record.url?scp=85054986374&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85054986374&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2018-1768
DO - 10.21437/Interspeech.2018-1768
M3 - Conference article
AN - SCOPUS:85054986374
SN - 2308-457X
VL - 2018-September
SP - 1561
EP - 1565
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Y2 - 2 September 2018 through 6 September 2018
ER -