Conferencingspeech Challenge: Towards Far-Field Multi-Channel Speech Enhancement for Video Conferencing

Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng Hua Tan, Hui Bu, Tao Yu, Shidong Shang

研究成果: Conference contribution

8 被引用数 (Scopus)

抄録

The ConferencingSpeech 2021 challenge is proposed to stimulate research on far-field multi-channel speech enhancement for video conferencing. The challenge consists of two separate tasks: 1) Task 1 is multi-channel speech enhancement with single microphone array and focusing on practical application with real-time requirement and 2) Task 2 is multi-channel speech enhancement with multiple distributed micro-phone arrays, which is a non-real-time track and does not have any constraints so that participants could explore any algorithms to obtain high speech quality. Targeting the real video conferencing room application, the challenge database was recorded from real speakers and all recording facilities were located by following the real setup of conferencing room. In this challenge, we open-sourced the list of open source clean speech and noise datasets, simulation scripts, and a baseline system for participants to develop their own system. The final ranking of the challenge will be decided by the subjective evaluation which is performed using Absolute Category Ratings (ACR) to estimate Mean Opinion Score (MOS), speech MOS (S-MOS), and noise MOS (N-MOS). This paper describes the challenge, tasks, datasets, subjective evaluation, and challenge results. The baseline system which is a complex ratio mask based neural network and its experimental results are also presented.

本文言語English
ホスト出版物のタイトル2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021 - Proceedings
出版社Institute of Electrical and Electronics Engineers Inc.
ページ679-686
ページ数8
ISBN(電子版)9781665437394
DOI
出版ステータスPublished - 2021
外部発表はい
イベント2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021 - Cartagena, Colombia
継続期間: 2021 12月 132021 12月 17

出版物シリーズ

名前2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021 - Proceedings

Conference

Conference2021 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2021
国/地域Colombia
CityCartagena
Period21/12/1321/12/17

ASJC Scopus subject areas

  • コンピュータ ビジョンおよびパターン認識
  • 信号処理
  • 言語学および言語

フィンガープリント

「Conferencingspeech Challenge: Towards Far-Field Multi-Channel Speech Enhancement for Video Conferencing」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル