Reducing algorithmic delay using low-overlap window for online Wave-U-Net

Sotaro Nakaoka, Li Li, Shoji Makino, Takeshi Yamada

研究成果: Conference contribution

1 被引用数 (Scopus)

抄録

Wave-U-Net is an end-to-end single-channel source separation method that works in the time domain and thus can take the phase information into account during separation. It has shown high performance in tasks such as singing voice separation and speech enhancement. We previously proposed an extension of Wave-U-Net to online processing with a short input using teacher-student learning. Since online Wave-U-Net processes input signals frame-by-frame, where the frames are segmented by applying a window function, the window length is generally the lower bound of the algorithmic delay. In this paper, based on the fact that the separation performance of online Wave-U-Net is concentrated at the center of the segment, we propose to reduce the algorithmic delay by applying windows with a zero region near the edges into the online Wave-U-Net. Experimental results showed that the proposed method reduced the algorithmic delay by 40% of that of the conventional method while keeping the high speech enhancement performance with source-to-distortion ratio improvement of about 15 dB, thus enabling low-delay and high-performance speech enhancement.

本文言語English
ホスト出版物のタイトル2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings
出版社Institute of Electrical and Electronics Engineers Inc.
ページ1210-1214
ページ数5
ISBN(電子版)9789881476890
出版ステータスPublished - 2021
イベント2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Tokyo, Japan
継続期間: 2021 12月 142021 12月 17

出版物シリーズ

名前2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings

Conference

Conference2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021
国/地域Japan
CityTokyo
Period21/12/1421/12/17

ASJC Scopus subject areas

  • 人工知能
  • コンピュータ ビジョンおよびパターン認識
  • 信号処理
  • 器械工学

フィンガープリント

「Reducing algorithmic delay using low-overlap window for online Wave-U-Net」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル