Ensemble learning for speech enhancement

Jonathan Le Roux, Shinji Watanabe, John R. Hershey

研究成果: Conference contribution

18 被引用数 (Scopus)

抄録

Over the years, countless algorithms have been proposed to solve the problem of speech enhancement from a noisy mixture. Many have succeeded in improving at least parts of the signal, while often deteriorating others. Based on the assumption that different algorithms are likely to enjoy different qualities and suffer from different flaws, we investigate the possibility of combining the strengths of multiple speech enhancement algorithms, formulating the problem in an ensemble learning framework. As a first example of such a system, we consider the prediction of a time-frequency mask obtained from the clean speech, based on the outputs of various algorithms applied on the noisy mixture. We consider several approaches involving various notions of context and various machine learning algorithms for classification, in the case of binary masks, and regression, in the case of continuous masks. We show that combining several algorithms in this way can lead to an improvement in enhancement performance, while simple averaging or voting techniques fail to do so.

本文言語English
ホスト出版物のタイトル2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2013
DOI
出版ステータスPublished - 2013
外部発表はい
イベント2013 14th IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2013 - New Paltz, NY, United States
継続期間: 2013 10月 202013 10月 23

出版物シリーズ

名前IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Other

Other2013 14th IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2013
国/地域United States
CityNew Paltz, NY
Period13/10/2013/10/23

ASJC Scopus subject areas

  • 電子工学および電気工学
  • コンピュータ サイエンスの応用

フィンガープリント

「Ensemble learning for speech enhancement」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル