Gamma Boltzmann Machine for Simultaneously Modeling Linear- And Log-amplitude Spectra

Toru Nakashika, Kohei Yatabe

研究成果: Conference contribution

1 被引用数 (Scopus)

抄録

In audio applications, one of the most important representations of audio signals is the amplitude spectrogram. It is utilized in many machine-learning-based information processing methods including the ones using the restricted Boltzmann machines (RBM). However, the ordinary Gaussian-Bernoulli RBM (the most popular RBM among its variations) cannot directly handle amplitude spectra because the Gaussian distribution is a symmetric model allowing negative values which never appear in the amplitude. In this paper, after proposing a general gamma Boltzmann machine, we propose a practical model called the gamma-Bernoulli RBM that simultaneously handles both linearand log-amplitude spectrograms. Its conditional distribution of the observable data is given by the gamma distribution, and thus the proposed RBM can naturally handle the data represented by positive numbers as the amplitude spectra. It can also treat amplitude in the logarithmic scale which is important for audio signals from the perceptual point of view. The advantage of the proposed model compared to the ordinary Gaussian-Bernoulli RBM was confirmed by PESQ and MSE in the experiment of representing the amplitude spectrograms of speech signals.

本文言語English
ホスト出版物のタイトル2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Proceedings
出版社Institute of Electrical and Electronics Engineers Inc.
ページ471-476
ページ数6
ISBN(電子版)9789881476883
出版ステータスPublished - 2020 12月 7
イベント2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Virtual, Auckland, New Zealand
継続期間: 2020 12月 72020 12月 10

出版物シリーズ

名前2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020 - Proceedings

Conference

Conference2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2020
国/地域New Zealand
CityVirtual, Auckland
Period20/12/720/12/10

ASJC Scopus subject areas

  • 人工知能
  • コンピュータ ネットワークおよび通信
  • コンピュータ ビジョンおよびパターン認識
  • ハードウェアとアーキテクチャ
  • 信号処理
  • 決定科学(その他)
  • 器械工学

フィンガープリント

「Gamma Boltzmann Machine for Simultaneously Modeling Linear- And Log-amplitude Spectra」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル