Rectified linear unit can assist Griffin-Lim phase recovery

Kohei Yatabe, Yoshiki Masuyama, Yasuhiro Oikawa

Research output: Chapter in Book/Report/Conference proceeding (Conference contribution)

13 Citations (Scopus)

Abstract

Phase recovery is an essential process for reconstructing a time-domain signal from the corresponding spectrogram when its phase is contaminated or unavailable. Recently, a phase recovery method using a deep neural network (DNN) was proposed, which interested us because the inverse short-time Fourier transform (inverse STFT) is used within the network. The inverse STFT converts a spectrogram into its time-domain counterpart, and then an activation function, the leaky rectified linear unit (ReLU), is applied. Such a nonlinear operation in the time domain resembles the speech enhancement method called harmonic regeneration noise reduction (HRNR). In HRNR, a time-domain nonlinearity, typically ReLU, is applied to help enhance the higher-order harmonics. From this point of view, one question arose: can time-domain ReLU alone assist phase recovery? Inspired by this curious connection between the recent DNN-based phase recovery method and HRNR in speech enhancement, this paper proposes the ReLU-assisted Griffin-Lim algorithm to investigate the above question. In a speech denoising experiment with the oracle Wiener filter, some positive effect of the time-domain nonlinearity is confirmed in terms of short-time objective intelligibility (STOI) scores.
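The idea in the abstract can be illustrated with a minimal sketch: a Griffin-Lim style iteration in which a (leaky) ReLU is applied to the time-domain signal between the inverse STFT and the forward STFT. This is one plausible reading only. The abstract does not state the paper's actual update rule, so the placement of the nonlinearity, the slope `alpha`, the window, the hop size, and all function names below are assumptions made for illustration, and the authors' algorithm may differ.

```python
import numpy as np

def leaky_relu(x, alpha=0.0):
    """Keep positive samples, scale negative ones by alpha (alpha=0 is plain ReLU)."""
    return np.where(x >= 0, x, alpha * x)

def stft(x, win, hop):
    """Windowed frames followed by a real FFT; result has shape (frames, bins)."""
    n = len(win)
    starts = range(0, len(x) - n + 1, hop)
    frames = np.stack([x[s:s + n] * win for s in starts])
    return np.fft.rfft(frames, axis=1)

def istft(X, win, hop, length):
    """Least-squares inverse: windowed overlap-add divided by the summed squared window."""
    n = len(win)
    frames = np.fft.irfft(X, n=n, axis=1) * win
    y = np.zeros(length)
    norm = np.zeros(length)
    for k, frame in enumerate(frames):
        y[k * hop:k * hop + n] += frame
        norm[k * hop:k * hop + n] += win ** 2
    return y / np.maximum(norm, 1e-8)  # guard samples that no window covers

def relu_griffin_lim(mag, win, hop, length, n_iter=50, alpha=0.0, seed=0):
    """Griffin-Lim with an assumed time-domain nonlinearity in each iteration.

    With alpha=1.0 the nonlinearity is the identity and this reduces to
    plain Griffin-Lim.
    """
    rng = np.random.default_rng(seed)
    X = mag * np.exp(2j * np.pi * rng.random(mag.shape))  # random initial phase
    for _ in range(n_iter):
        x = istft(X, win, hop, length)      # back to the time domain
        x = leaky_relu(x, alpha)            # ASSUMED placement of the ReLU
        Y = stft(x, win, hop)               # forward STFT of the modified signal
        X = mag * np.exp(1j * np.angle(Y))  # keep the new phase, restore the magnitude
    return istft(X, win, hop, length)

# Toy example: recover a two-tone signal from its magnitude spectrogram.
n, hop, length = 64, 16, 512
win = np.hanning(n + 1)[:-1]  # periodic Hann window
t = np.arange(length)
signal = np.sin(2 * np.pi * 0.05 * t) + 0.5 * np.sin(2 * np.pi * 0.1 * t)
mag = np.abs(stft(signal, win, hop))
estimate = relu_griffin_lim(mag, win, hop, length)
```

Setting `alpha=1.0` gives a plain Griffin-Lim baseline for comparison. The toy two-tone signal stands in for speech and is not related to the experiment reported in the paper.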

Original language: English
Title of host publication: 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 555-559
Number of pages: 5
ISBN (Electronic): 9781538681510
Publication status: Published - 2018 Nov 2
Event: 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Tokyo, Japan
Duration: 2018 Sept 17 to 2018 Sept 20

Publication series

Name: 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018 - Proceedings

Other

Other: 16th International Workshop on Acoustic Signal Enhancement, IWAENC 2018
Country/Territory: Japan
City: Tokyo
Period: 18/9/17 to 18/9/20

Keywords

  • Consistency
  • Harmonic regeneration
  • Redundancy
  • Spectrogram
  • Time-domain nonlinearity

ASJC Scopus subject areas

  • Signal Processing
  • Acoustics and Ultrasonics
