TY - JOUR
T1 - Joint Amplitude and Phase Refinement for Monaural Source Separation
AU - Masuyama, Yoshiki
AU - Yatabe, Kohei
AU - Nagatomo, Kento
AU - Oikawa, Yasuhiro
N1 - Publisher Copyright:
© 1994-2012 IEEE.
PY - 2020
Y1 - 2020
N2 - Monaural source separation is often conducted by manipulating the amplitude spectrogram of a mixture (e.g., via time-frequency masking and spectral subtraction). The obtained amplitudes are converted back to the time domain by using the phase of the mixture or by applying phase reconstruction. Although phase reconstruction performs well for the true amplitudes, its performance is degraded when the amplitudes contain error. To deal with this problem, we propose an optimization-based method to refine both amplitudes and phases based on the given amplitudes. It aims to find time-domain signals whose amplitude spectrograms are close to the given ones in terms of the generalized alpha-beta divergences. To solve the optimization problem, the alternating direction method of multipliers (ADMM) is utilized. We confirmed the effectiveness of the proposed method through speech-nonspeech separation in various conditions.
AB - Monaural source separation is often conducted by manipulating the amplitude spectrogram of a mixture (e.g., via time-frequency masking and spectral subtraction). The obtained amplitudes are converted back to the time domain by using the phase of the mixture or by applying phase reconstruction. Although phase reconstruction performs well for the true amplitudes, its performance is degraded when the amplitudes contain error. To deal with this problem, we propose an optimization-based method to refine both amplitudes and phases based on the given amplitudes. It aims to find time-domain signals whose amplitude spectrograms are close to the given ones in terms of the generalized alpha-beta divergences. To solve the optimization problem, the alternating direction method of multipliers (ADMM) is utilized. We confirmed the effectiveness of the proposed method through speech-nonspeech separation in various conditions.
KW - Phase reconstruction
KW - alpha-beta divergences
KW - mixture consistency
KW - spectrogram consistency
UR - http://www.scopus.com/inward/record.url?scp=85096357001&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85096357001&partnerID=8YFLogxK
U2 - 10.1109/LSP.2020.3031464
DO - 10.1109/LSP.2020.3031464
M3 - Article
AN - SCOPUS:85096357001
SN - 1070-9908
VL - 27
SP - 1939
EP - 1943
JO - IEEE Signal Processing Letters
JF - IEEE Signal Processing Letters
M1 - 9226071
ER -