TY - JOUR
T1 - Supervised determined source separation with multichannel variational autoencoder
AU - Kameoka, Hirokazu
AU - Li, Li
AU - Inoue, Shota
AU - Makino, Shoji
N1 - Funding Information:
This work was supported by JSPS KAKENHI 17H01763.
Publisher Copyright:
© 2019 Massachusetts Institute of Technology.
PY - 2019/9/1
Y1 - 2019/9/1
N2 - This letter proposes a multichannel source separation technique, the multichannel variational autoencoder (MVAE) method, which uses a conditional VAE (CVAE) to model and estimate the power spectrograms of the sources in a mixture. By training the CVAE using the spectrograms of training examples with source-class labels, we can use the trained decoder distribution as a universal generative model capable of generating spectrograms conditioned on a specified class index. By treating the latent space variables and the class index as the unknown parameters of this generative model, we can develop a convergence-guaranteed algorithm for supervised determined source separation that consists of iter-atively estimating the power spectrograms of the underlying sources, as well as the separation matrices. In experimental evaluations, our MVAE produced better separation performance than a baseline method.
AB - This letter proposes a multichannel source separation technique, the multichannel variational autoencoder (MVAE) method, which uses a conditional VAE (CVAE) to model and estimate the power spectrograms of the sources in a mixture. By training the CVAE using the spectrograms of training examples with source-class labels, we can use the trained decoder distribution as a universal generative model capable of generating spectrograms conditioned on a specified class index. By treating the latent space variables and the class index as the unknown parameters of this generative model, we can develop a convergence-guaranteed algorithm for supervised determined source separation that consists of iter-atively estimating the power spectrograms of the underlying sources, as well as the separation matrices. In experimental evaluations, our MVAE produced better separation performance than a baseline method.
UR - http://www.scopus.com/inward/record.url?scp=85071355196&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85071355196&partnerID=8YFLogxK
U2 - 10.1162/neco_a_01217
DO - 10.1162/neco_a_01217
M3 - Letter
C2 - 31335290
AN - SCOPUS:85071355196
SN - 0899-7667
VL - 31
SP - 1891
EP - 1914
JO - Neural Computation
JF - Neural Computation
IS - 9
ER -