TY - GEN
T1 - Investigation of network architecture for single-channel end-to-end denoising
AU - Hasumi, Takuya
AU - Kobayashi, Tetsunori
AU - Ogawa, Tetsuji
N1 - Publisher Copyright:
© 2021 European Signal Processing Conference, EUSIPCO. All rights reserved.
PY - 2021/1/24
Y1 - 2021/1/24
N2 - This paper examines the effectiveness of a fully convolutional time-domain audio separation network (Conv-TasNet) on single-channel denoising. Conv-TasNet, which has a structure to explicitly estimate a mask for encoded features, has shown to be effective in single-channel sound source separation in noise-free environments, but it has not been applied to denoising. Therefore, the present study investigates a method of learning Conv-TasNet for denoising and clarifies the optimal structure for single-channel end-to-end modeling. Experimental comparisons conducted using the CHiME-3 dataset demonstrate that Conv-TasNet performs well in denoising and yields improvements in single-channel end-to-end denoising over existing denoising autoencoder-based modeling.
AB - This paper examines the effectiveness of a fully convolutional time-domain audio separation network (Conv-TasNet) on single-channel denoising. Conv-TasNet, which has a structure to explicitly estimate a mask for encoded features, has shown to be effective in single-channel sound source separation in noise-free environments, but it has not been applied to denoising. Therefore, the present study investigates a method of learning Conv-TasNet for denoising and clarifies the optimal structure for single-channel end-to-end modeling. Experimental comparisons conducted using the CHiME-3 dataset demonstrate that Conv-TasNet performs well in denoising and yields improvements in single-channel end-to-end denoising over existing denoising autoencoder-based modeling.
KW - End-to-end modeling
KW - Fully convolutional time-domain audio separation network
KW - Single-channel denoising
KW - Speech recognition
KW - Time-domain convolutional denoising autoencoders
UR - http://www.scopus.com/inward/record.url?scp=85099301758&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85099301758&partnerID=8YFLogxK
U2 - 10.23919/Eusipco47968.2020.9287753
DO - 10.23919/Eusipco47968.2020.9287753
M3 - Conference contribution
AN - SCOPUS:85099301758
T3 - European Signal Processing Conference
SP - 441
EP - 445
BT - 28th European Signal Processing Conference, EUSIPCO 2020 - Proceedings
PB - European Signal Processing Conference, EUSIPCO
T2 - 28th European Signal Processing Conference, EUSIPCO 2020
Y2 - 24 August 2020 through 28 August 2020
ER -