TY - GEN
T1 - Comparative Study on DNN-based Minimum Variance Beamforming Robust to Small Movements of Sound Sources
AU - Saijo, Kohei
AU - Katagiri, Kazuhiro
AU - Fujieda, Masaru
AU - Kobayashi, Tetsunori
AU - Ogawa, Tetsuji
N1 - Publisher Copyright:
© 2021 APSIPA.
PY - 2021
Y1 - 2021
N2 - This paper discusses a deep neural network (DNN)-based minimum variance (MV) beamformer suitable for the case where the target sound source moves slightly in front of the microphones. In practical applications of speech enhancement, such as a guidance terminal installed in a train station, the target sound source can be assumed to be located approximately in front of the microphones, although it may move slightly. Speech enhancement techniques used under such conditions can be classified into two types: one is to enhance the sound source while adaptively estimating its location, and the other is to enhance the area in front of the microphone array. The former requires localization of the target source but has a high degree of freedom of the beamformer, which can lead to high noise suppression performance, while the latter does not require the source localization but has a low degree of freedom of the beamformer. Speech enhancement experiments conducted to compare the performance of these approaches demonstrated that the MV beamformer based on adaptive sound source localization can provide more accurate enhancement than that based on area enhancement even when the sound source is moving.
AB - This paper discusses a deep neural network (DNN)-based minimum variance (MV) beamformer suitable for the case where the target sound source moves slightly in front of the microphones. In practical applications of speech enhancement, such as a guidance terminal installed in a train station, the target sound source can be assumed to be located approximately in front of the microphones, although it may move slightly. Speech enhancement techniques used under such conditions can be classified into two types: one is to enhance the sound source while adaptively estimating its location, and the other is to enhance the area in front of the microphone array. The former requires localization of the target source but has a high degree of freedom of the beamformer, which can lead to high noise suppression performance, while the latter does not require the source localization but has a low degree of freedom of the beamformer. Speech enhancement experiments conducted to compare the performance of these approaches demonstrated that the MV beamformer based on adaptive sound source localization can provide more accurate enhancement than that based on area enhancement even when the sound source is moving.
UR - http://www.scopus.com/inward/record.url?scp=85126643202&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85126643202&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85126643202
T3 - 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings
SP - 603
EP - 607
BT - 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021
Y2 - 14 December 2021 through 17 December 2021
ER -