TY - JOUR
T1 - Time-Frequency-Bin-Wise Linear Combination of Beamformers for Distortionless Signal Enhancement
AU - Yamaoka, Kouei
AU - Ono, Nobutaka
AU - Makino, Shoji
N1 - Funding Information:
This work was supported in part by JSPS KAKENHI under Grants JP20H00613 and JP19J20420, and in part by JST CREST under Grant JPMJCR19A3, Japan.
Publisher Copyright:
© 2014 IEEE.
PY - 2021
Y1 - 2021
N2 - In this paper, we address signal enhancement in underdetermined situations and propose new beamforming algorithms. Beamforming in (over) determined situations can successfully reduce noise signals without distortion of a desired signal, which is known to be a desirable property, especially for automatic speech recognition systems. Even in underdetermined situations, time-frequency (TF) masking attains outstanding performance in noise reduction, although it tends to generate artifacts. Integrating these two approaches to benefit from both their advantages, we here propose time-frequency-bin-wise switching (TFS) and time-frequency-bin-wise linear combination (TFLC) beamforming. In the proposed methods, we utilize the best combination of beamformers among multiple beamformers at each TF bin, each of which suppresses a particular combination of interferers. First, we propose a general formulation of signal enhancement employing multiple spatial filters. Then a joint optimization problem of designing the spatial filters and estimating the suitable weights to combine them is considered under a unified minimum variance criterion. Finally, we present efficient algorithms to solve the problem. In experiments, we used an objective criterion that quantifies the amount of signal distortion caused by the enhancement function and confirmed that the proposed methods effectively suppress interferers without distortion of the target signal.
AB - In this paper, we address signal enhancement in underdetermined situations and propose new beamforming algorithms. Beamforming in (over) determined situations can successfully reduce noise signals without distortion of a desired signal, which is known to be a desirable property, especially for automatic speech recognition systems. Even in underdetermined situations, time-frequency (TF) masking attains outstanding performance in noise reduction, although it tends to generate artifacts. Integrating these two approaches to benefit from both their advantages, we here propose time-frequency-bin-wise switching (TFS) and time-frequency-bin-wise linear combination (TFLC) beamforming. In the proposed methods, we utilize the best combination of beamformers among multiple beamformers at each TF bin, each of which suppresses a particular combination of interferers. First, we propose a general formulation of signal enhancement employing multiple spatial filters. Then a joint optimization problem of designing the spatial filters and estimating the suitable weights to combine them is considered under a unified minimum variance criterion. Finally, we present efficient algorithms to solve the problem. In experiments, we used an objective criterion that quantifies the amount of signal distortion caused by the enhancement function and confirmed that the proposed methods effectively suppress interferers without distortion of the target signal.
KW - Beamforming
KW - linear combination
KW - nonlinear signal processing
KW - time-frequency masking
KW - underdetermined
UR - http://www.scopus.com/inward/record.url?scp=85120528886&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85120528886&partnerID=8YFLogxK
U2 - 10.1109/TASLP.2021.3126950
DO - 10.1109/TASLP.2021.3126950
M3 - Article
AN - SCOPUS:85120528886
SN - 2329-9290
VL - 29
SP - 3461
EP - 3475
JO - IEEE/ACM Transactions on Audio Speech and Language Processing
JF - IEEE/ACM Transactions on Audio Speech and Language Processing
ER -