TY - JOUR
T1 - Online directional speech enhancement using geometrically constrained independent vector analysis
AU - Li, Li
AU - Koishida, Kazuhito
AU - Makino, Shoji
N1 - Funding Information:
This work was partially supported by JSPS KAKENHI Grant Number 18J20059. A part of this work was performed while Li Li was an intern at Microsoft Corporation.
Publisher Copyright:
Copyright © 2020 ISCA
PY - 2020
Y1 - 2020
N2 - This paper proposes an online dual-microphone system for directional speech enhancement, which employs geometrically constrained independent vector analysis (IVA) based on the auxiliary function approach and vectorwise coordinate descent. An offline version was recently proposed and shown to outperform conventional auxiliary function approach-based IVA (AuxIVA) thanks to properly designed spatial constraints. We extend the offline algorithm to online processing by incorporating an autoregressive approximation of an auxiliary variable. Experimental evaluations showed that the proposed online algorithm runs in real time and outperforms online AuxIVA in speech enhancement, both when a fixed target is corrupted by spatially stationary interference and when the interference is spatially dynamic.
AB - This paper proposes an online dual-microphone system for directional speech enhancement, which employs geometrically constrained independent vector analysis (IVA) based on the auxiliary function approach and vectorwise coordinate descent. An offline version was recently proposed and shown to outperform conventional auxiliary function approach-based IVA (AuxIVA) thanks to properly designed spatial constraints. We extend the offline algorithm to online processing by incorporating an autoregressive approximation of an auxiliary variable. Experimental evaluations showed that the proposed online algorithm runs in real time and outperforms online AuxIVA in speech enhancement, both when a fixed target is corrupted by spatially stationary interference and when the interference is spatially dynamic.
KW - Geometric constraint
KW - Independent vector analysis (IVA)
KW - Multichannel speech enhancement
KW - Online
KW - Real-time
UR - http://www.scopus.com/inward/record.url?scp=85098111070&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85098111070&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2020-1484
DO - 10.21437/Interspeech.2020-1484
M3 - Conference article
AN - SCOPUS:85098111070
SN - 2308-457X
VL - 2020-October
SP - 61
EP - 65
JO - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
JF - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
T2 - 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020
Y2 - 25 October 2020 through 29 October 2020
ER -