TY - GEN
T1 - Speaker verification-based evaluation of single-channel speech separation
AU - Maciejewski, Matthew
AU - Watanabe, Shinji
AU - Khudanpur, Sanjeev
N1 - Publisher Copyright: Copyright © 2021 ISCA.
PY - 2021
Y1 - 2021
N2 - Speech enhancement techniques typically focus on intrinsic metrics of signal quality. The overwhelming majority of deep learning-based single-channel speech separation studies, for instance, have relied on a single class of metrics to evaluate the systems by. These metrics, usually variants of Signal-to-Distortion Ratio (SDR), measure fidelity to the “ground truth” waveform. This can be problematic, not only for lack of diversity in evaluation metrics, but also in cases where a perfect ground truth waveform may be unavailable. In this work, we explore the value of speaker verification as an extrinsic metric of separation quality, with additional utility as evidence of the benefits of separation as pre-processing for downstream tasks.
KW - Speaker verification
KW - speech separation
UR - http://www.scopus.com/inward/record.url?scp=85119171773&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85119171773&partnerID=8YFLogxK
U2 - 10.21437/Interspeech.2021-1924
DO - 10.21437/Interspeech.2021-1924
M3 - Conference contribution
AN - SCOPUS:85119171773
T3 - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
SP - 2353
EP - 2357
BT - 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021
PB - International Speech Communication Association
T2 - 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021
Y2 - 30 August 2021 through 3 September 2021
ER -