抄録
The present paper dealt with speaker clustering for speech corrupted by noise. In general, the performance of speaker clustering significantly depends on how well the similarities between speech utterances can be measured. The recently proposed i-vector-based cosine similarity has yielded the state-of-the-art performance in speaker clustering systems. However, this similarity often fails to capture the speaker similarity under noisy conditions. Therefore, we attempted to examine the efficiency of spectral clustering on i-vector-based similarity for speech corrupted by noise because spectral clustering can yield robustness against noise by non-linear projection. Experimental comparisons demonstrated that spectral clustering yielded significant improvement from conventional methods, such as agglomerative clustering and k-means clustering, under non-stationary noise conditions.
本文言語 | English |
---|---|
ホスト出版物のタイトル | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings |
出版社 | Institute of Electrical and Electronics Engineers Inc. |
ページ | 2041-2045 |
ページ数 | 5 |
巻 | 2015-August |
ISBN(印刷版) | 9781467369978 |
DOI | |
出版ステータス | Published - 2015 8月 4 |
イベント | 40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Brisbane, Australia 継続期間: 2014 4月 19 → 2014 4月 24 |
Other
Other | 40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 |
---|---|
国/地域 | Australia |
City | Brisbane |
Period | 14/4/19 → 14/4/24 |
ASJC Scopus subject areas
- 信号処理
- ソフトウェア
- 電子工学および電気工学