A comparative study of spectral clustering for i-vector-based speaker clustering under noisy conditions

    研究成果: Conference contribution

    4 被引用数 (Scopus)

    抄録

    The present paper dealt with speaker clustering for speech corrupted by noise. In general, the performance of speaker clustering significantly depends on how well the similarities between speech utterances can be measured. The recently proposed i-vector-based cosine similarity has yielded the state-of-the-art performance in speaker clustering systems. However, this similarity often fails to capture the speaker similarity under noisy conditions. Therefore, we attempted to examine the efficiency of spectral clustering on i-vector-based similarity for speech corrupted by noise because spectral clustering can yield robustness against noise by non-linear projection. Experimental comparisons demonstrated that spectral clustering yielded significant improvement from conventional methods, such as agglomerative clustering and k-means clustering, under non-stationary noise conditions.

    本文言語English
    ホスト出版物のタイトルICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
    出版社Institute of Electrical and Electronics Engineers Inc.
    ページ2041-2045
    ページ数5
    2015-August
    ISBN(印刷版)9781467369978
    DOI
    出版ステータスPublished - 2015 8月 4
    イベント40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Brisbane, Australia
    継続期間: 2014 4月 192014 4月 24

    Other

    Other40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015
    国/地域Australia
    CityBrisbane
    Period14/4/1914/4/24

    ASJC Scopus subject areas

    • 信号処理
    • ソフトウェア
    • 電子工学および電気工学

    引用スタイル