Online meeting recognizer with multichannel speaker diarization

Shoko Araki*, Takaaki Hori, Masakiyo Fujimoto, Shinji Watanabe, Takuya Yoshioka, Tomohiro Nakatani, Atsushi Nakamura

*この研究の対応する著者

研究成果: Conference contribution

10 被引用数 (Scopus)

抄録

We present our newly developed real-time conversation analyzer for group meetings. The goal of the system is to estimate automatically "who speaks when and what" in an online manner. In our system, "who speaks when" information is first obtained by estimating the directions of arrival (DOAs) of signals. Then, "who speaks what" is estimated with our automatic speech recognition (ASR) system, after suppressing reverberation, background noise, and interference speakers' voices. In this paper, we focus particularly on the speaker diarization ("who speaks when" estimation) method, and we show that the speaker diarization information helps the ASR to reduce insertion errors.

本文言語English
ホスト出版物のタイトルConference Record of the 44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010
ページ1697-1701
ページ数5
DOI
出版ステータスPublished - 2010 12月 1
外部発表はい
イベント44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010 - Pacific Grove, CA, United States
継続期間: 2010 11月 72010 11月 10

出版物シリーズ

名前Conference Record - Asilomar Conference on Signals, Systems and Computers
ISSN(印刷版)1058-6393

Other

Other44th Asilomar Conference on Signals, Systems and Computers, Asilomar 2010
国/地域United States
CityPacific Grove, CA
Period10/11/710/11/10

ASJC Scopus subject areas

  • 信号処理
  • コンピュータ ネットワークおよび通信

フィンガープリント

「Online meeting recognizer with multichannel speaker diarization」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル