Word Spotting in Conversational Speech Based on Phonemic Unit Likelihood by Mutual Information Criterion

Shigeki Okawa, Tetsunori Kobayashi, Katsuhiko Shirai

Research output: Paper, peer-reviewed

1 citation (Scopus)

Abstract

This paper proposes a novel scheme for keyword spotting in conversational speech using the frame-level likelihood of phonemes and statistics of their duration. Since spontaneous utterances include many ill-formed sentences, it is very difficult to realize a highly advanced continuous speech recognition system based on a top-down, syntax-driven process. We therefore propose a bottom-up method that detects keywords in continuous speech with a dynamic programming (DP) technique using both phonemic and durational likelihood. Our algorithm is based on an island-driven DP method in which both endpoints are free. In a performance test of speaker-dependent keyword spotting, the number of erroneous candidates and the processing time decreased to 1/6 of those of the conventional continuous DP method. This result shows the feasibility of our method for continuous speech recognition, especially for conversational-style utterances.
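
As an illustration of the kind of matching the abstract describes, the following is a minimal sketch of a DP search with free start and end points that combines frame-level phoneme log-likelihoods with a duration score. It is not the authors' island-driven algorithm and does not use the mutual information criterion from the title. The function names, the Gaussian duration model, the `max_dur` limit, and the toy data are all assumptions made for this example.

```python
import math


def gaussian_log_pdf(x, mean, std):
    # Log density of a normal distribution; used here as an assumed duration model.
    return -0.5 * math.log(2 * math.pi * std * std) - (x - mean) ** 2 / (2 * std * std)


def spot_keyword(frame_loglik, keyword, dur_stats, max_dur=10):
    """Find the best-scoring occurrence of a keyword in an utterance.

    frame_loglik[t][p]: log-likelihood of phoneme p at frame t (hypothetical input).
    keyword: list of phoneme indices making up the keyword.
    dur_stats[p]: (mean, std) of the duration of phoneme p, in frames.
    Returns (score, start, end), with `end` exclusive.
    """
    T = len(frame_loglik)
    K = len(keyword)
    neg_inf = float("-inf")
    # best[k][t]: best score for the first k phonemes, ending just before frame t.
    best = [[neg_inf] * (T + 1) for _ in range(K + 1)]
    start = [[-1] * (T + 1) for _ in range(K + 1)]
    for t in range(T + 1):
        best[0][t] = 0.0      # free start: the keyword may begin at any frame
        start[0][t] = t
    for k in range(1, K + 1):
        p = keyword[k - 1]
        mean, std = dur_stats[p]
        for t in range(1, T + 1):
            acoustic = 0.0
            for d in range(1, min(max_dur, t) + 1):
                # Accumulate the phonemic score over the last d frames.
                acoustic += frame_loglik[t - d][p]
                prev = best[k - 1][t - d]
                if prev == neg_inf:
                    continue
                score = prev + acoustic + gaussian_log_pdf(d, mean, std)
                if score > best[k][t]:
                    best[k][t] = score
                    start[k][t] = start[k - 1][t - d]
    # Free end: the keyword may finish at any frame.
    end = max(range(T + 1), key=lambda t: best[K][t])
    return best[K][end], start[K][end], end


if __name__ == "__main__":
    hi, lo = math.log(0.8), math.log(0.1)
    # Toy utterance: which phoneme (0, 1 or 2) each of the 8 frames favours.
    favoured = [0, 0, 1, 1, 1, 2, 2, 0]
    frames = [[hi if p == f else lo for p in range(3)] for f in favoured]
    durations = {1: (3.0, 1.0), 2: (2.0, 1.0)}
    print(spot_keyword(frames, [1, 2], durations))
```

Because the scores are unnormalized log-likelihoods, the duration term is what keeps the search from collapsing onto very short segments. A practical system would also need a threshold or length normalization to reject utterances that do not contain the keyword.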

Original language: English
Pages: 1281-1284
Number of pages: 4
Publication status: Published - 1993
Event: 3rd European Conference on Speech Communication and Technology, EUROSPEECH 1993 - Berlin, Germany
Duration: 22 Sep 1993 to 25 Sep 1993

Conference

Conference: 3rd European Conference on Speech Communication and Technology, EUROSPEECH 1993
Country/Territory: Germany
City: Berlin
Period: 93/9/22 to 93/9/25

ASJC Scopus subject areas

  • Software
  • Language and Linguistics
  • Computer Science Applications
  • Communication
