Phrase recognition in conversational speech using prosodic and phonemic information

Shigeki Okawa*, Takashi Endo, Tetsunori Kobayashi, Katsuhiko Shirai

*Corresponding author for this work

Research output: Article (peer-reviewed)

5 Citations (Scopus)

Abstract

In this paper, a new scheme for phrase recognition in conversational speech is proposed, in which prosodic and phonemic information processing are combined. This approach is employed both to produce candidate phrase boundaries and to discriminate phonemes. The fundamental frequency patterns of continuous utterances are statistically analyzed, and the likelihood of the occurrence of a phrase boundary is calculated for every frame. At the same time, the likelihood of the phonemic characteristics of each frame is obtained using a hierarchical clustering method. These two scores, along with lexical and grammatical constraints, are used to develop possible word sequences or word lattices that correspond to the continuous speech utterances. Our preliminary experiment shows the feasibility of applying prosody to continuous speech recognition, especially for conversational-style utterances.
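
As an illustration only, the idea of combining a per-frame phrase-boundary score with a per-frame phonemic score can be sketched in Python as below. This is not the authors' implementation: the abstract gives no formulas, so the log-linear combination, the `weight` parameter, the function names, and the toy probabilities used in the test data are all assumptions. The lexical and grammatical constraints are represented only by the fact that the caller supplies a fixed list of candidate hypotheses.

```python
import math

# Hypothetical sketch: all names and the scoring rule are assumptions,
# not taken from the paper. Probabilities must lie strictly in (0, 1).

def prosodic_score(boundary_prob, boundaries):
    """Log-score of a set of hypothesized phrase-boundary frames.

    boundary_prob[t] is an assumed per-frame probability that a phrase
    boundary occurs at frame t (e.g. derived from F0 patterns).
    """
    total = 0.0
    for t, p in enumerate(boundary_prob):
        total += math.log(p) if t in boundaries else math.log(1.0 - p)
    return total

def phonemic_score(phoneme_prob, labels):
    """Sum of per-frame log-probabilities of the hypothesized phoneme labels.

    phoneme_prob[t] is an assumed dict mapping phoneme label -> probability.
    """
    return sum(math.log(frame[label]) for frame, label in zip(phoneme_prob, labels))

def combined_score(boundary_prob, phoneme_prob, hyp, weight=1.0):
    """Assumed log-linear combination of the two frame-level scores."""
    return (phonemic_score(phoneme_prob, hyp["labels"])
            + weight * prosodic_score(boundary_prob, hyp["boundaries"]))

def best_hypothesis(boundary_prob, phoneme_prob, hyps, weight=1.0):
    """Pick the candidate with the highest combined score."""
    return max(hyps, key=lambda h: combined_score(boundary_prob, phoneme_prob, h, weight))
```

In this sketch, two candidates with identical phonemic scores are separated by the prosodic term alone, so the candidate whose boundary falls on a frame with a high boundary probability is preferred. Setting `weight=0.0` reduces the ranking to the phonemic score only.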

Original language: English
Pages (from-to): 44-50
Number of pages: 7
Journal: IEICE Transactions on Information and Systems
Volume: E76-D
Issue number: 1
Publication status: Published - 1993 Jan 1

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Vision and Pattern Recognition
  • Electrical and Electronic Engineering
  • Artificial Intelligence
