AUTOMATIC LABELING OF KNOWN SPEECH SAMPLES USING A RULE-BASED NETWORK REPRESENTATION AND SEGMENTATION TECHNIQUE.

Kazuyo Tanaka*, Satoru Hayamizu, Kozo Ohta

*この研究の対応する著者

研究成果: Article査読

抄録

An automatic labeling technique for known speech samples is proposed to construct a fine speech data base. A word (or sentence) is represented by a phonetic network which covers the acoustic variation contained in the utterances of the word (or sentence). An input speech sample is segmented using its parameter pattern dynamics and labeled to the optimal phonetic label (called APSEG) sequence by matching th segment sequence to the generated phonetic network using constrained dynamic programming. The feasibility of the method is confirmed when it is applied ot a word set containing 53 city names.

本文言語English
ページ(範囲)30-37
ページ数8
ジャーナルDenshi Gijutsu Sogo Kenkyusho Iho/Bulletin of the Electrotechnical Laboratory
52
3
出版ステータスPublished - 1988
外部発表はい

ASJC Scopus subject areas

  • 凝縮系物理学
  • 電子工学および電気工学

フィンガープリント

「AUTOMATIC LABELING OF KNOWN SPEECH SAMPLES USING A RULE-BASED NETWORK REPRESENTATION AND SEGMENTATION TECHNIQUE.」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル