Japanese Dictation Toolkit -1997 version

Tatsuya Kawahara*, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano

*この研究の対応する著者

研究成果: Article査読

28 被引用数 (Scopus)

抄録

The Japanese Dictation Toolkit has been designed and developed as a baseline platform for Japanese LVCSR (Large Vocabulary Continuous Speech Recognition). The platform consists of a standard recognition engine, Japanese phone models and Japanese statistical language models. We set up a variety of Japanese phone HMMs from a context-independent monophone to a triphone model of thousands of states. They are trained with ASJ (The Acoustical Society of Japan) databases. A lexicon and word N-gram (2-gram and 3-gram) models are constructed with a corpus of Mainichi newspaper. The recognition engine JULIUS is developed for evaluation of both acoustic and language models. As an integrated system of these modules, we have implemented a baseline 5,000-word dictation system and evaluated various components. The software repository is available to the public.

本文言語English
ページ(範囲)233-239
ページ数7
ジャーナルJournal of the Acoustical Society of Japan (E) (English translation of Nippon Onkyo Gakkaishi)
20
3
DOI
出版ステータスPublished - 1999

ASJC Scopus subject areas

  • 音響学および超音波学

フィンガープリント

「Japanese Dictation Toolkit -1997 version」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル