SHARABLE SOFTWARE REPOSITORY FOR JAPANESE LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION

Tatsuya Kawahara*, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano

*Corresponding author for this work

Research output: Contribution to conferencePaperpeer-review

29 Citations (Scopus)

Abstract

The project of Japanese LVCSR (Large Vocabulary Continuous Speech Recognition) platform is introduced. 1 It is a collaboration of researchers of different academic institutes and intended to develop a sharable software repository of not only databases but also models and programs. The platform consists of a standard recognition engine, Japanese phone models and Japanese statistical language models. A set of Japanese phone HMMs are trained with ASJ (Acoustic Society of Japan) databases of 20K sentence utterances per each gender. Japanese word N-gram (2-gram and 3-gram) models are constructed with a corpus of Mainichi newspaper of four years. The recognition engine JULIUS is developed for assessment of both acoustic and language models. The modules are integrated as a Japanese LVCSR system and evaluated on 5000-word dictation task. The software repository is available to the public.

Original languageEnglish
Publication statusPublished - 1998
Event5th International Conference on Spoken Language Processing, ICSLP 1998 - Sydney, Australia
Duration: 1998 Nov 301998 Dec 4

Conference

Conference5th International Conference on Spoken Language Processing, ICSLP 1998
Country/TerritoryAustralia
CitySydney
Period98/11/3098/12/4

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'SHARABLE SOFTWARE REPOSITORY FOR JAPANESE LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION'. Together they form a unique fingerprint.

Cite this