Multi-pass ASR using vocabulary expansion

Katsutoshi Ohtsuki, Nobuaki Hiroshima, Shoichi Matsunaga, Yoshihiko Hayashi

研究成果: Paper査読

1 被引用数 (Scopus)


Current automatic speech recognition (ASR) systems have to limit their vocabulary size depending on available memory size, expected processing time, and available text data for building a vocabulary and a language model. Although the vocabularies of ASR systems are designed to achieve high coverage for the expected input data, it cannot be avoided that input data includes out-of-vocabulary (OOV) words. This is called the OOV problem. We propose dynamic vocabulary expansion using a conceptual base and multi-pass speech recognition using an expanded vocabulary. Relevant words to content of input speech are extracted based on a speech recognition result obtained using a reference vocabulary. An expanded vocabulary that includes fewer OOV words is built by adding the extracted words to the reference vocabulary. The second recognition process is performed using the new vocabulary. The experimental results for broadcast news speech show our method achieves a 30% reduction in OOV rate and improves speech recognition accuracy.

出版ステータスPublished - 2004
イベント8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of
継続期間: 2004 10月 42004 10月 8


Other8th International Conference on Spoken Language Processing, ICSLP 2004
国/地域Korea, Republic of
CityJeju, Jeju Island

ASJC Scopus subject areas

  • 言語および言語学
  • 言語学および言語


「Multi-pass ASR using vocabulary expansion」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。