Investigation of ASR systems for resource-deficient languages

I. Dawa*, Yoshinori Sagisaka, Satoshi Nakamura

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

Because the minority languages in China have their special characteristics, it is not suitable to directly adopt the traditional automatic speech recognition (ASR) methods which are used for some major languages, such as Chinese, English, Japanese, etc. In this paper, we take Mongolian (a resource-deficient language) as an example and build the acoustic and language models for applying the ATRASR system. In this paper, we specially focus on the language modeling aspect by considering the special characteristics of the Mongolian. We trained a multi-class N-gram language model based on similar word clustering. By applying the proposed language model, the system could improve the performance by 5.5% compared with the conventional word N-gram.

Original languageEnglish
Pages (from-to)550-557
Number of pages8
JournalZidonghua Xuebao/ Acta Automatica Sinica
Volume36
Issue number4
DOIs
Publication statusPublished - 2010 Apr
Externally publishedYes

Keywords

  • Agglutinative language
  • Continuous speech recognition
  • Mongolian language
  • Multi-class N-gram model
  • Similar word clustering

ASJC Scopus subject areas

  • Software
  • Control and Systems Engineering
  • Information Systems
  • Computer Graphics and Computer-Aided Design

Fingerprint

Dive into the research topics of 'Investigation of ASR systems for resource-deficient languages'. Together they form a unique fingerprint.

Cite this