Abstract
Because the minority languages in China have their special characteristics, it is not suitable to directly adopt the traditional automatic speech recognition (ASR) methods which are used for some major languages, such as Chinese, English, Japanese, etc. In this paper, we take Mongolian (a resource-deficient language) as an example and build the acoustic and language models for applying the ATRASR system. In this paper, we specially focus on the language modeling aspect by considering the special characteristics of the Mongolian. We trained a multi-class N-gram language model based on similar word clustering. By applying the proposed language model, the system could improve the performance by 5.5% compared with the conventional word N-gram.
Original language | English |
---|---|
Pages (from-to) | 550-557 |
Number of pages | 8 |
Journal | Zidonghua Xuebao/ Acta Automatica Sinica |
Volume | 36 |
Issue number | 4 |
DOIs | |
Publication status | Published - 2010 Apr |
Externally published | Yes |
Keywords
- Agglutinative language
- Continuous speech recognition
- Mongolian language
- Multi-class N-gram model
- Similar word clustering
ASJC Scopus subject areas
- Software
- Control and Systems Engineering
- Information Systems
- Computer Graphics and Computer-Aided Design