TY - GEN
T1 - Modeling characteristics of agglutinative languages with multi-class language model for ASR system
AU - Dawa, I.
AU - Sagisaka, Y.
AU - Nakamura, S.
PY - 2009/12/10
Y1 - 2009/12/10
N2 - In this paper, we discuss a new language model that considers the characteristics of the agglutinative languages. We used Mongolian (a Cyrillic language system used in Mongolia) as an example from which to build the language model. We developed a Multi-class N-gram language model based on similar word clustering that focuses on the variable suffixes of a word in Mongolian. By applying our proposed language model, the resulting recognition system can improve performance by 6.85% compared with a conventional word N-gram when applying the ATRASR engine. We also confirmed that our new model will be convenient for rapid development of an ASR system for resource-deficient languages, especially for agglutinative languages such as Mongolian.
AB - In this paper, we discuss a new language model that considers the characteristics of the agglutinative languages. We used Mongolian (a Cyrillic language system used in Mongolia) as an example from which to build the language model. We developed a Multi-class N-gram language model based on similar word clustering that focuses on the variable suffixes of a word in Mongolian. By applying our proposed language model, the resulting recognition system can improve performance by 6.85% compared with a conventional word N-gram when applying the ATRASR engine. We also confirmed that our new model will be convenient for rapid development of an ASR system for resource-deficient languages, especially for agglutinative languages such as Mongolian.
UR - http://www.scopus.com/inward/record.url?scp=71249128067&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=71249128067&partnerID=8YFLogxK
U2 - 10.1109/ICSDA.2009.5278368
DO - 10.1109/ICSDA.2009.5278368
M3 - Conference contribution
AN - SCOPUS:71249128067
SN - 9781424444007
T3 - 2009 Oriental COCOSDA International Conference on Speech Database and Assessments, ICSDA 2009
SP - 104
EP - 109
BT - 2009 Oriental COCOSDA International Conference on Speech Database and Assessments, ICSDA 2009
T2 - 2009 Oriental COCOSDA International Conference on Speech Database and Assessments, ICSDA 2009
Y2 - 10 August 2009 through 12 August 2009
ER -