抄録
This paper proposes methods for determining an appropriate mixing ratio of dialects in automatic speech recognition (ASR) for dialects. To handle ASR for various dialects, it has been re- ported to be effective to train a language model using a dialect- mixed corpus. One reason behind this is geographical continu- ity of spoken dialect; we regard spoken dialect as a mixture of various dialects. This mixing ratio changes at every moment as well as depends on a speaker. We can improve recognition accu- racy by giving an appropriate dialect mixing ratio for a speaker's dialect. The mixing ratio is generally unknown and requires to be estimated and updated referring to input utterances. We han- dle two methods for updating it based on recognition results; one is to compute contribution of dialects for each recognized word, and the other is to predict mixture information referring to a whole recognized sentence based on topic modeling. The experimental result shows that the mixing ratio estimated by these methods realized higher recognition accuracy than a fixed mixing ratio.
本文言語 | English |
---|---|
ホスト出版物のタイトル | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
出版社 | International Speech and Communication Association |
ページ | 1492-1496 |
ページ数 | 5 |
出版ステータス | Published - 2013 |
外部発表 | はい |
イベント | 14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 - Lyon, France 継続期間: 2013 8月 25 → 2013 8月 29 |
Other
Other | 14th Annual Conference of the International Speech Communication Association, INTERSPEECH 2013 |
---|---|
国/地域 | France |
City | Lyon |
Period | 13/8/25 → 13/8/29 |
ASJC Scopus subject areas
- 言語および言語学
- 人間とコンピュータの相互作用
- 信号処理
- ソフトウェア
- モデリングとシミュレーション