TY - GEN
T1 - A language model adaptation using multiple varied corpora
AU - Yamamoto, H.
AU - Sagisaka, Y.
PY - 2001
Y1 - 2001
N2 - A new language model adaptation scheme is proposed to cope with multiple varied speech recognition tasks. Both topic difference and sentence style difference resulting from the speaker's role are reflected in the proposed language model adaptation. An adaptation is carried out using two different language corpora where only the topic or speaker's style is matched. New word clustering techniques are introduced to extract the topic or style dependency separately. Word neighboring characteristics in the two adaptation source data are regarded as different features in this clustering. All words are classified into commonly used word classes and topic or style dependent classes. Furthermore, target topic and sentence style dependent words and their neighboring characteristics are emphasized according to their frequency in the adaptation target data. In the evaluation experiment, the proposed method shows a 13% lower perplexity and a 9% lower word error rate in continuous speech recognition compared with the conventional adaptation method.
AB - A new language model adaptation scheme is proposed to cope with multiple varied speech recognition tasks. Both topic difference and sentence style difference resulting from the speaker's role are reflected in the proposed language model adaptation. An adaptation is carried out using two different language corpora where only the topic or speaker's style is matched. New word clustering techniques are introduced to extract the topic or style dependency separately. Word neighboring characteristics in the two adaptation source data are regarded as different features in this clustering. All words are classified into commonly used word classes and topic or style dependent classes. Furthermore, target topic and sentence style dependent words and their neighboring characteristics are emphasized according to their frequency in the adaptation target data. In the evaluation experiment, the proposed method shows a 13% lower perplexity and a 9% lower word error rate in continuous speech recognition compared with the conventional adaptation method.
UR - http://www.scopus.com/inward/record.url?scp=84962802488&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84962802488&partnerID=8YFLogxK
U2 - 10.1109/ASRU.2001.1034666
DO - 10.1109/ASRU.2001.1034666
M3 - Conference contribution
AN - SCOPUS:84962802488
T3 - 2001 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001 - Conference Proceedings
SP - 389
EP - 392
BT - 2001 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001 - Conference Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001
Y2 - 9 December 2001 through 13 December 2001
ER -