An embedded knowledge integration for hybrid language modeling

Shuwu Zhang, Hirofumi Yamamoto, Yoshinori Sagisaka

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

This paper describes an embedded architecture to couple utilizable language knowledge and innovative language models, as well as modeling approaches, for intensive language modeling in speech recognition. In this embedded mechanism, three innovative language modeling approaches at different levels, ie., composite N-gram, dis tance-related unit association maximum entropy (DU-AME), and linkgram, have different functions to extend the definitions of basic language units, favorably improve the underlying model instead of conventional N-grams and provide effective combination with longer history syntactic lnk dependency knowledge, respectively. In this threelevel hybrid language modeling, each lower level modeling serves the higher level modeling(s). The results in each level are well utized or embedded in the higher level(s). These models can be trained level by level Accordingly, some prospective language constraints can finally be embedded in a wellorganized hybrid model. Experimental data based on the embedded modeling show that the hybrid model reduces WER 14.5% compared with the conventional word-based bigram model As a result, it can be expected to improve the conventional statistical language modelng.

Original languageEnglish
Title of host publication6th International Conference on Spoken Language Processing, ICSLP 2000
PublisherInternational Speech Communication Association
ISBN (Electronic)7801501144, 9787801501141
Publication statusPublished - 2000
Externally publishedYes
Event6th International Conference on Spoken Language Processing, ICSLP 2000 - Beijing, China
Duration: 2000 Oct 162000 Oct 20

Publication series

Name6th International Conference on Spoken Language Processing, ICSLP 2000

Other

Other6th International Conference on Spoken Language Processing, ICSLP 2000
Country/TerritoryChina
CityBeijing
Period00/10/1600/10/20

ASJC Scopus subject areas

  • Linguistics and Language
  • Language and Linguistics

Fingerprint

Dive into the research topics of 'An embedded knowledge integration for hybrid language modeling'. Together they form a unique fingerprint.

Cite this