Spectral Mapping onto Probabilistic Domain Using Neural Networks and Its Application to Speaker Adaptive Phoneme Recognition

T. Kobayashi, K. Shirai

研究成果: Paper査読

抄録

A feature parameter space called PRPG (Probability Ratios between Phoneme Group pairs) is utilized for speaker adaptive phoneme recognition. The coordinate conversion is performed by neural networks. Each outputnode of the network represents a posteriori probability of phoneme group. Therefore, distance in the PRPG coordinate system corresponds directly to the difference of likelihood. The area with the same information for speech recognition is compressed into one point. Moreover, by the definition of the coordinate system, the meaning of axes are equivalent among different speakers, so the speaker adaptation can be easily performed without trajectory mapping. The experimental results show that the scores of the speaker-adaptive recognition in the PRPG domain are always superior to those of the speaker-dependent recognition in the spectral domain.

本文言語English
ページ385-388
ページ数4
出版ステータスPublished - 1992
イベント2nd International Conference on Spoken Language Processing, ICSLP 1992 - Banff, Canada
継続期間: 1992 10月 131992 10月 16

Conference

Conference2nd International Conference on Spoken Language Processing, ICSLP 1992
国/地域Canada
CityBanff
Period92/10/1392/10/16

ASJC Scopus subject areas

  • 言語および言語学
  • 言語学および言語

フィンガープリント

「Spectral Mapping onto Probabilistic Domain Using Neural Networks and Its Application to Speaker Adaptive Phoneme Recognition」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル