抄録
A new feature parameter space for speech recognition called PRPG (Probability Ratios between Phoneme Group pairs) has been proposed and speaker adaptive phoneme recognition has been performed. In the coordinate system proposed here, the area with the same information for speech recognition is compressed into one point. The mapping function from spectral coordinate system to proposed one is realized using a neural network. The code-vectors designed on this coordinate system are assured to be informationtheoretically more efficient than that of spectral coordinate system. Moreover, by the definition of the coordinate system, the meaning of axes are equivalent among different speakers, so the speaker adaptation can be easily performed without trajectory mapping. The experimental results show that the 40% of errors are reduced by the coordinate conversion in the speaker-dependent tasks. The scores of the speakeradaptive tasks in the proposed feature domain are always superior to those of the speaker-dependent tasks in the spectral domain.
本文言語 | English |
---|---|
ホスト出版物のタイトル | ICASSP 1992 - 1992 International Conference on Acoustics, Speech, and Signal Processing |
出版社 | Institute of Electrical and Electronics Engineers Inc. |
ページ | 457-460 |
ページ数 | 4 |
巻 | 1 |
ISBN(電子版) | 0780305329 |
DOI | |
出版ステータス | Published - 1992 |
イベント | 1992 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992 - San Francisco, United States 継続期間: 1992 3月 23 → 1992 3月 26 |
Other
Other | 1992 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992 |
---|---|
国/地域 | United States |
City | San Francisco |
Period | 92/3/23 → 92/3/26 |
ASJC Scopus subject areas
- ソフトウェア
- 信号処理
- 電子工学および電気工学