Speaker adaptive Phoneme recognition based on feature mapping from spectral domain to probabilistic domain

Tetsunori Kobayashi, Y. Uchiyama, J. Osada, K. Shirai

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    1 Citation (Scopus)

    Abstract

    A new feature parameter space for speech recognition called PRPG (Probability Ratios between Phoneme Group pairs) has been proposed and speaker adaptive phoneme recognition has been performed. In the coordinate system proposed here, the area with the same information for speech recognition is compressed into one point. The mapping function from spectral coordinate system to proposed one is realized using a neural network. The code-vectors designed on this coordinate system are assured to be informationtheoretically more efficient than that of spectral coordinate system. Moreover, by the definition of the coordinate system, the meaning of axes are equivalent among different speakers, so the speaker adaptation can be easily performed without trajectory mapping. The experimental results show that the 40% of errors are reduced by the coordinate conversion in the speaker-dependent tasks. The scores of the speakeradaptive tasks in the proposed feature domain are always superior to those of the speaker-dependent tasks in the spectral domain.

    Original languageEnglish
    Title of host publicationICASSP 1992 - 1992 International Conference on Acoustics, Speech, and Signal Processing
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages457-460
    Number of pages4
    Volume1
    ISBN (Electronic)0780305329
    DOIs
    Publication statusPublished - 1992
    Event1992 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992 - San Francisco, United States
    Duration: 1992 Mar 231992 Mar 26

    Other

    Other1992 International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992
    Country/TerritoryUnited States
    CitySan Francisco
    Period92/3/2392/3/26

    ASJC Scopus subject areas

    • Software
    • Signal Processing
    • Electrical and Electronic Engineering

    Fingerprint

    Dive into the research topics of 'Speaker adaptive Phoneme recognition based on feature mapping from spectral domain to probabilistic domain'. Together they form a unique fingerprint.

    Cite this