Phoneme boundary estimation using bidirectional recurrent neural networks and its applications

Toshiaki Fukada, Mike Schuster, Yoshinori Sagisaka

Research output: Contribution to journalArticlepeer-review

11 Citations (Scopus)

Abstract

This paper describes a phoneme boundary estimation method based on bidirectional recurrent neural networks (BRNNs). Experimental results showed that the proposed method could estimate segment boundaries significantly better than an HMM or a multilayer perceptron-based method. Furthermore, we incorporated the BRNN-based segment boundary estimator into the HMM-based and segment model-based recognition systems. As a result, we confirmed that (1) BRNN outputs were effective for improving the recognition rate and reducing computational time in an HMM-based recognition system and (2) segment lattices obtained by the proposed methods dramatically reduce the computational complexity of segment model-based recognition.

Original languageEnglish
Pages (from-to)20-30
Number of pages11
JournalSystems and Computers in Japan
Volume30
Issue number4
DOIs
Publication statusPublished - 1999 Apr
Externally publishedYes

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Information Systems
  • Hardware and Architecture
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Phoneme boundary estimation using bidirectional recurrent neural networks and its applications'. Together they form a unique fingerprint.

Cite this