Abstract
This paper proposes a novel keyword-spotting scheme for conversational speech that uses frame-level phoneme likelihoods and phoneme duration statistics. Because spontaneous utterances contain many ill-formed sentences, it is very difficult to build a highly advanced continuous speech recognition system on a top-down, syntax-driven process. We therefore propose a bottom-up method that detects keywords in continuous speech with a dynamic programming (DP) technique using both phonemic and durational likelihoods. The algorithm is based on an island-driven, both-side-free DP method. In a performance test of speaker-dependent keyword spotting, the number of erroneous candidates and the processing time decreased to 1/6 with the new method, compared with the conventional continuous DP method. This result shows the feasibility of the method for continuous speech recognition, especially for conversational-style utterances.
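The idea of combining phonemic and durational likelihoods in a DP search with free endpoints can be illustrated with a minimal sketch. This is not the island-driven algorithm of the paper: it is a simplified left-to-right segmental DP, and the function names (`spot_keyword`, `log_gauss`), the Gaussian duration model, the `max_dur` limit, and the input layout (one dict of phoneme log-likelihoods per frame) are all assumptions made for illustration.

```python
import math


def log_gauss(x, mean, std):
    """Log density of a normal distribution, used as a duration score (assumed model)."""
    return -0.5 * math.log(2 * math.pi * std * std) - (x - mean) ** 2 / (2 * std * std)


def spot_keyword(frame_loglik, keyword, dur_stats, max_dur=10):
    """Find the best-scoring occurrence of `keyword` in an utterance.

    frame_loglik: list with one dict per frame, mapping phoneme -> log-likelihood.
    keyword:      list of phoneme labels.
    dur_stats:    dict mapping phoneme -> (mean, std) of its duration in frames.
    Returns (score, start, end), with `end` exclusive.
    """
    T, K = len(frame_loglik), len(keyword)
    NEG = float("-inf")
    # best[k][t]: best score for the first k phonemes ending just before frame t.
    best = [[NEG] * (T + 1) for _ in range(K + 1)]
    start = [[-1] * (T + 1) for _ in range(K + 1)]
    for t in range(T + 1):
        best[0][t] = 0.0  # free start: a match may begin at any frame
        start[0][t] = t
    for k in range(1, K + 1):
        ph = keyword[k - 1]
        mean, std = dur_stats[ph]
        for t in range(1, T + 1):
            acoustic = 0.0
            for d in range(1, min(max_dur, t) + 1):
                # Phoneme k covers frames t-d .. t-1; accumulate its acoustic score.
                acoustic += frame_loglik[t - d][ph]
                prev = best[k - 1][t - d]
                if prev == NEG:
                    continue
                score = prev + acoustic + log_gauss(d, mean, std)
                if score > best[k][t]:
                    best[k][t] = score
                    start[k][t] = start[k - 1][t - d]
    # Free end: the match may finish at any frame.
    end = max(range(T + 1), key=lambda t: best[K][t])
    return best[K][end], start[K][end], end
```

Because the score is an unnormalized sum of log-likelihoods, a practical system would likely normalize by match length or compare against a background model before thresholding; the sketch omits this to keep the recursion visible.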
Original language | English |
---|---|
Pages | 1281-1284 |
Number of pages | 4 |
Publication status | Published - 1993 |
Event | 3rd European Conference on Speech Communication and Technology, EUROSPEECH 1993 - Berlin, Germany; Duration: 1993 Sept 22 → 1993 Sept 25 |
Conference
Conference | 3rd European Conference on Speech Communication and Technology, EUROSPEECH 1993 |
---|---|
Country/Territory | Germany |
City | Berlin |
Period | 93/9/22 → 93/9/25 |
ASJC Scopus subject areas
- Software
- Linguistics and Language
- Computer Science Applications
- Communication