Abstract
To model the perception of temporal structure in speech, we applied a comprehensive computational model of the human auditory periphery to detect changes in speech signals that potentially indicate the arrival of new events. In each tonotopic sub-band, an increase in activation level was counted toward the plausibility of a new event, while a decrease was ignored. The total contour obtained by integrating the sub-band information exhibited sharper peaks and dips than the loudness contour. A quantitative evaluation on estimating the speaking rate of natural speech also showed that the event-plausibility model performs better than the loudness model.
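The sub-band rule described in the abstract can be illustrated with a minimal sketch. It assumes the sub-band activation levels have already been computed by some auditory model and are given as one sequence of frames per band. The function name `event_plausibility`, the input layout, the use of a simple frame-to-frame difference, and the plain summation across bands are all assumptions made for illustration; the paper's actual auditory model and integration step may differ.

```python
from typing import List, Sequence


def event_plausibility(activation: Sequence[Sequence[float]]) -> List[float]:
    """Sum positive frame-to-frame activation changes across sub-bands.

    activation[b][t] is the (hypothetical) activation level of
    sub-band b at frame t. Decreases contribute nothing.
    """
    if not activation:
        return []
    n_frames = len(activation[0])
    contour = [0.0] * n_frames
    for band in activation:
        if len(band) != n_frames:
            raise ValueError("all sub-bands must have the same number of frames")
        for t in range(1, n_frames):
            delta = band[t] - band[t - 1]
            if delta > 0.0:  # keep increases, ignore decreases
                contour[t] += delta
    return contour


# Two toy sub-bands, four frames each (made-up values).
toy = [
    [0.0, 1.0, 3.0, 2.0],
    [1.0, 1.0, 0.0, 2.0],
]
print(event_plausibility(toy))
```

Because decreases are discarded before the bands are combined, a drop in one band cannot cancel a rise in another, which is one plausible reason the resulting contour would show sharper peaks than a loudness contour.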
Original language | English |
---|---|
Pages | 677-680 |
Number of pages | 4 |
Publication status | Published - 2005 |
Event | 9th European Conference on Speech Communication and Technology, Lisbon, Portugal (2005 Sept 4 → 2005 Sept 8) |
Conference
Conference | 9th European Conference on Speech Communication and Technology |
---|---|
Country/Territory | Portugal |
City | Lisbon |
Period | 2005 Sept 4 → 2005 Sept 8 |
ASJC Scopus subject areas
- Engineering(all)