Application of auditory image model for speech event detection

Minoru Tsuzaki*, Satomi Tanaka, Hiroaki Kato, Yoshinori Sagisaka

*この研究の対応する著者

研究成果: Paper査読

1 被引用数 (Scopus)

抄録

To provide an appropriate model for perception of temporal structures of speech, we applied a comprehensive computational model of the human auditory peripherals to detect changes in speech signals that potentially indicate arrivals of new events. In each tonotopic sub-band, an increase in the activation level was taken into account for the plausibility of a new event, while a decrease was ignored. The total contour obtained by integrating the sub-band information exhibited sharp peaks and dips compared to the loudness contour. A quantitative evaluation to estimate the speaking rate of natural speech also demonstrated that the event-plausibility model performs better than the loudness model.

本文言語English
ページ677-680
ページ数4
出版ステータスPublished - 2005
イベント9th European Conference on Speech Communication and Technology - Lisbon, Portugal
継続期間: 2005 9月 42005 9月 8

Conference

Conference9th European Conference on Speech Communication and Technology
国/地域Portugal
CityLisbon
Period05/9/405/9/8

ASJC Scopus subject areas

  • 工学(全般)

フィンガープリント

「Application of auditory image model for speech event detection」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル