抄録
In recent years, plenty of studies endeavor to analyze the life auditory scenarios via mining non-speech sounds. Conventional audio recognition schemes clearly bound the feature extraction and recognition stages, such as in speech recognition. However, such separation leads to inconsistency in the purposes at each stage. The recognition stage contributes to portray the global data distribution focusing on 'relationship' between signal samples. However, such consideration can hardly be embedded into feature extraction process which centered on the local structure, thus, the prominent 'relation' information is destroyed. In this paper, we propose a unified acoustic recognition framework taking advantage of primitive feature input without injuring discriminant information and adopting effective classification scheme accordingly. We formulate the sound into subspace representation and initially adopt Grassmannian distance to classify the subspace-indexed non-speech sounds. To validate the proposed framework, we conducted experiments using RWCP Sound Scene Database. The experimental results demonstrated the proposed framework achieved fine recognition performance with high efficiency.
本文言語 | English |
---|---|
ホスト出版物のタイトル | Proceedings - 45th International Conference on Parallel Processing Workshops, ICPPW 2016 |
出版社 | Institute of Electrical and Electronics Engineers Inc. |
ページ | 421-425 |
ページ数 | 5 |
巻 | 2016-September |
ISBN(電子版) | 9781509028252 |
DOI | |
出版ステータス | Published - 2016 9月 23 |
イベント | 45th International Conference on Parallel Processing Workshops, ICPPW 2016 - Philadelphia, United States 継続期間: 2016 8月 16 → 2016 8月 19 |
Other
Other | 45th International Conference on Parallel Processing Workshops, ICPPW 2016 |
---|---|
国/地域 | United States |
City | Philadelphia |
Period | 16/8/16 → 16/8/19 |
ASJC Scopus subject areas
- ソフトウェア
- 数学 (全般)
- ハードウェアとアーキテクチャ