抄録
We propose a novel approach to auditory stream segregation which extracts individual sounds (auditory stream) from a mixture of sounds in auditory scene analysis. The HBSS (Harmonic-Based Stream Segregation) system is designed and developed by employing a multi-agent system. HBSS uses only harmonics as a clue to segregation and extracts auditory streams incrementally. When the tracer-generator agent detects a new sound, it spawns a tracer agent, which extracts an auditory stream by tracing its harmonic structure. The tracer sends a feedforward signal so that the generator and other tracers should not work on the same stream that is being traced. The quality of segregation may be poor due to redundant and ghost tracers. HBSS copes with this problem by introducing monitor agents, which detect and eliminate redundant and ghost tracers. HBSS can segregate two streams from a mixture of man's and woman's speech. It is easy to resynthesize speech or sounds from the corresponding streams. Additionally, HBSS can be easily extended by adding agents of a new capability. HBSS can be considered as the first step to computational auditory scene analysis.
本文言語 | English |
---|---|
ホスト出版物のタイトル | Proceedings of the National Conference on Artificial Intelligence |
Place of Publication | Menlo Park, CA, United States |
出版社 | AAAI |
ページ | 100-107 |
ページ数 | 8 |
巻 | 1 |
出版ステータス | Published - 1994 |
外部発表 | はい |
イベント | Proceedings of the 12th National Conference on Artificial Intelligence. Part 1 (of 2) - Seattle, WA, USA 継続期間: 1994 7月 31 → 1994 8月 4 |
Other
Other | Proceedings of the 12th National Conference on Artificial Intelligence. Part 1 (of 2) |
---|---|
City | Seattle, WA, USA |
Period | 94/7/31 → 94/8/4 |
ASJC Scopus subject areas
- ソフトウェア