Harmonic sound stream segregation using localization and its application to speech stream segregation

Tomohiro Nakatani, Hiroshi G. Okuno

Research output: Contribution to journalArticlepeer-review

33 Citations (Scopus)

Abstract

Sound stream segregation is essential to understand auditory events in the real world. In this paper, we present a new method of segregating a series of harmonic sounds. The harmonic structure and sound source direction are used as clues for segregation. The direction information of the sources is used to extract fundamental frequencies of individual harmonic sounds, and harmonic sounds are segregated according to the extracted fundamental frequencies. Sequential grouping of harmonic sounds is achieved by using both sound source directions and fundamental frequencies. An application of the harmonic stream segregation to speech stream segregation is presented. It provides effective speech stream segregation using binaural microphones. Experimental results show that the method reduces the spectrum distortions and the fundamental frequency errors compared to an existing monaural system, and that it can segregate three simultaneous harmonic streams with only two microphones.

Original languageEnglish
Pages (from-to)209-222
Number of pages14
JournalSpeech Communication
Volume27
Issue number3
DOIs
Publication statusPublished - 1999 Apr
Externally publishedYes

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Experimental and Cognitive Psychology
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Harmonic sound stream segregation using localization and its application to speech stream segregation'. Together they form a unique fingerprint.

Cite this