TY - GEN
T1 - On microphone arrangement for multichannel speech enhancement based on nonnegative matrix factorization in time-channel domain
AU - Murase, Yoshikazu
AU - Chiba, Hironobu
AU - Ono, Nobutaka
AU - Miyabe, Shigeki
AU - Yamada, Takeshi
AU - Makino, Shoji
N1 - Publisher Copyright:
© 2014 Asia-Pacific Signal and Information Processing Ass.
PY - 2014/2/12
Y1 - 2014/2/12
N2 - In this paper, we investigate the relationship between the way microphones are arranged and the degree to which speech is enhanced using the transfer-function-gain nonnegative matrix factorization (NMF), which is an amplitude-based speech enhancement method that is suitable for use with an asynchronous distributed microphone array. In an asynchronous distributed microphone array, recording devices can be placed freely and the number of devices can be easily increased. Therefore, it is important that to determine the optimum microphone arrangement and the degree to which the performance is improved by using many microphones. We understood experimental evaluations to show that the performance by supervised NMF can achieve close to the ideal time-frequency masking with a sufficient number of microphones. We also show that the performance is better when more microphones are placed close to each source.
AB - In this paper, we investigate the relationship between the way microphones are arranged and the degree to which speech is enhanced using the transfer-function-gain nonnegative matrix factorization (NMF), which is an amplitude-based speech enhancement method that is suitable for use with an asynchronous distributed microphone array. In an asynchronous distributed microphone array, recording devices can be placed freely and the number of devices can be easily increased. Therefore, it is important that to determine the optimum microphone arrangement and the degree to which the performance is improved by using many microphones. We understood experimental evaluations to show that the performance by supervised NMF can achieve close to the ideal time-frequency masking with a sufficient number of microphones. We also show that the performance is better when more microphones are placed close to each source.
UR - http://www.scopus.com/inward/record.url?scp=84949924164&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84949924164&partnerID=8YFLogxK
U2 - 10.1109/APSIPA.2014.7041687
DO - 10.1109/APSIPA.2014.7041687
M3 - Conference contribution
AN - SCOPUS:84949924164
T3 - 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
BT - 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
Y2 - 9 December 2014 through 12 December 2014
ER -