TY - GEN
T1 - A DOA based speaker diarization system for real meetings
AU - Araki, Shoko
AU - Fujimoto, Masakiyo
AU - Ishizuka, Kentaro
AU - Sawada, Hiroshi
AU - Makino, Shoji
PY - 2008
Y1 - 2008
N2 - This paper presents a speaker diarization system that estimates who spoke when in a meeting. Our proposed system is realized by using a noise robust voice activity detector (VAD), a direction of arrival (DOA) estimator, and a DOA classifier. Our previous system utilized the generalized cross correlation method with the phase transform (GCC-PHAT) approach for the DOA estimation. Because the GCC-PHAT can estimate just one DOA per frame, it was difficult to handle speaker overlaps. This paper tries to deal with this issue by employing a DOA at each time-frequency slot (TFDOA), and reports how it improves diarization performance for real meetings / conversations recorded in a room with a reverberation time of 350 ms.
AB - This paper presents a speaker diarization system that estimates who spoke when in a meeting. Our proposed system is realized by using a noise robust voice activity detector (VAD), a direction of arrival (DOA) estimator, and a DOA classifier. Our previous system utilized the generalized cross correlation method with the phase transform (GCC-PHAT) approach for the DOA estimation. Because the GCC-PHAT can estimate just one DOA per frame, it was difficult to handle speaker overlaps. This paper tries to deal with this issue by employing a DOA at each time-frequency slot (TFDOA), and reports how it improves diarization performance for real meetings / conversations recorded in a room with a reverberation time of 350 ms.
KW - Diarization
KW - Direction of arrival
KW - Voice activity detector
UR - http://www.scopus.com/inward/record.url?scp=50449094778&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50449094778&partnerID=8YFLogxK
U2 - 10.1109/HSCMA.2008.4538680
DO - 10.1109/HSCMA.2008.4538680
M3 - Conference contribution
AN - SCOPUS:50449094778
SN - 9781424423385
T3 - 2008 Hands-free Speech Communication and Microphone Arrays, Proceedings, HSCMA 2008
SP - 29
EP - 32
BT - 2008 Hands-free Speech Communication and Microphone Arrays, Proceedings, HSCMA 2008
T2 - 2008 Hands-free Speech Communication and Microphone Arrays, HSCMA 2008
Y2 - 6 May 2008 through 8 May 2008
ER -