TY - GEN
T1 - Automatic synchronization between lyrics and music CD recordings based on Viterbi alignment of segregated vocal signals
AU - Fujihara, Hiromasa
AU - Goto, Masataka
AU - Ogata, Jun
AU - Komatani, Kazunori
AU - Ogata, Tetsuya
AU - Okuno, Hiroshi G.
PY - 2006
Y1 - 2006
N2 - This paper describes a system that can automatically synchronize between polyphonic musical audio signals and corresponding lyrics. Although there were methods that can synchronize between monophonie speech signals and corresponding text transcriptions by using Viterbi alignment techniques, they cannot be applied to vocals in CD recordings because accompaniment sounds often overlap with vocals. To align lyrics with such vocals, we therefore developed three methods: a method for segregating vocals from polyphonic sound mixtures, a method for detecting vocal sections, and a method for adapting a speech-recognizer phone model to segregated vocal signals. Experimental results for 10 Japanese popular-music songs showed that our system can synchronize between music and lyrics with satisfactory accuracy for 8 songs.
AB - This paper describes a system that can automatically synchronize between polyphonic musical audio signals and corresponding lyrics. Although there were methods that can synchronize between monophonie speech signals and corresponding text transcriptions by using Viterbi alignment techniques, they cannot be applied to vocals in CD recordings because accompaniment sounds often overlap with vocals. To align lyrics with such vocals, we therefore developed three methods: a method for segregating vocals from polyphonic sound mixtures, a method for detecting vocal sections, and a method for adapting a speech-recognizer phone model to segregated vocal signals. Experimental results for 10 Japanese popular-music songs showed that our system can synchronize between music and lyrics with satisfactory accuracy for 8 songs.
UR - http://www.scopus.com/inward/record.url?scp=34547508425&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547508425&partnerID=8YFLogxK
U2 - 10.1109/ISM.2006.38
DO - 10.1109/ISM.2006.38
M3 - Conference contribution
AN - SCOPUS:34547508425
SN - 0769527469
SN - 9780769527468
T3 - ISM 2006 - 8th IEEE International Symposium on Multimedia
SP - 257
EP - 264
BT - ISM 2006 - 8th IEEE International Symposium on Multimedia
T2 - ISM 2006 - 8th IEEE International Symposium on Multimedia
Y2 - 11 December 2006 through 13 December 2006
ER -