Automatic synchronization between lyrics and music CD recordings based on Viterbi alignment of segregated vocal signals

Hiromasa Fujihara*, Masataka Goto, Jun Ogata, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

*この研究の対応する著者

研究成果: Conference contribution

31 被引用数 (Scopus)

抄録

This paper describes a system that can automatically synchronize between polyphonic musical audio signals and corresponding lyrics. Although there were methods that can synchronize between monophonie speech signals and corresponding text transcriptions by using Viterbi alignment techniques, they cannot be applied to vocals in CD recordings because accompaniment sounds often overlap with vocals. To align lyrics with such vocals, we therefore developed three methods: a method for segregating vocals from polyphonic sound mixtures, a method for detecting vocal sections, and a method for adapting a speech-recognizer phone model to segregated vocal signals. Experimental results for 10 Japanese popular-music songs showed that our system can synchronize between music and lyrics with satisfactory accuracy for 8 songs.

本文言語English
ホスト出版物のタイトルISM 2006 - 8th IEEE International Symposium on Multimedia
ページ257-264
ページ数8
DOI
出版ステータスPublished - 2006
外部発表はい
イベントISM 2006 - 8th IEEE International Symposium on Multimedia - San Diego, CA, United States
継続期間: 2006 12月 112006 12月 13

出版物シリーズ

名前ISM 2006 - 8th IEEE International Symposium on Multimedia

Conference

ConferenceISM 2006 - 8th IEEE International Symposium on Multimedia
国/地域United States
CitySan Diego, CA
Period06/12/1106/12/13

ASJC Scopus subject areas

  • コンピュータ ネットワークおよび通信

フィンガープリント

「Automatic synchronization between lyrics and music CD recordings based on Viterbi alignment of segregated vocal signals」の研究トピックを掘り下げます。これらがまとまってユニークなフィンガープリントを構成します。

引用スタイル