TY - JOUR
T1 - Latent topic similarity for music retrieval and its application to a system that supports DJ performance
AU - Hirai, Tatsunori
AU - Doi, Hironori
AU - Morishima, Shigeo
N1 - Publisher Copyright:
© 2018 Information Processing Society of Japan.
PY - 2018/1
Y1 - 2018/1
N2 - This paper presents a topic modeling method to retrieve similar music fragments and its application, Music- Mixer, which is a computer-aided DJ system that supports DJ performance by automatically mixing songs in a seamless manner. MusicMixer mixes songs based on audio similarity calculated via beat analysis and latent topic analysis of the chromatic signal in the audio. The topic represents latent semantics on how chromatic sounds are generated. Given a list of songs, a DJ selects a song with beats and sounds similar to a specific point of the currently playing song to seamlessly transition between songs. By calculating similarities between all existing song sections that can be naturally mixed, MusicMixer retrieves the best mixing point from a myriad of possibilities and enables seamless song transitions. Although it is comparatively easy to calculate beat similarity from audio signals, considering the semantics of songs from the viewpoint of a human DJ has proven difficult. Therefore, we propose a method to represent audio signals to construct topic models that acquire latent semantics of audio. The results of a subjective experiment demonstrate the effectiveness of the proposed latent semantic analysis method. MusicMixer achieves automatic song mixing using the audio signal processing approach; thus, users can perform DJ mixing simply by selecting a song from a list of songs suggested by the system.
AB - This paper presents a topic modeling method to retrieve similar music fragments and its application, Music- Mixer, which is a computer-aided DJ system that supports DJ performance by automatically mixing songs in a seamless manner. MusicMixer mixes songs based on audio similarity calculated via beat analysis and latent topic analysis of the chromatic signal in the audio. The topic represents latent semantics on how chromatic sounds are generated. Given a list of songs, a DJ selects a song with beats and sounds similar to a specific point of the currently playing song to seamlessly transition between songs. By calculating similarities between all existing song sections that can be naturally mixed, MusicMixer retrieves the best mixing point from a myriad of possibilities and enables seamless song transitions. Although it is comparatively easy to calculate beat similarity from audio signals, considering the semantics of songs from the viewpoint of a human DJ has proven difficult. Therefore, we propose a method to represent audio signals to construct topic models that acquire latent semantics of audio. The results of a subjective experiment demonstrate the effectiveness of the proposed latent semantic analysis method. MusicMixer achieves automatic song mixing using the audio signal processing approach; thus, users can perform DJ mixing simply by selecting a song from a list of songs suggested by the system.
KW - Automatic song mixing
KW - Computer-aided performance
KW - Topic analysis
UR - http://www.scopus.com/inward/record.url?scp=85043983409&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85043983409&partnerID=8YFLogxK
U2 - 10.2197/ipsjjip.26.276
DO - 10.2197/ipsjjip.26.276
M3 - Article
AN - SCOPUS:85043983409
SN - 0387-5806
VL - 26
SP - 276
EP - 284
JO - Journal of information processing
JF - Journal of information processing
ER -