Latent topic similarity for music retrieval and its application to a system that supports DJ performance

Tatsunori Hirai, Hironori Doi, Shigeo Morishima

Research output: Contribution to journalArticlepeer-review

Abstract

This paper presents a topic modeling method to retrieve similar music fragments and its application, Music- Mixer, which is a computer-aided DJ system that supports DJ performance by automatically mixing songs in a seamless manner. MusicMixer mixes songs based on audio similarity calculated via beat analysis and latent topic analysis of the chromatic signal in the audio. The topic represents latent semantics on how chromatic sounds are generated. Given a list of songs, a DJ selects a song with beats and sounds similar to a specific point of the currently playing song to seamlessly transition between songs. By calculating similarities between all existing song sections that can be naturally mixed, MusicMixer retrieves the best mixing point from a myriad of possibilities and enables seamless song transitions. Although it is comparatively easy to calculate beat similarity from audio signals, considering the semantics of songs from the viewpoint of a human DJ has proven difficult. Therefore, we propose a method to represent audio signals to construct topic models that acquire latent semantics of audio. The results of a subjective experiment demonstrate the effectiveness of the proposed latent semantic analysis method. MusicMixer achieves automatic song mixing using the audio signal processing approach; thus, users can perform DJ mixing simply by selecting a song from a list of songs suggested by the system.

Original languageEnglish
Pages (from-to)276-284
Number of pages9
JournalJournal of information processing
Volume26
DOIs
Publication statusPublished - 2018 Jan

Keywords

  • Automatic song mixing
  • Computer-aided performance
  • Topic analysis

ASJC Scopus subject areas

  • Computer Science(all)

Fingerprint

Dive into the research topics of 'Latent topic similarity for music retrieval and its application to a system that supports DJ performance'. Together they form a unique fingerprint.

Cite this