TY - GEN
T1 - Audio-based automatic generation of a piano reduction score by considering the musical structure
AU - Takamori, Hirofumi
AU - Nakatsuka, Takayuki
AU - Fukayama, Satoru
AU - Goto, Masataka
AU - Morishima, Shigeo
N1 - Funding Information:
Supported by JST ACCEL, Japan (grant no. JPMJAC1602).
Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - This study describes a method that automatically generates a piano reduction score from the audio recordings of popular music while considering the musical structure. The generated score comprises both right- and left-hand piano parts, which reflect the melodies, chords, and rhythms extracted from the original audio signals. Generating such a reduction score from an audio recording is challenging because automatic music transcription is still considered to be inefficient when the input contains sounds from various instruments. Reflecting the long-term correlation structure behind similar repetitive bars is also challenging; further, previous methods have independently generated each bar. Our approach addresses the aforementioned issues by integrating musical analysis, especially structural analysis, with music generation. Our method extracts rhythmic features as well as melodies and chords from the input audio recording and reflects them in the score. To consider the long-term correlation between bars, we use similarity matrices, created for several acoustical features, as constraints. We further conduct a multivariate regression analysis to determine the acoustical features that represent the most valuable constraints for generating a musical structure. We have generated piano scores using our method and have observed that we can produce scores that differently balance between the ability to achieve rhythmic characteristics and the ability to obtain musical structures.
AB - This study describes a method that automatically generates a piano reduction score from the audio recordings of popular music while considering the musical structure. The generated score comprises both right- and left-hand piano parts, which reflect the melodies, chords, and rhythms extracted from the original audio signals. Generating such a reduction score from an audio recording is challenging because automatic music transcription is still considered to be inefficient when the input contains sounds from various instruments. Reflecting the long-term correlation structure behind similar repetitive bars is also challenging; further, previous methods have independently generated each bar. Our approach addresses the aforementioned issues by integrating musical analysis, especially structural analysis, with music generation. Our method extracts rhythmic features as well as melodies and chords from the input audio recording and reflects them in the score. To consider the long-term correlation between bars, we use similarity matrices, created for several acoustical features, as constraints. We further conduct a multivariate regression analysis to determine the acoustical features that represent the most valuable constraints for generating a musical structure. We have generated piano scores using our method and have observed that we can produce scores that differently balance between the ability to achieve rhythmic characteristics and the ability to obtain musical structures.
KW - Acoustic feature
KW - Multivariate regression analysis
KW - Musical structure
KW - Piano reduction
KW - Self-similarity matrix
UR - http://www.scopus.com/inward/record.url?scp=85059836205&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85059836205&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-05716-9_14
DO - 10.1007/978-3-030-05716-9_14
M3 - Conference contribution
AN - SCOPUS:85059836205
SN - 9783030057152
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 169
EP - 181
BT - MultiMedia Modeling - 25th International Conference, MMM 2019, Proceedings
A2 - Huet, Benoit
A2 - Kompatsiaris, Ioannis
A2 - Vrochidis, Stefanos
A2 - Mezaris, Vasileios
A2 - Cheng, Wen-Huang
A2 - Gurrin, Cathal
PB - Springer Verlag
T2 - 25th International Conference on MultiMedia Modeling, MMM 2019
Y2 - 8 January 2019 through 11 January 2019
ER -