TY - GEN
T1 - Musical instrument recognizer "instrogram" and its application to music retrieval based on instrumentation similarity
AU - Kitahara, Tetsuro
AU - Goto, Masataka
AU - Komatani, Kazunori
AU - Ogata, Tetsuya
AU - Okuno, Hiroshi G.
PY - 2006/12/1
Y1 - 2006/12/1
N2 - Instrumentation is an important cue in retrieving musical content. Conventional methods for instrument recognition performing notewise require accurate estimation of the onset time and fundamental frequency (F0) for each note, which is not easy in polyphonic music. This paper presents a non-notewise method for instrument recognition in polyphonic musical audio signals. Instead of such note-wise estimation, our method calculates the temporal trajectory of instrument existence probabilities for every F0 and visualizes it as a spectrogram-like graphical representation, called an instrogram. This method can avoid the influence by errors of onset detection and F0 estimation because it does not use them. We also present methods for MPEG-7-based instrument annotation and music information retrieval based on the similarity between instrograms. Experimental results with realistic music show the average accuracy of 76.2% for the instrument annotation and that the instrogram-based similarity measure represents the actual instrumentation similarity better than an MFCC-based one.
AB - Instrumentation is an important cue in retrieving musical content. Conventional methods for instrument recognition performing notewise require accurate estimation of the onset time and fundamental frequency (F0) for each note, which is not easy in polyphonic music. This paper presents a non-notewise method for instrument recognition in polyphonic musical audio signals. Instead of such note-wise estimation, our method calculates the temporal trajectory of instrument existence probabilities for every F0 and visualizes it as a spectrogram-like graphical representation, called an instrogram. This method can avoid the influence by errors of onset detection and F0 estimation because it does not use them. We also present methods for MPEG-7-based instrument annotation and music information retrieval based on the similarity between instrograms. Experimental results with realistic music show the average accuracy of 76.2% for the instrument annotation and that the instrogram-based similarity measure represents the actual instrumentation similarity better than an MFCC-based one.
UR - http://www.scopus.com/inward/record.url?scp=46249124867&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=46249124867&partnerID=8YFLogxK
U2 - 10.1109/ISM.2006.113
DO - 10.1109/ISM.2006.113
M3 - Conference contribution
AN - SCOPUS:46249124867
SN - 0769527469
SN - 9780769527468
T3 - ISM 2006 - 8th IEEE International Symposium on Multimedia
SP - 265
EP - 272
BT - ISM 2006 - 8th IEEE International Symposium on Multimedia
T2 - ISM 2006 - 8th IEEE International Symposium on Multimedia
Y2 - 11 December 2006 through 13 December 2006
ER -