TY - GEN
T1 - Acoustic features for estimation of perceptional similarity
AU - Adachi, Yoshihiro
AU - Kawamoto, Shinichi
AU - Morishima, Shigeo
AU - Nakamura, Satoshi
PY - 2007
Y1 - 2007
N2 - This paper describes an examination of acoustic features for the estimation of perceptional similarity between speeches. We firstly extract some acoustic features including personality from speeches of 36 persons. Secondly, we calculate each distance between extracted features using Gaussian Mixture Model (GMM) or Dynamic Time Warping (DTW), and then we sort speeches based on the physical similarity. On the other hand, there is the permutation based on the perceptional similarity which is sorted according to the subject. We evaluate the physical features by the Spearman's rank correlation coefficient with two permutations. Consequently, the results show that DTW distance with high STRAIGHT Cepstrum is an optimum feature for estimation of perceptional similarity.
AB - This paper describes an examination of acoustic features for the estimation of perceptional similarity between speeches. We firstly extract some acoustic features including personality from speeches of 36 persons. Secondly, we calculate each distance between extracted features using Gaussian Mixture Model (GMM) or Dynamic Time Warping (DTW), and then we sort speeches based on the physical similarity. On the other hand, there is the permutation based on the perceptional similarity which is sorted according to the subject. We evaluate the physical features by the Spearman's rank correlation coefficient with two permutations. Consequently, the results show that DTW distance with high STRAIGHT Cepstrum is an optimum feature for estimation of perceptional similarity.
KW - Acoustic features
KW - Perceptional similarity
KW - Physical similarity
KW - Spearman's rank correlation coefficient
UR - http://www.scopus.com/inward/record.url?scp=38349082361&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38349082361&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-77255-2_33
DO - 10.1007/978-3-540-77255-2_33
M3 - Conference contribution
AN - SCOPUS:38349082361
SN - 9783540772545
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 306
EP - 314
BT - Advances in Multimedia Information Processing - PCM 2007 - 8th Pacific Rim Conference on Multimedia, Proceedings
PB - Springer Verlag
T2 - 8th Pacific-Rim Conference on Multimedia, PCM 2007
Y2 - 11 December 2007 through 14 December 2007
ER -