TY - JOUR
T1 - A 3D guitar fingering assessing system based on CNN-Hand pose estimation and SVR-Assessment
AU - Wang, Zhao
AU - Ohya, Jun
N1 - Publisher Copyright:
© 2018, Society for Imaging Science and Technology.
PY - 2018
Y1 - 2018
N2 - This paper proposes a guitar fingering assessing system based on CNN (Convolutional Neural Network) hand pose estimation and SVR (Support Vector Regression) evaluation. To spur our progress, first, a CNN architecture is proposed to estimate temporal 3D position of 16 joints of hand; then, based on a DCT (Discrete Cosine Transform) feature and SVR, fingering of guitarist is scored to interpret how well guitarist played. We also release a new dataset for professional guitar playing analysis with significant advantage in total number of video, professional judgement by expert of guitarist, accurate annotation for hand pose and score of guitar performance. Experiments using videos containing multiple persons' guitar plays under different conditions demonstrate that the proposed method outperforms the current state-of-art with (1) low mean error (Euclid distance of 6,1 mm) and high computation efficiency for hand pose estimation; (2) high rank correlation (0.68) for assessing the fingering (C major scale and symmetrical excise) of guitarists.
AB - This paper proposes a guitar fingering assessing system based on CNN (Convolutional Neural Network) hand pose estimation and SVR (Support Vector Regression) evaluation. To spur our progress, first, a CNN architecture is proposed to estimate temporal 3D position of 16 joints of hand; then, based on a DCT (Discrete Cosine Transform) feature and SVR, fingering of guitarist is scored to interpret how well guitarist played. We also release a new dataset for professional guitar playing analysis with significant advantage in total number of video, professional judgement by expert of guitarist, accurate annotation for hand pose and score of guitar performance. Experiments using videos containing multiple persons' guitar plays under different conditions demonstrate that the proposed method outperforms the current state-of-art with (1) low mean error (Euclid distance of 6,1 mm) and high computation efficiency for hand pose estimation; (2) high rank correlation (0.68) for assessing the fingering (C major scale and symmetrical excise) of guitarists.
UR - http://www.scopus.com/inward/record.url?scp=85052905106&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85052905106&partnerID=8YFLogxK
U2 - 10.2352/ISSN.2470-1173.2018.09.IRIACV-204
DO - 10.2352/ISSN.2470-1173.2018.09.IRIACV-204
M3 - Conference article
AN - SCOPUS:85052905106
SN - 2470-1173
SP - 2781
EP - 2785
JO - IS and T International Symposium on Electronic Imaging Science and Technology
JF - IS and T International Symposium on Electronic Imaging Science and Technology
T2 - Intelligent Robotics and Industrial Applications using Computer Vision 2018, IRIACV 2018
Y2 - 28 January 2018 through 1 February 2018
ER -