TY - JOUR
T1 - Automatic prosodic segmentation by F0 clustering using superpositional modeling
AU - Nakai, Mitsuru
AU - Singer, Harald
AU - Sagisaka, Yoshinori
AU - Shimodaira, Hiroshi
PY - 1995
Y1 - 1995
N2 - In this paper, we propose an automatic method for detecting accent phrase boundaries in Japanese continuous speech using F0 information. In the training phase, hand-labeled accent patterns are parameterized according to the superpositional model proposed by Fujisaki and assigned to clusters by a clustering method, in which an accent template is calculated as the centroid of each cluster. In the segmentation phase, automatic N-best extraction of boundaries is performed by One-Stage DP matching between the reference templates and the target F0 contour. About 90% of accent phrase boundaries were correctly detected in speaker-independent experiments with the ATR Japanese continuous speech database.
AB - In this paper, we propose an automatic method for detecting accent phrase boundaries in Japanese continuous speech using F0 information. In the training phase, hand-labeled accent patterns are parameterized according to the superpositional model proposed by Fujisaki and assigned to clusters by a clustering method, in which an accent template is calculated as the centroid of each cluster. In the segmentation phase, automatic N-best extraction of boundaries is performed by One-Stage DP matching between the reference templates and the target F0 contour. About 90% of accent phrase boundaries were correctly detected in speaker-independent experiments with the ATR Japanese continuous speech database.
UR - http://www.scopus.com/inward/record.url?scp=0028996982&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0028996982&partnerID=8YFLogxK
M3 - Conference article
AN - SCOPUS:0028996982
SN - 0736-7791
VL - 1
SP - 624
EP - 627
JO - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
JF - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
T2 - Proceedings of the 1995 20th International Conference on Acoustics, Speech, and Signal Processing. Part 1 (of 5)
Y2 - 9 May 1995 through 12 May 1995
ER -