TY - GEN
T1 - A trial of communicative prosody generation based on control characteristic of one word utterance observed in real conversational speech
AU - Greenberg, Yoko
AU - Shibuya, Nagisa
AU - Tsuzaki, Minoru
AU - Kato, Hiroaki
AU - Sagisaka, Yoshinori
N1 - Funding Information:
This work was also supported in part by the Grant-in-Aid for Scientific Research (A)(2), No. 16200016, JSPS.
Publisher Copyright:
© 2006 Proceedings of the International Conference on Speech Prosody.
PY - 2006
Y1 - 2006
AB - Aiming at prosody control for conversational speech synthesis, communicative prosodies were generated based on the prosodic characteristics derived from the one-word utterance "n". Grouping F0 patterns using VQ revealed four F0 dynamic patterns (rise, gradual fall, fall, and rise&fall) across a large number of one-word utterances "n" in daily conversations. Through analysis using an F0 generation model, different control characteristics were found for these patterns. A communicative prosody control scheme is proposed for short utterances, reflecting these control characteristics for three-dimensional representative perceptual impressions (confident-doubtful, allowable-unacceptable, and positive-negative) previously obtained by MDS analysis. Naturalness evaluation tests for synthesized conversational speech showed that the proposed prosody control was superior in naturalness. These results indicate the possibility of communicative prosody generation for conversational speech synthesis through perceptual impression expressions using a corpus-based approach.
UR - http://www.scopus.com/inward/record.url?scp=85089849822&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85089849822&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85089849822
T3 - Proceedings of the International Conference on Speech Prosody
BT - 3rd International Conference on Speech Prosody 2006
A2 - Hoffmann, R.
A2 - Mixdorff, H.
PB - International Speech Communication Association
T2 - 3rd International Conference on Speech Prosody, SP 2006
Y2 - 2 May 2006 through 5 May 2006
ER -