TY - JOUR
T1 - Speech segment network approach for optimization of synthesis unit set
AU - Iwahashi, Naoto
AU - Sagisaka, Yoshinori
PY - 1995/10
Y1 - 1995/10
N2 - This paper proposes a speech segment network approach for constructing a synthesis unit set that yields high-quality synthesized speech yet is small enough to be practical. The approach selects a synthesis unit set in which segmental and/or inter-segmental distortions are minimized by using combinatorial optimization methods such as iterative improvement and simulated annealing. Experimental results using diphone segments show that suitable diphone unit sets, with the total or maximum inter-segmental distortion reduced by about 35% and 30%, respectively, can be constructed using this method. The reduction rate increased as the segment candidate population increased. The effectiveness of this unit set design was also confirmed perceptually by a listening test using speech synthesized with the selected diphone unit set.
AB - This paper proposes a speech segment network approach for constructing a synthesis unit set that yields high-quality synthesized speech yet is small enough to be practical. The approach selects a synthesis unit set in which segmental and/or inter-segmental distortions are minimized by using combinatorial optimization methods such as iterative improvement and simulated annealing. Experimental results using diphone segments show that suitable diphone unit sets, with the total or maximum inter-segmental distortion reduced by about 35% and 30%, respectively, can be constructed using this method. The reduction rate increased as the segment candidate population increased. The effectiveness of this unit set design was also confirmed perceptually by a listening test using speech synthesized with the selected diphone unit set.
UR - http://www.scopus.com/inward/record.url?scp=0029386592&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0029386592&partnerID=8YFLogxK
U2 - 10.1006/csla.1995.0016
DO - 10.1006/csla.1995.0016
M3 - Article
AN - SCOPUS:0029386592
SN - 0885-2308
VL - 9
SP - 335
EP - 352
JO - Computer Speech and Language
JF - Computer Speech and Language
IS - 4
ER -