Abstract
This paper presents an improved semiautomatic labeling system which has been developed to construct a fine speech data base. An automatic labeling technique used in the system is based on optimal matching of an input speech sample to reference samples. A major point of improvement this system offers is that it is able to treat connected speech samples by means of a two-stage matching process. The phonetic unit for the labeling is an acoustically compact segment called APSEG (acoustic-phonetic segment). The latter part of the paper discusses speech representation using networks. The acoustic-phonetic variations of the utterances are described by the differences in the label sequences, so that they are represented by a network consisting of the nodes of the APSEG labels.
Original language | English |
---|---|
Pages (from-to) | 1-6 |
Number of pages | 6 |
Journal | Denshi Gijutsu Sogo Kenkyusho Iho/Bulletin of the Electrotechnical Laboratory |
Volume | 52 |
Issue number | 3 |
Publication status | Published - 1988 |
Externally published | Yes |
ASJC Scopus subject areas
- Condensed Matter Physics
- Electrical and Electronic Engineering