Abstract
We developed a three-dimensional mechanical vocal cord model for Waseda Talker No. 7 (WT-7), an anthropomorphic talking robot, for generating speech sounds with various voice qualities. The vocal cord model is a cover model that has two thin folds made of thermoplastic material. The model self-oscillates by airflow exhausted from the lung model and generates the glottal sound source, which is fed into the vocal tract for generating the speech sound. Using the vocal cord model, breathy and creaky voices, as well as the modal (normal) voice, were produced in a manner similar to the human laryngeal control. The breathy voice is characterized by a noisy component mixed with the periodic glottal sound source and the creaky voice is characterized by an extremely low-pitch vibration. The breathy voice was produced by adjusting the glottal opening and generating the turbulence noise by the airflow just above the glottis. The creaky voice was produced by adjusting the vocal cord tension, the sub-glottal pressure and the vibration mass so as to generate a double-pitch vibration with a long pitch interval. The vocal cord model used to produce these voice qualities was evaluated in terms of the vibration pattern as measured by a high-speed camera, the glottal airflow and the acoustic characteristics of the glottal sound source, as compared to the data for a human.
Original language | English |
---|---|
Pages (from-to) | 105-120 |
Number of pages | 16 |
Journal | Advanced Robotics |
Volume | 26 |
Issue number | 1-2 |
DOIs | |
Publication status | Published - 2012 |
Keywords
- Humanoid robot
- speech production
- talking robot
- vocal cords
- voice quality
ASJC Scopus subject areas
- Software
- Human-Computer Interaction
- Control and Systems Engineering
- Hardware and Architecture
- Computer Science Applications