TY - GEN
T1 - Speech-to-image media conversion based on VQ and neural network
AU - Morishima, Shigeo
AU - Harashima, Hiroshi
PY - 1991/12/1
Y1 - 1991/12/1
N2 - Automatic media conversion schemes from speech to a facial image and a construction of a real-time image synthesis system are presented. The purpose of this research is to realize an intelligent human-machine interface or intelligent communication system with synthesized human face images. A human face image is reconstructed on the display of a terminal using a 3-D surface model and texture mapping technique. Facial motion images are synthesized by transformation of the 3-D model. In the motion driving method, based on vector quantization and the neural network, the synthesized head image can appear to speak some given words and phrases naturally, in synchronization with voice signals from a speaker.
AB - Automatic media conversion schemes from speech to a facial image and a construction of a real-time image synthesis system are presented. The purpose of this research is to realize an intelligent human-machine interface or intelligent communication system with synthesized human face images. A human face image is reconstructed on the display of a terminal using a 3-D surface model and texture mapping technique. Facial motion images are synthesized by transformation of the 3-D model. In the motion driving method, based on vector quantization and the neural network, the synthesized head image can appear to speak some given words and phrases naturally, in synchronization with voice signals from a speaker.
UR - http://www.scopus.com/inward/record.url?scp=0026396368&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0026396368&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:0026396368
SN - 078030033
T3 - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
SP - 2865
EP - 2868
BT - Proceedings - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing
A2 - Anon, null
PB - Publ by IEEE
T2 - Proceedings of the 1991 International Conference on Acoustics, Speech, and Signal Processing - ICASSP 91
Y2 - 14 May 1991 through 17 May 1991
ER -