An intelligent facial image coding driven by speech and phoneme

13 January 2003

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 1795-1798 vol.3
https://doi.org/10.1109/icassp.1989.266799

Abstract

The authors propose and compare two types of model-based facial motion coding schemes, i.e. synthesis by rules and synthesis by parameters. In synthesis by rules, facial motion images are synthesized on the basis of rules extracted by analysis of training image samples that include all of the phonemes and coarticulation. This system can be utilized as an automatic facial animation synthesizer from text input or as a man-machine interface using the facial motion image. In synthesis by parameters, facial motion images are synthesized on the basis of a code word index of speech parameters. Experimental results indicate good performance for both systems, which can create natural facial-motion images with very low transmission rate. Details of 3-D modeling, algorithm synthesis, and performance are discussed.<>

Keywords

This publication has 3 references indexed in Scilit:

A muscle model for animation three-dimensional facial expression
ACM SIGGRAPH Computer Graphics, 1987
Characteristics of the mouth shape in the production of Japanese - Stroboscopic observation.
Acoustical Science and Technology, 1982
An Algorithm for Vector Quantizer Design
IEEE Transactions on Communications, 1980