Learning to recognise talking faces
- 1 January 1996
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 4 (10514651) , 55-59 vol.4
- https://doi.org/10.1109/icpr.1996.547233
Abstract
An approach for person identification is described based on spatio-temporal analysis of the talking face. A person is represented by a parametric model of the visible speech articulators and their temporal characteristics during speech production. The model consists of shape parameters, representing the lip contour and intensity parameters representing the grey level distribution in the mouth region. The model is used to track lips in image sequences where the model parameters are recovered from the tracking results. While some of these parameters relate to speech information, others are intuitively related to different persons and we show that models based on these features enable successful person identification. We model the shape and intensity parameters as mixtures of Gaussians and their temporal dependencies by hidden Markov models. Identifying a talking person is performed by estimating the likelihood of each model for having generated the observed sequence of features and the model with the highest likelihood is chosen as the identified person.Keywords
This publication has 16 references indexed in Scilit:
- Visual speech recognition using active shape models and hidden Markov modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1996
- Locating and tracking facial speech featuresPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1996
- Person identification using multiple cuesIEEE Transactions on Pattern Analysis and Machine Intelligence, 1995
- Human and machine recognition of faces: a surveyProceedings of the IEEE, 1995
- Connectionist models of face processing: A surveyPattern Recognition, 1994
- Use of active shape models for locating structures in medical imagesImage and Vision Computing, 1994
- Automatic recognition and analysis of human faces and facial expressions: a surveyPattern Recognition, 1992
- Evaluating the articulation index for auditory–visual inputThe Journal of the Acoustical Society of America, 1991
- Identification and ratings of caricatures: Implications for mental representations of facesCognitive Psychology, 1987
- A Maximum Likelihood Approach to Continuous Speech RecognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1983