Nonlinear manifold learning for visual speech recognition
- 19 November 2002
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 494-499
- https://doi.org/10.1109/iccv.1995.466899
Abstract
A technique for representing and learning smooth nonlinear manifolds is presented and applied to several lip reading tasks. Given a set of points drawn from a smooth manifold in an abstract feature space, the technique is capable of determining the structure of the surface and of finding the closest manifold point to a given query point. We use this technique to learn the "space of lips" in a visual speech recognition task. The learned manifold is used for tracking and extracting the lips, for interpolating between frames in an image sequence and for providing features for recognition. We describe a system based on hidden Markov models and this learned lip manifold that significantly improves the performance of acoustic speech recognizers in degraded environments. We also present preliminary results on a purely visual lip reader.
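The "closest manifold point" query mentioned in the abstract can be illustrated with a small sketch. This is not the method of the paper, which the abstract does not specify; it is one common approximation, assuming the manifold is locally well described by a linear patch fitted with PCA to the nearest sample points. The function name `project_to_manifold` and the parameters `k` (neighbourhood size) and `dim` (assumed manifold dimension) are hypothetical and chosen for illustration only.

```python
import numpy as np

def project_to_manifold(query, samples, k=8, dim=1):
    """Approximate the closest manifold point to `query`.

    Assumption (not from the paper): the manifold is locally linear,
    so a PCA patch through the k nearest samples stands in for it.
    """
    # Pick the k samples nearest to the query point.
    dists = np.linalg.norm(samples - query, axis=1)
    nbrs = samples[np.argsort(dists)[:k]]
    # Estimate the local tangent directions from the centered neighbours.
    center = nbrs.mean(axis=0)
    _, _, vt = np.linalg.svd(nbrs - center, full_matrices=False)
    basis = vt[:dim]                      # shape (dim, D)
    # Orthogonally project the query onto the affine tangent patch.
    return center + (query - center) @ basis.T @ basis

# Toy data: samples from a unit circle, a 1-D manifold in a 2-D space.
theta = np.linspace(0.0, 2.0 * np.pi, 400, endpoint=False)
circle = np.stack([np.cos(theta), np.sin(theta)], axis=1)

q = np.array([1.3, 0.0])                  # query point off the manifold
p = project_to_manifold(q, circle, k=8, dim=1)
print(p)                                  # lands close to (1, 0)
```

In a lip-reading setting the samples would be feature vectors describing lip configurations rather than points on a circle, and the projected point would serve as the cleaned-up estimate used for tracking or as a recognition feature.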