Dynamic appearance-based recognition
- 22 November 2002
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 540-546
- https://doi.org/10.1109/cvpr.1997.609378
Abstract
We describe a hierarchical appearance-based method for learning, recognizing, and predicting arbitrary spatiotemporal sequences of images. The method, which implements a robust hierarchical form of the Kalman filter derived from the Minimum Description Length (MDL) principle, includes as a special case several well-known object encoding techniques including eigenspace methods for static recognition. Successive levels of the hierarchical filter implement dynamic models operating over successively larger spatial and temporal scales. Each hierarchical level predicts the recognition state at a lower level and modifies its own recognition state using the residual error between the prediction and the actual lower-level state. Simultaneously, on a longer time scale, the filter learns an internal model of input dynamics by adapting its generative and state transition matrices at each level to minimize prediction errors. The resulting prediction/learning scheme thereby implements an on-line form of the well-known Expectation-Maximization (EM) algorithm from statistics. We present experimental results demonstrating the method's efficacy in mediating robust spatiotemporal recognition in a variety of scenarios containing varying degrees of occlusions and clutter.Keywords
This publication has 14 references indexed in Scilit:
- Dynamic Model of Visual Recognition Predicts Neural Response Properties in the Visual CortexNeural Computation, 1997
- Emergence of simple-cell receptive field properties by learning a sparse code for natural imagesNature, 1996
- Dealing with occlusions in the eigenspace approachPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1996
- An active vision architecture based on iconic representationsArtificial Intelligence, 1995
- Visual learning and recognition of 3-d objects from appearanceInternational Journal of Computer Vision, 1995
- What Is the Goal of Sensory Coding?Neural Computation, 1994
- Multiscale recursive estimation, data fusion, and regularizationIEEE Transactions on Automatic Control, 1994
- Eigenfaces for RecognitionJournal of Cognitive Neuroscience, 1991
- Robust StatisticsPublished by Wiley ,1981
- A Mathematical Theory of CommunicationBell System Technical Journal, 1948