Minimizing Binding Errors Using Learned Conjunctive Features
- 1 February 2000
- journal article
- Published by MIT Press in Neural Computation
- Vol. 12 (2) , 247-278
- https://doi.org/10.1162/089976600300015772
Abstract
We have studied some of the design trade-offs governing visual representations based on spatially invariant conjunctive feature detectors, with an emphasis on the susceptibility of such systems to false-positive recognition errors—Malsburg's classical binding problem. We begin by deriving an analytical model that makes explicit how recognition performance is affected by the number of objects that must be distinguished, the number of features included in the representation, the complexity of individual objects, and the clutter load, that is, the amount of visual material in the field of view in which multiple objects must be simultaneously recognized, independent of pose, and without explicit segmentation. Using the domain of text to model object recognition in cluttered scenes, we show that with corrections for the nonuniform probability and nonindependence of text features, the analytical model achieves good fits to measured recognition rates in simulations involving a wide range of clutter loads, word sizes, and feature counts. We then introduce a greedy algorithm for feature learning, derived from the analytical model, which grows a representation by choosing those conjunctive features that are most likely to distinguish objects from the cluttered backgrounds in which they are embedded. We show that the representations produced by this algorithm are compact, decorrelated, and heavily weighted toward features of low conjunctive order. Our results provide a more quantitative basis for understanding when spatially invariant conjunctive features can support unambiguous perception in multiobject scenes, and lead to several insights regarding the properties of visual representations optimized for specific recognition tasks.Keywords
This publication has 27 references indexed in Scilit:
- INVARIANT FACE AND OBJECT RECOGNITION IN THE VISUAL SYSTEMProgress in Neurobiology, 1997
- Speed of processing in the human visual systemNature, 1996
- Inferotemporal Cortex and Object VisionAnnual Review of Neuroscience, 1996
- Modeling visual recognition from neurobiological constraintsNeural Networks, 1994
- Multidimensional indexing for recognizing visual shapesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1994
- Distortion invariant object recognition in the dynamic link architectureIEEE Transactions on Computers, 1993
- Recognising Faces: Effects of Lighting Direction, Inversion, and Brightness ReversalPerception, 1992
- An interactive activation model of context effects in letter perception: I. An account of basic findings.Psychological Review, 1981
- The Ferrier Lecture, 1977 The neuron network of the cerebral cortex: a functional interpretationProceedings of the Royal Society of London. B. Biological Sciences, 1978
- Looking at upside-down faces.Journal of Experimental Psychology, 1969