A coherent computational approach to model bottom-up visual attention
Top Cited Papers
- 20 March 2006
- journal article
- clinical trial
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 28 (5) , 802-817
- https://doi.org/10.1109/tpami.2006.86
Abstract
Visual attention is a mechanism which filters out redundant visual information and detects the most relevant parts of our visual field. Automatic determination of the most visually relevant areas would be useful in many applications such as image and video coding, watermarking, video browsing, and quality assessment. Many research groups are currently investigating computational modeling of the visual attention system. The first published computational models have been based on some basic and well-understood human visual system (HVS) properties. These models feature a single perceptual layer that simulates only one aspect of the visual system. More recent models integrate complex features of the HVS and simulate hierarchical perceptual representation of the visual input. The bottom-up mechanism is the most occurring feature found in modern models. This mechanism refers to involuntary attention (i.e., salient spatial visual features that effortlessly or involuntary attract our attention). This paper presents a coherent computational approach to the modeling of the bottom-up visual attention. This model is mainly based on the current understanding of the HVS behavior. Contrast sensitivity functions, perceptual decomposition, visual masking, and center-surround interactions are some of the features implemented in this model. The performances of this algorithm are assessed by using natural images and experimental measurements from an eye-tracking system. Two adequate well-known metrics (correlation coefficient and Kullbacl-Leibler divergence) are used to validate this model. A further metric is also defined. The results from this model are finally compared to those from a reference bottom-up model.Keywords
This publication has 36 references indexed in Scilit:
- A computational model of recurrent, colinear long-range interaction in V1 for contour enhancement and junction detectionJournal of Vision, 2002
- Algorithms for defining visual regions-of-interest: comparison with eye fixationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2000
- Natural scene statistics at the centre of gazeNetwork: Computation in Neural Systems, 1999
- A model of saliency-based visual attention for rapid scene analysisPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1998
- Attentional capture by abrupt onsets: New perceptual objects or visual masking?Journal of Experimental Psychology: Human Perception and Performance, 1996
- Modeling visual attention via selective tuningArtificial Intelligence, 1995
- Visual motion and attentional capturePerception & Psychophysics, 1994
- The cortex transform: Rapid computation of simulated neural imagesComputer Vision, Graphics, and Image Processing, 1987
- The Laplacian Pyramid as a Compact Image CodeIEEE Transactions on Communications, 1983
- A feature-integration theory of attentionCognitive Psychology, 1980