Using Segmentation to Verify Object Hypotheses

1 June 2007

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Abstract

We present an approach for object recognition that combines detection and segmentation within a efficient hypothesize/test framework. Scanning-window template classifiers are the current state-of-the-art for many object classes such as faces, cars, and pedestrians. Such approaches, though quite successful, can be hindered by their lack of explicit encoding of object shape/structure - one might, for example, find faces in trees. We adopt the following strategy; we first use these systems as attention mechanisms, generating many possible object locations by tuning them for low missed-detections and high false-positives. At each hypothesized detection, we compute a local figure-ground segmentation using a window of slightly larger extent than that used by the classifier. This segmentation task is guided by top-down knowledge. We learn offline from training data those segmentations that are consistent with true positives. We then prune away those hypotheses with bad segmentations. We show this strategy leads to significant improvements (10-20%) over established approaches such as ViolaJones and DalalTriggs on a variety of benchmark datasets including the PASCAL challenge, LabelMe, and the INRIAPerson dataset.

Keywords

This publication has 11 references indexed in Scilit:

OBJCUT: Efficient Segmentation Using Top-Down and Bottom-Up Cues
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2009
LabelMe: A Database and Web-Based Tool for Image Annotation
International Journal of Computer Vision, 2007
OBJCUT for Face Detection
Published by Springer Nature ,2006
Rapid object detection using a boosted cascade of simple features
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Histograms of Oriented Gradients for Human Detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Learning-Based Computer Vision with Intel’s Open Source Computer Vision Library
Intel Technology Journal, 2005
LOCUS: learning object classes with unsupervised segmentation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Interleaved Object Categorization and Segmentation
Published by British Machine Vision Association and Society for Pattern Recognition ,2003
Fast approximate energy minimization via graph cuts
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1999
On the verification of hypothesized matches in model-based recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1991