Using Segmentation to Verify Object Hypotheses
- 1 June 2007
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
We present an approach for object recognition that combines detection and segmentation within an efficient hypothesize/test framework. Scanning-window template classifiers are the current state of the art for many object classes such as faces, cars, and pedestrians. Such approaches, though quite successful, can be hindered by their lack of explicit encoding of object shape/structure: one might, for example, find faces in trees. We adopt the following strategy. We first use these systems as attention mechanisms, generating many possible object locations by tuning them for few missed detections at the cost of many false positives. At each hypothesized detection, we compute a local figure-ground segmentation using a window of slightly larger extent than that used by the classifier. This segmentation task is guided by top-down knowledge. We learn offline from training data those segmentations that are consistent with true positives. We then prune away those hypotheses with bad segmentations. We show this strategy leads to significant improvements (10-20%) over established approaches such as Viola-Jones and Dalal-Triggs on a variety of benchmark datasets including the PASCAL challenge, LabelMe, and the INRIAPerson dataset.
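The hypothesize/test strategy described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the names `Hypothesis`, `expand`, `mask_agreement`, and `verify` are hypothetical, the segmenter is passed in as an arbitrary callable, and pixel-wise agreement with a single template mask is a simple stand-in assumed here for the learned model of segmentations consistent with true positives.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Box = Tuple[int, int, int, int]  # (x, y, width, height)
Mask = List[List[int]]           # binary figure-ground mask, 1 = figure


@dataclass
class Hypothesis:
    box: Box      # window proposed by the scanning-window detector
    score: float  # detector confidence


def expand(box: Box, pad: int) -> Box:
    """Grow the box by `pad` pixels per side, giving the slightly larger segmentation window."""
    x, y, w, h = box
    return (x - pad, y - pad, w + 2 * pad, h + 2 * pad)


def mask_agreement(mask: Mask, template: Mask) -> float:
    """Fraction of pixels on which two equally sized masks agree (assumed consistency score)."""
    total = sum(len(row) for row in template)
    same = sum(m == t for mr, tr in zip(mask, template) for m, t in zip(mr, tr))
    return same / total


def verify(
    hypotheses: List[Hypothesis],
    segment: Callable[[Box], Mask],  # hypothetical local figure-ground segmenter
    template: Mask,                  # stand-in for segmentations learned from true positives
    detect_threshold: float,         # set low so that few true objects are missed
    seg_threshold: float,
    pad: int = 1,
) -> List[Hypothesis]:
    """Keep only the detector hypotheses whose local segmentation looks like a true positive."""
    kept = []
    for hyp in hypotheses:
        if hyp.score < detect_threshold:
            continue  # hypothesize: permissive detector threshold
        mask = segment(expand(hyp.box, pad))
        if mask_agreement(mask, template) >= seg_threshold:
            kept.append(hyp)  # test: segmentation is consistent, keep it
    return kept
```

In this sketch the detector threshold controls how many candidates reach the segmentation test, while `seg_threshold` decides which of them are pruned.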
This publication has 11 references indexed in Scilit:
- OBJCUT: Efficient Segmentation Using Top-Down and Bottom-Up Cues. Published by Institute of Electrical and Electronics Engineers (IEEE), 2009
- LabelMe: A Database and Web-Based Tool for Image Annotation. International Journal of Computer Vision, 2007
- OBJCUT for Face Detection. Published by Springer Nature, 2006
- Rapid object detection using a boosted cascade of simple features. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- Histograms of Oriented Gradients for Human Detection. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- Learning-Based Computer Vision with Intel’s Open Source Computer Vision Library. Intel Technology Journal, 2005
- LOCUS: learning object classes with unsupervised segmentation. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- Interleaved Object Categorization and Segmentation. Published by British Machine Vision Association and Society for Pattern Recognition, 2003
- Fast approximate energy minimization via graph cuts. Published by Institute of Electrical and Electronics Engineers (IEEE), 1999
- On the verification of hypothesized matches in model-based recognition. Published by Institute of Electrical and Electronics Engineers (IEEE), 1991