Identifying Semantically Equivalent Object Fragments
- 27 July 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1 (10636919) , 2-9
- https://doi.org/10.1109/cvpr.2005.180
Abstract
We describe a novel technique for identifying semantically equivalent parts in images belonging to the same object class, (e.g. eyes, license plates, aircraft wings etc.). The visual appearance of such object parts can differ substantially, and therefore, traditional image similarity-based methods are inappropriate for this task. The technique we propose is based on the use of common context. We first retrieve context fragments, which consistently appear together with a given input fragment in a stable geometric relation. We then use the context fragments in new images to infer the most likely position of equivalent parts. Given a set of image examples of objects in a class, the method can automatically learn the part structure of the domain - identify the main parts, and how their appearance changes across objects in the class. Two applications of the proposed algorithm are shown: the detection and identification of object parts and object recognition.Keywords
This publication has 9 references indexed in Scilit:
- Distinctive Image Features from Scale-Invariant KeypointsInternational Journal of Computer Vision, 2004
- Scale & Affine Invariant Interest Point DetectorsInternational Journal of Computer Vision, 2004
- Robust wide-baseline stereo from maximally stable extremal regionsImage and Vision Computing, 2004
- Object class recognition by unsupervised scale-invariant learningPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- A performance evaluation of local descriptorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Interleaved Object Categorization and SegmentationPublished by British Machine Vision Association and Society for Pattern Recognition ,2003
- Contextual Priming for Object DetectionInternational Journal of Computer Vision, 2003
- Visual features of intermediate complexity and their use in classificationNature Neuroscience, 2002
- Wide Baseline Stereo Matching based on Local, Affinely Invariant RegionsPublished by British Machine Vision Association and Society for Pattern Recognition ,2000