Object retrieval with large vocabularies and fast spatial matching
- 1 June 2007
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 10636919, p. 1-8
- https://doi.org/10.1109/cvpr.2007.383172
Abstract
In this paper, we present a large-scale object retrieval system. The user supplies a query object by selecting a region of a query image, and the system returns a ranked list of images that contain the same object, retrieved from a large corpus. We demonstrate the scalability and performance of our system on a dataset of over 1 million images crawled from the photo-sharing site, Flickr [3], using Oxford landmarks as queries. Building an image-feature vocabulary is a major time and performance bottleneck, due to the size of our dataset. To address this problem we compare different scalable methods for building a vocabulary and introduce a novel quantization method based on randomized trees which we show outperforms the current state-of-the-art on an extensive ground-truth. Our experiments show that the quantization has a major effect on retrieval quality. To further improve query performance, we add an efficient spatial verification stage to re-rank the results returned from our bag-of-words model and show that this consistently improves search quality, though by less of a margin when the visual vocabulary is large. We view this work as a promising step towards much larger, "web-scale" image corpora.
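The bag-of-words ranking stage mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: brute-force nearest-centroid assignment stands in for the randomized-tree quantizer, the spatial verification stage is omitted, and the tf-idf weighting with cosine scoring is an assumed, commonly used choice. All function names and the toy 2-D "descriptors" are hypothetical.

```python
import numpy as np

def quantize(descriptors, vocabulary):
    """Assign each descriptor to its nearest visual word (brute force)."""
    # Squared Euclidean distances, shape (n_descriptors, n_words).
    d2 = ((descriptors[:, None, :] - vocabulary[None, :, :]) ** 2).sum(axis=2)
    return d2.argmin(axis=1)

def bow_histogram(word_ids, n_words):
    """Count how often each visual word occurs in one image."""
    return np.bincount(word_ids, minlength=n_words).astype(float)

def tfidf_rows(histograms, idf):
    """Apply tf-idf weighting and L2-normalise each row."""
    tf = histograms / np.maximum(histograms.sum(axis=1, keepdims=True), 1.0)
    weighted = tf * idf
    norms = np.linalg.norm(weighted, axis=1, keepdims=True)
    return weighted / np.maximum(norms, 1e-12)

# Toy vocabulary of four visual words (assumed; real vocabularies are far larger).
vocabulary = np.array([[0, 0], [10, 0], [0, 10], [10, 10]], dtype=float)
n_words = len(vocabulary)

# Toy corpus: each image is a small set of 2-D descriptors.
corpus = [
    np.array([[0.1, 0.2], [0.3, -0.1], [9.8, 0.1]]),
    np.array([[0.2, 9.9], [9.7, 10.2], [10.1, 9.9]]),
    np.array([[9.9, 0.3], [0.1, 10.1], [10.2, -0.2]]),
]
histograms = np.stack(
    [bow_histogram(quantize(d, vocabulary), n_words) for d in corpus]
)

# Inverse document frequency: words seen in fewer images weigh more.
doc_freq = (histograms > 0).sum(axis=0)
idf = np.log(len(corpus) / np.maximum(doc_freq, 1))
corpus_vectors = tfidf_rows(histograms, idf)

# Query region: descriptors that quantize to the same words as image 1.
query = np.array([[10.0, 9.8], [9.9, 10.1], [0.1, 9.7]])
query_hist = bow_histogram(quantize(query, vocabulary), n_words)[None, :]
query_vector = tfidf_rows(query_hist, idf)[0]

# Cosine similarity against every corpus image, best match first.
scores = corpus_vectors @ query_vector
ranking = np.argsort(-scores)
print(ranking, scores)
```

In this toy setup the query shares its word histogram with image 1, so that image ranks first; a spatial verification step would then re-rank only the top of such a list.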
This publication has 13 references indexed in Scilit:
- Scalable Recognition with a Vocabulary Tree. Published by Institute of Electrical and Electronics Engineers (IEEE), 2006
- Multiple Object Class Detection with a Generative Model. Published by Institute of Electrical and Electronics Engineers (IEEE), 2006
- Randomized Trees for Real-Time Keypoint Recognition. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- Sub-linear Indexing for Large Scale Object Recognition. Published by British Machine Vision Association and Society for Pattern Recognition, 2005
- Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 2004
- Scale & Affine Invariant Interest Point Detectors. International Journal of Computer Vision, 2004
- Multiple View Geometry in Computer Vision. Published by Cambridge University Press (CUP), 2004
- Video Google: a text retrieval approach to object matching in videos. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- Vector Quantization and Signal Compression. Published by Springer Nature, 1992
- Random sample consensus. Communications of the ACM, 1981