Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval
- 1 January 2007
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 15505499, p. 1-8
- https://doi.org/10.1109/iccv.2007.4408891
Abstract
Given a query image of an object, our objective is to retrieve all instances of that object in a large (1M+) image database. We adopt the bag-of-visual-words architecture which has proven successful in achieving high precision at low recall. Unfortunately, feature detection and quantization are noisy processes and this can result in variation in the particular visual words that appear in different images of the same object, leading to missed results. In the text retrieval literature a standard method for improving performance is query expansion. A number of the highly ranked documents from the original query are reissued as a new query. In this way, additional relevant terms can be added to the query. This is a form of blind relevance feedback and it can fail if 'outlier' (false positive) documents are included in the reissued query. In this paper we bring query expansion into the visual domain via two novel contributions. Firstly, strong spatial constraints between the query image and each result allow us to accurately verify each return, suppressing the false positives which typically ruin text-based query expansion. Secondly, the verified images can be used to learn a latent feature model to enable the controlled construction of expanded queries. We illustrate these ideas on the 5000 annotated image Oxford building database together with more than 1M Flickr images. We show that the precision is substantially boosted, achieving total recall in many cases.
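The query expansion loop described in the abstract can be illustrated with a small sketch. It is not the paper's method: it assumes images are plain bag-of-visual-words count vectors ranked by cosine similarity, it builds the expanded query by simple averaging rather than by the learned latent feature model, and it replaces spatial verification with a hypothetical `is_verified` callback. All names and the toy data are made up for illustration.

```python
import numpy as np

def l2_normalize(m):
    """Scale vectors (rows, or a single 1-D vector) to unit length."""
    norms = np.linalg.norm(m, axis=-1, keepdims=True)
    return m / np.where(norms == 0, 1.0, norms)

def rank(query, database):
    """Return database indices sorted by decreasing cosine similarity."""
    scores = l2_normalize(database) @ l2_normalize(query)
    return np.argsort(-scores, kind="stable")

def expand_query(query, database, is_verified, top_k=2):
    """Average the query with the verified images among the top_k results.

    is_verified(i) is a hypothetical stand-in for the spatial check that
    the abstract uses to reject false positives before expansion.
    """
    verified = [int(i) for i in rank(query, database)[:top_k] if is_verified(int(i))]
    if not verified:
        return query  # nothing trustworthy to add, keep the original query
    return np.mean(np.vstack([query, database[verified]]), axis=0)

# Toy vocabulary of 6 visual words (assumed data).
database = np.array([
    [1, 1, 2, 0, 0, 0],  # 0: same object, shares words 0 and 1 with the query
    [0, 0, 1, 1, 0, 0],  # 1: same object, shares no word with the query
    [0, 1, 0, 0, 1, 1],  # 2: distractor that happens to share word 1
], dtype=float)
query = np.array([1, 1, 0, 0, 0, 0], dtype=float)

# Pretend spatial verification accepts only true instances of the object.
same_object = {0, 1}

initial_ranking = rank(query, database).tolist()
expanded = expand_query(query, database, lambda i: i in same_object)
expanded_ranking = rank(expanded, database).tolist()

print(initial_ranking)   # [0, 2, 1]: image 1 is missed, the distractor ranks second
print(expanded_ranking)  # [0, 1, 2]: word 2, learned from image 0, recovers image 1
```

In this toy case the distractor appears in the initial top two but fails the verification stub, so its words never enter the expanded query. That is the failure mode of blind relevance feedback that the abstract's verification step is meant to avoid.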