Small codes and large image databases for recognition

1 June 2008

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 10636919, p. 1-8
https://doi.org/10.1109/cvpr.2008.4587633

Abstract

The Internet contains billions of images, freely available online. Methods for efficiently searching this incredibly rich resource are vital for a large number of applications. These include object recognition, computer graphics, personal photo collections, and online image search tools. In this paper, our goal is to develop efficient image search and scene matching techniques that are not only fast, but also require very little memory, enabling their use on standard hardware or even on handheld devices. Our approach uses recently developed machine learning techniques to convert the Gist descriptor (a real-valued vector that describes orientation energies at different scales and orientations within an image) to a compact binary code, with a few hundred bits per image. Using our scheme, it is possible to perform real-time searches with millions of images from the Internet using a single large PC and obtain recognition results comparable to the full descriptor. Using our codes on high-quality labeled images from the LabelMe database gives surprisingly powerful recognition results using simple nearest neighbor techniques.
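
The idea of searching with compact binary codes can be illustrated with a minimal Python sketch. It is not the authors' method: the paper learns its codes with machine learning techniques, whereas this sketch substitutes random-projection sign bits as a simple stand-in. The function names (make_projections, encode, hamming, nearest) and the descriptor dimension are hypothetical, chosen only for illustration. What the sketch does show is why such codes are cheap to search: each image becomes one small integer, and comparing two images is an XOR followed by a bit count.

```python
import random

def make_projections(dim, n_bits, seed=0):
    """Draw one random Gaussian direction per output bit (assumed stand-in
    for a learned encoder; not the scheme used in the paper)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]

def encode(vec, projections):
    """Map a real-valued descriptor to an integer holding one bit per projection."""
    code = 0
    for row in projections:
        dot = sum(w * x for w, x in zip(row, vec))
        code = (code << 1) | (1 if dot >= 0 else 0)
    return code

def hamming(a, b):
    """Number of bit positions in which two codes differ."""
    return bin(a ^ b).count("1")

def nearest(query_code, codes, k=1):
    """Indices of the k stored codes closest to the query in Hamming distance."""
    order = sorted(range(len(codes)), key=lambda i: hamming(query_code, codes[i]))
    return order[:k]

# Toy usage: 100 random 16-dimensional "descriptors" hashed to 32-bit codes.
rng = random.Random(1)
projections = make_projections(dim=16, n_bits=32)
database = [[rng.uniform(-1.0, 1.0) for _ in range(16)] for _ in range(100)]
codes = [encode(v, projections) for v in database]
matches = nearest(encode(database[7], projections), codes, k=3)
```

A linear scan like nearest is enough for a toy database; at the scale of millions of images, the same Hamming comparison would typically be done with packed machine words and hardware bit counting.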

This publication has 20 references indexed in Scilit:

Semantic hashing
International Journal of Approximate Reasoning, 2009
LabelMe: A Database and Web-Based Tool for Image Annotation
International Journal of Computer Vision, 2007
Scene completion using millions of photographs
Published by Association for Computing Machinery (ACM), 2007
Optimizing signal and image processing applications using Intel libraries
Published by SPIE-Intl Soc Optical Eng, 2007
Reducing the Dimensionality of Data with Neural Networks
Science, 2006
Scalable Recognition with a Vocabulary Tree
Published by Institute of Electrical and Electronics Engineers (IEEE), 2006
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories
Published by Institute of Electrical and Electronics Engineers (IEEE), 2006
Sub-linear Indexing for Large Scale Object Recognition
Published by British Machine Vision Association and Society for Pattern Recognition, 2005
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision, 2004
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope
International Journal of Computer Vision, 2001