Large-scale image classification: Fast feature extraction and SVM training

1 June 2011

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 10636919, p. 1689-1696
https://doi.org/10.1109/cvpr.2011.5995477

Abstract

Most research on image classification has so far focused on medium-scale datasets, which are often defined as datasets that fit into the memory of a desktop machine (typically 4 GB to 48 GB). There are two main reasons for the limited effort on large-scale image classification. First, until the emergence of the ImageNet dataset, there was almost no publicly available large-scale benchmark data for image classification, mostly because class labels are expensive to obtain. Second, large-scale classification is hard because it poses more challenges than its medium-scale counterpart. A key challenge is how to achieve efficiency in both feature extraction and classifier training without compromising performance. This paper shows how we address this challenge, using the ImageNet dataset as an example. For feature extraction, we develop a Hadoop scheme that performs feature extraction in parallel using hundreds of mappers. This allows us to extract fairly sophisticated features (with dimensionality in the hundreds of thousands) on 1.2 million images within one day. For SVM training, we develop a parallel averaging stochastic gradient descent (ASGD) algorithm for training one-against-all 1000-class SVM classifiers. The ASGD algorithm is capable of dealing with terabytes of training data and converges very fast; typically, 5 epochs are sufficient. As a result, we achieve state-of-the-art performance on ImageNet 1000-class classification: 52.9% classification accuracy and a 71.8% top-5 hit rate.
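
The abstract gives no implementation details for ASGD, so the following is only a minimal single-machine sketch of the general idea: one-against-all linear SVMs trained by stochastic gradient descent on the hinge loss, with the returned weights being the running average of the SGD iterates. The function names (`train_asgd_ova`, `top_k_hit_rate`), the step-size schedule, the regularization constant, and the absence of a bias term are all assumptions made for illustration; the paper's parallel algorithm and its hyperparameters are not reproduced here.

```python
import numpy as np


def train_asgd_ova(X, y, n_classes, epochs=5, lam=1e-4, eta0=0.1, seed=0):
    """Averaged SGD for one-against-all linear SVMs (illustrative sketch).

    X: (n, d) feature matrix, y: (n,) integer labels in [0, n_classes).
    Returns the averaged weight matrix of shape (n_classes, d).
    All hyperparameter defaults are assumptions, not values from the paper.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W = np.zeros((n_classes, d))   # current SGD iterate, one row per class
    W_avg = np.zeros_like(W)       # running average of all iterates so far
    t = 0
    for _ in range(epochs):
        for i in rng.permutation(n):
            t += 1
            # Decaying step size (an assumed schedule).
            eta = eta0 / (1.0 + lam * eta0 * t) ** 0.75
            x = X[i]
            # +1 for the true class, -1 for every other class.
            targets = -np.ones(n_classes)
            targets[y[i]] = 1.0
            # Classes whose hinge loss max(0, 1 - target * score) is nonzero.
            active = targets * (W @ x) < 1.0
            # Gradient step on the L2-regularized hinge loss.
            W *= 1.0 - eta * lam
            W[active] += eta * targets[active][:, None] * x
            # Incremental update of the average of the iterates.
            W_avg += (W - W_avg) / t
    return W_avg


def top_k_hit_rate(W, X, y, k=5):
    """Fraction of samples whose true label is among the k highest scores."""
    scores = X @ W.T
    top_k = np.argsort(-scores, axis=1)[:, :k]
    return float(np.mean((top_k == y[:, None]).any(axis=1)))
```

Calling `top_k_hit_rate(W, X, y, k=1)` gives plain classification accuracy and `k=5` gives the top-5 hit rate, the two metrics quoted in the abstract. Because each class row is updated independently of the others, the per-class work could in principle be split across workers, though how the paper parallelizes training is not stated in the abstract.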
