Scene Classification Using a Hybrid Generative/Discriminative Approach
Open Access
- 4 April 2008
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Pattern Analysis and Machine Intelligence
- Vol. 30 (4) , 712-727
- https://doi.org/10.1109/tpami.2007.70716
Abstract
We investigate whether dimensionality reduction using a latent generative model is beneficial for the task of weakly supervised scene classification. In detail, we are given a set of labeled images of scenes (for example, coast, forest, city, river, etc.), and our objective is to classify a new image into one of these categories. Our approach consists of first discovering latent "topics" using probabilistic Latent Semantic Analysis (pLSA), a generative model from the statistical text literature here applied to a bag of visual words representation for each image, and subsequently, training a multiway classifier on the topic distribution vector for each image. We compare this approach to that of representing each image by a bag of visual words vector directly and training a multiway classifier on these vectors. To this end, we introduce a novel vocabulary using dense color SIFT descriptors and then investigate the classification performance under changes in the size of the visual vocabulary, the number of latent topics learned, and the type of discriminative classifier used (k-nearest neighbor or SVM). We achieve superior classification performance to recent publications that have used a bag of visual words representation, in all cases, using the authors' own data sets and testing protocols. We also investigate the gain in adding spatial information. We show applications to image retrieval with relevance feedback and to scene classification in videos.
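The two-stage pipeline described in the abstract (pLSA topic discovery on bag of visual words histograms, then a discriminative classifier on the per-image topic vectors) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes synthetic word-count histograms in place of quantized dense color SIFT descriptors, a plain EM fit of pLSA, a "fold-in" step (holding P(w|z) fixed) to obtain topic vectors for unseen images, and a 1-nearest-neighbor classifier. The names `plsa`, `fold_in`, `knn_predict`, and `make_images`, and all parameter values, are hypothetical.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero


def _e_step(counts, p_z_d, p_w_z):
    # Posterior P(z|d,w) is proportional to P(z|d) * P(w|z).
    joint = p_z_d[:, :, None] * p_w_z[None, :, :]          # (docs, topics, words)
    post = joint / (joint.sum(axis=1, keepdims=True) + EPS)
    return counts[:, None, :] * post                        # counts weighted by posterior


def plsa(counts, n_topics, n_iter=50, seed=0):
    """Fit pLSA by EM; returns P(z|d) (docs x topics) and P(w|z) (topics x words)."""
    rng = np.random.default_rng(seed)
    n_docs, n_words = counts.shape
    p_z_d = rng.random((n_docs, n_topics))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)
    p_w_z = rng.random((n_topics, n_words))
    p_w_z /= p_w_z.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        weighted = _e_step(counts, p_z_d, p_w_z)
        p_w_z = weighted.sum(axis=0)
        p_w_z /= p_w_z.sum(axis=1, keepdims=True) + EPS
        p_z_d = weighted.sum(axis=2)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + EPS
    return p_z_d, p_w_z


def fold_in(counts, p_w_z, n_iter=50, seed=1):
    """Estimate P(z|d) for new images with P(w|z) held fixed (assumed fold-in step)."""
    rng = np.random.default_rng(seed)
    p_z_d = rng.random((counts.shape[0], p_w_z.shape[0]))
    p_z_d /= p_z_d.sum(axis=1, keepdims=True)
    for _ in range(n_iter):
        p_z_d = _e_step(counts, p_z_d, p_w_z).sum(axis=2)
        p_z_d /= p_z_d.sum(axis=1, keepdims=True) + EPS
    return p_z_d


def knn_predict(train_x, train_y, test_x, k=1):
    """k-nearest-neighbor majority vote using squared Euclidean distance."""
    dists = ((test_x[:, None, :] - train_x[None, :, :]) ** 2).sum(axis=-1)
    nearest = np.argsort(dists, axis=1)[:, :k]
    return np.array([np.bincount(train_y[idx]).argmax() for idx in nearest])


# Synthetic stand-in for visual-word histograms: two scene classes, each
# favoring a different half of a 20-word vocabulary (purely illustrative).
rng = np.random.default_rng(0)
n_words = 20
word_dist = np.full((2, n_words), 0.01)
word_dist[0, :10] = 0.09
word_dist[1, 10:] = 0.09
word_dist /= word_dist.sum(axis=1, keepdims=True)


def make_images(n_per_class):
    labels = np.repeat([0, 1], n_per_class)
    counts = np.stack([rng.multinomial(200, word_dist[c]) for c in labels])
    return counts.astype(float), labels


train_counts, train_labels = make_images(20)
test_counts, test_labels = make_images(5)

# Stage 1: learn topics on training images, then fold in the test images.
train_topics, p_w_z = plsa(train_counts, n_topics=2)
test_topics = fold_in(test_counts, p_w_z)

# Stage 2: classify each test image from its topic distribution vector.
pred = knn_predict(train_topics, train_labels, test_topics, k=1)
print("accuracy:", (pred == test_labels).mean())
```

The baseline the abstract compares against corresponds to calling `knn_predict` directly on the normalized count histograms instead of on the topic vectors. Swapping the nearest-neighbor step for an SVM would cover the other classifier the abstract mentions.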