Decomposition, discovery and detection of visual categories using topic models

1 June 2008

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 10636919,p. 1-8
https://doi.org/10.1109/cvpr.2008.4587803

Abstract

We present a novel method for the discovery and detection of visual object categories based on decompositions using topic models. The approach is capable of learning a compact and low dimensional representation for multiple visual categories from multiple view points without labeling of the training instances. The learnt object components range from local structures over line segments to global silhouette-like descriptions. This representation can be used to discover object categories in a totally unsupervised fashion. Furthermore we employ the representation as the basis for building a supervised multi-category detection system making efficient use of training examples and out-performing pure features-based representations. The proposed speed-ups make the system scale to large databases. Experiments on three databases show that the approach improves the state-of-the-art in unsupervised learning as well as supervised detection. In particular we improve the state-of-the-art on the challenging PASCALpsila06 multi-class detection tasks for several categories.

Keywords

This publication has 19 references indexed in Scilit:

Multiclass Object Recognition with Sparse, Localized Features
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2006
Multiple Object Class Detection with a Generative Model
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2006
Improvements of Object Detection Using Boosted Histograms
Published by British Machine Vision Association and Society for Pattern Recognition ,2006
Latent Mixture Vocabularies for Object Categorization
Published by British Machine Vision Association and Society for Pattern Recognition ,2006
Modeling scenes with local descriptors and latent aspects
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Pedestrian Detection in Crowded Scenes
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision, 2004
Finding scientific topics
Proceedings of the National Academy of Sciences, 2004
Training support vector machines: an application to face detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Unsupervised Learning by Probabilistic Latent Semantic Analysis
Machine Learning, 2001