Decomposition, discovery and detection of visual categories using topic models
- 1 June 2008
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 10636919, p. 1-8
- https://doi.org/10.1109/cvpr.2008.4587803
Abstract
We present a novel method for the discovery and detection of visual object categories based on decompositions using topic models. The approach is capable of learning a compact and low-dimensional representation for multiple visual categories from multiple viewpoints without labeling of the training instances. The learnt object components range from local structures over line segments to global silhouette-like descriptions. This representation can be used to discover object categories in a totally unsupervised fashion. Furthermore, we employ the representation as the basis for building a supervised multi-category detection system, making efficient use of training examples and outperforming pure feature-based representations. The proposed speed-ups make the system scale to large databases. Experiments on three databases show that the approach improves the state of the art in unsupervised learning as well as supervised detection. In particular, we improve the state of the art on the challenging PASCAL'06 multi-class detection tasks for several categories.
This publication has 19 references indexed in Scilit:
- Multiclass Object Recognition with Sparse, Localized Features. Published by Institute of Electrical and Electronics Engineers (IEEE), 2006
- Multiple Object Class Detection with a Generative Model. Published by Institute of Electrical and Electronics Engineers (IEEE), 2006
- Improvements of Object Detection Using Boosted Histograms. Published by British Machine Vision Association and Society for Pattern Recognition, 2006
- Latent Mixture Vocabularies for Object Categorization. Published by British Machine Vision Association and Society for Pattern Recognition, 2006
- Modeling scenes with local descriptors and latent aspects. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- Pedestrian Detection in Crowded Scenes. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- Distinctive Image Features from Scale-Invariant Keypoints. International Journal of Computer Vision, 2004
- Finding scientific topics. Proceedings of the National Academy of Sciences, 2004
- Training support vector machines: an application to face detection. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- Unsupervised Learning by Probabilistic Latent Semantic Analysis. Machine Learning, 2001