TagProp: Discriminative metric learning in nearest neighbor models for image auto-annotation
Top Cited Papers
- 1 September 2009
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 15505499,p. 309-316
- https://doi.org/10.1109/iccv.2009.5459266
Abstract
Image auto-annotation is an important open problem in computer vision. For this task we propose TagProp, a discriminatively trained nearest neighbor model. Tags of test images are predicted using a weighted nearest-neighbor model to exploit labeled training images. Neighbor weights are based on neighbor rank or distance. TagProp allows the integration of metric learning by directly maximizing the log-likelihood of the tag predictions in the training set. In this manner, we can optimally combine a collection of image similarity metrics that cover different aspects of image content, such as local shape descriptors, or global color histograms. We also introduce a word specific sigmoidal modulation of the weighted neighbor tag predictions to boost the recall of rare words. We investigate the performance of different variants of our model and compare to existing work. We present experimental results for three challenging data sets. On all three, TagProp makes a marked improvement as compared to the current state-of-the-art.Keywords
This publication has 14 references indexed in Scilit:
- Is that you? Metric learning approaches for face identificationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Image annotation via graph learningPattern Recognition, 2009
- A Discriminative Kernel-Based Approach to Rank Images from Text QueriesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- Real-Time Computerized Annotation of PicturesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- Supervised Learning of Semantic Classes for Image Annotation and RetrievalIEEE Transactions on Pattern Analysis and Machine Intelligence, 2007
- SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category RecognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene CategoriesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Learning distance functions for image retrievalPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Multiple Bernoulli relevance models for image and video annotationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Automatic image annotation and retrieval using cross-media relevance modelsPublished by Association for Computing Machinery (ACM) ,2003