A Discriminative Kernel-Based Approach to Rank Images from Text Queries
- 20 June 2008
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 30 (8) , 1371-1384
- https://doi.org/10.1109/tpami.2007.70791
Abstract
This paper introduces a discriminative model for the retrieval of images from text queries. Our approach formalizes the retrieval task as a ranking problem, and introduces a learning procedure optimizing a criterion related to the ranking performance. The proposed model hence addresses the retrieval problem directly and does not rely on an intermediate image annotation task, which contrasts with previous research. Moreover, our learning procedure builds upon recent work on the online learning of kernel-based classifiers. This yields an efficient, scalable algorithm, which can benefit from recent kernels developed for image comparison. The experiments performed over stock photography data show the advantage of our discriminative ranking approach over state-of-the-art alternatives (e.g., our model yields 26.3% average precision over the Corel dataset, which should be compared to 22.0% for the best alternative model evaluated). Further analysis of the results shows that our model is especially advantageous over difficult queries such as queries with few relevant pictures or multiple-word queries.
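The abstract describes retrieval as a ranking problem trained with an online procedure, but gives no formulas. As a rough illustration only, the sketch below shows one generic way such an idea can be set up: a pairwise hinge loss over (query, relevant image, non-relevant image) triplets with a passive-aggressive style update. The bilinear scoring function, the plain vector representations, the linear kernel, and the names `score`, `pa_rank_update`, and `c` are all assumptions made for this example; they are not taken from the abstract and should not be read as the paper's exact method.

```python
def score(w, query, image):
    """Assumed bilinear score F(q, p) = q^T W p, with W stored as a list of rows."""
    return sum(q_i * sum(w_ij * p_j for w_ij, p_j in zip(row, image))
               for q_i, row in zip(query, w))


def pa_rank_update(w, query, pos, neg, c=1.0):
    """One passive-aggressive style step on a (query, relevant, non-relevant) triplet.

    Updates W in place and returns the hinge loss measured before the update.
    `c` is a hypothetical aggressiveness cap on the step size.
    """
    # Pairwise hinge loss: the relevant image should outscore the other by a margin of 1.
    loss = max(0.0, 1.0 - score(w, query, pos) + score(w, query, neg))
    if loss == 0.0:
        return loss  # margin already satisfied, leave W unchanged
    diff = [p - n for p, n in zip(pos, neg)]
    # Update direction V = q (p+ - p-)^T; its squared Frobenius norm factorizes.
    norm_sq = sum(q * q for q in query) * sum(d * d for d in diff)
    if norm_sq == 0.0:
        return loss  # degenerate triplet, no usable direction
    tau = min(c, loss / norm_sq)
    for i, q_i in enumerate(query):
        for j, d_j in enumerate(diff):
            w[i][j] += tau * q_i * d_j
    return loss


# Toy usage with made-up 2-d query and 3-d image vectors.
w = [[0.0] * 3 for _ in range(2)]
query, pos, neg = [1.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]
pa_rank_update(w, query, pos, neg)
print(score(w, query, pos), score(w, query, neg))  # 0.5 -0.5
```

After a single step on this toy triplet the relevant image outscores the non-relevant one by exactly the margin, so a second call on the same triplet returns a loss of zero and changes nothing. Replacing the inner products with a kernel between images would be one way to move from this linear sketch toward a kernel-based variant.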