Real-Time Computerized Annotation of Pictures
Top Cited Papers
- 2 February 2008
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 30 (6) , 985-1002
- https://doi.org/10.1109/tpami.2007.70847
Abstract
Developing effective methods for automated annotation of digital pictures continues to challenge computer scientists. The capability of annotating pictures by computers can lead to breakthroughs in a wide range of applications, including Web image search, online picture-sharing communities, and scientific experiments. In this work, the authors developed new optimization and estimation techniques to address two fundamental problems in machine learning. These new techniques serve as the basis for the automatic linguistic indexing of pictures - real time (ALIPR) system of fully automatic and high-speed annotation for online pictures. In particular, the D2-clustering method, in the same spirit as K-Means for vectors, is developed to group objects represented by bags of weighted vectors. Moreover, a generalized mixture modeling technique (kernel smoothing as a special case) for nonvector data is developed using the novel concept of hypothetical local mapping (HLM). ALIPR has been tested by thousands of pictures from an Internet photo-sharing site, unrelated to the source of those pictures used in the training process. Its performance has also been studied at an online demonstration site, where arbitrary users provide pictures of their choices and indicate the correctness of each annotation word. The experimental results show that a single computer processor can suggest annotation terms in real time and with good accuracy.Keywords
This publication has 30 references indexed in Scilit:
- Image retrievalACM Computing Surveys, 2008
- Learning theory: Past performance and future resultsNature, 2004
- Automatic linguistic indexing of pictures by a statistical modeling approachPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- An active learning framework for content-based information retrievalIEEE Transactions on Multimedia, 2002
- SIMPLIcity: semantics-sensitive integrated matching for picture librariesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2001
- Content-based image retrieval at the end of the early yearsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2000
- 10.1162/153244303322533214Applied Physics Letters, 2000
- Relevance feedback: a power tool for interactive content-based image retrievalIEEE Transactions on Circuits and Systems for Video Technology, 1998
- Image Representations for Visual LearningScience, 1996
- A Note on Asymptotic Joint NormalityThe Annals of Mathematical Statistics, 1972