Scalable near identical image and shot detection
Top Cited Papers
- 9 July 2007
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 549-556
- https://doi.org/10.1145/1282280.1282359
Abstract
This paper proposes and compares two novel schemes for near duplicate image and video-shot detection. The first approach is based on global hierarchical colour histograms, using Locality Sensitive Hashing for fast retrieval. The second approach uses local feature descriptors (SIFT) and for retrieval exploits techniques used in the information retrieval community to compute approximate set intersections between documents using a min-Hash algorithm. The requirements for near-duplicate images vary according to the application, and we address two types of near duplicate definition: (i) being perceptually identical (e.g. up to noise, discretization effects, small photometric distortions etc); and (ii) being images of the same 3D scene (so allowing for viewpoint changes and partial occlusion). We define two shots to be near-duplicates if they share a large percentage of near-duplicate frames. We focus primarily on scalability to very large image and video databases, where fast query processing is necessary. Both methods are designed so that only a small amount of data need be stored for each image. In the case of near-duplicate shot detection it is shown that a weak approximation to histogram matching, consuming substantially less storage, is sufficient for good results. We demonstrate our methods on the TRECVID 2006 data set which contains approximately 165 hours of video (about 17.8M frames with 146K key frames), and also on feature films and pop videos.Keywords
This publication has 15 references indexed in Scilit:
- Finding near-duplicate web pagesPublished by Association for Computing Machinery (ACM) ,2006
- Video Mining with Frequent Itemset ConfigurationsPublished by Springer Nature ,2006
- Video Clip Matching Using MPEG-7 Descriptors and Edit DistancePublished by Springer Nature ,2006
- Automatic identification of digital video based on shot-level sequence matchingPublished by Association for Computing Machinery (ACM) ,2005
- A Comparison of Affine Region DetectorsInternational Journal of Computer Vision, 2005
- Distinctive Image Features from Scale-Invariant KeypointsInternational Journal of Computer Vision, 2004
- Automated location matching in moviesComputer Vision and Image Understanding, 2003
- A performance evaluation of local descriptorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Fast video matching with signature alignmentPublished by Association for Computing Machinery (ACM) ,2003
- Color invariancePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2001