Scalable near identical image and shot detection

Abstract
This paper proposes and compares two novel schemes for near-duplicate image and video-shot detection. The first approach is based on global hierarchical colour histograms, using Locality Sensitive Hashing (LSH) for fast retrieval. The second approach uses local feature descriptors (SIFT) and, for retrieval, exploits techniques from the information retrieval community to compute approximate set intersections between documents using a min-Hash algorithm. The requirements for near-duplicate images vary according to the application, and we address two definitions of near duplicates: (i) images that are perceptually identical (e.g. up to noise, discretization effects, small photometric distortions, etc.); and (ii) images of the same 3D scene (thus allowing for viewpoint changes and partial occlusion). We define two shots to be near-duplicates if they share a large percentage of near-duplicate frames. We focus primarily on scalability to very large image and video databases, where fast query processing is necessary. Both methods are designed so that only a small amount of data need be stored for each image. In the case of near-duplicate shot detection, we show that a weak approximation to histogram matching, consuming substantially less storage, is sufficient for good results. We demonstrate our methods on the TRECVID 2006 data set, which contains approximately 165 hours of video (about 17.8M frames with 146K key frames), and also on feature films and pop videos.
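
To give a rough sense of the min-Hash idea the abstract refers to (estimating the set intersection, i.e. Jaccard similarity, of two frames from short signatures rather than from the full sets), the following is a minimal Python sketch. The frame sets, seed values, and function names here are illustrative assumptions and do not reproduce the authors' actual implementation or parameters.

```python
import random

def minhash_signature(elements, hash_seeds):
    """For each seeded hash function, keep the minimum hash value over the set."""
    return [min(hash((seed, e)) for e in elements) for seed in hash_seeds]

def estimate_jaccard(sig_a, sig_b):
    """The fraction of agreeing min-hash values estimates the Jaccard similarity."""
    matches = sum(1 for a, b in zip(sig_a, sig_b) if a == b)
    return matches / len(sig_a)

# Hypothetical example: two frames represented as sets of quantized
# SIFT descriptors ("visual word" IDs).
random.seed(0)
seeds = [random.randrange(2**32) for _ in range(64)]
frame_a = {3, 17, 42, 101, 256, 999}
frame_b = {3, 17, 42, 101, 300, 777}

# True Jaccard similarity is 4/8 = 0.5; the estimate should be close to that.
print(estimate_jaccard(minhash_signature(frame_a, seeds),
                       minhash_signature(frame_b, seeds)))
```

Because only the fixed-length signatures need to be stored and compared, this style of sketch is what makes such retrieval feasible at the scale of millions of frames.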
