Abstract
The increase in the number of video copies, both legal and illegal, has become a major problem in the multimedia and Internet era. In this article, we propose a novel method for detecting various video copies in a video sequence. To achieve fast and robust detection, the method fully integrates several components, namely the min-hashing signature to compactly represent a video sequence, a spatio-temporal matching scheme to accurately evaluate video similarity compiled from the spatial and temporal aspects, and some speedup techniques to expedite both min-hashing indexing and spatio-temporal matching. The results of experiments demonstrate that, compared to several baseline methods with different feature descriptors and matching schemes, the proposed method which combines both global and local feature descriptors yields the best performance when encountering a variety of video transformations. The method is very fast, requiring approximately 0.06 seconds to search for copies of a thirty-second video clip in a six-hour video sequence.
- Andoni, A. and Indyk, P. 2008. Near-optimal hashing algorithms for approximate nearest neighbor in high dimension. Comm. ACM 51, 1, 117--122. Google Scholar
Digital Library
- Buhler, J. 2001. Efficient large-scale sequence comparison by locality-sensitive hashing. Bioinform. 17, 5, 419--218.Google Scholar
Cross Ref
- Chang, S. F., Chen, W., Meng, H. J., Sundaram, H., and Zhong, D. 1998. A fully automated content-based video search engine supporting spatiotemporal queries. IEEE Trans. Circuits Syst. Video Technol. 8, 5, 602--615. Google Scholar
Digital Library
- Cheung, S. C. and Zakhor, A. 2003. Efficient video similarity measurement with video signature. IEEE Trans. Circuits Syst. Video Technol. 13, 1, 59--74. Google Scholar
Digital Library
- Chiu, C. Y., Yang, C. C., and Chen, C. S. 2007. Efficient and effective video copy detection based on spatiotemporal analysis. In Proceedings of the IEEE International Symposium on Multimedia (ISM). 10--12, 202--209. Google Scholar
Digital Library
- Chiu, C. Y., Chen, C. S., and Chien, L. F. 2008. A framework for handling spatiotemporal variations in video copy detection. IEEE Trans. Circuits Syst. Video Technol. 18, 3, 412--417. Google Scholar
Digital Library
- Cohen, E., Datar, M., Fujiwara, S., Gionis, A., Indyk, P., Motwani, R., Ullman, J., and Yang, C. 2000. Finding interesting associations without support pruning. In Proceedings of the IEEE International Conference on Data Engineering (ICDE). 489--500. Google Scholar
Digital Library
- Das, A., Datar, M., and Garg, A. 2007. Google news personalization: scalable online collaborative filtering. In Proceedings of International World Wide Web Conference (WWW). Google Scholar
Digital Library
- Dementhon, D. and Doermann, D. 2006. Video retrieval of near-duplicates using k-nearest neighbor retrieval of spatio-temporal descriptors. Multimedia Tools Appl. 30, 3, 229--253. Google Scholar
Digital Library
- Deng, Y. and Manjunath, B. S. 1998. NeTra-V: toward an object-based video representation. IEEE Trans. Circuits Syst. Video Technol. 8, 5, 616--627. Google Scholar
Digital Library
- Ennesser, F. and Medioni, G. 1995. Finding Waldo, or focus of attention using local color information. IEEE Trans. Patt. Anal. Mach. Intell. 17, 8, 805--809. Google Scholar
Digital Library
- Hampapur, A. and Bolle, R. M. 2001. Comparison of distance measures for video copy detection. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME). 737--740.Google Scholar
- Hoad, T. C. and Zobel, J. 2006. Detection of video sequence using compact signatures. ACM Trans. Inform. Syst. 24, 1, 1--50. Google Scholar
Digital Library
- Hua, X. S., Chen, X., and Zhang, H. J. 2004. Robust video signature based on ordinal measure. In Proceedings of the IEEE International Conference on Image Processing (ICIP). Volume 1, 685--688.Google Scholar
- Jain, A. K., Vailaya, A., and Xiong, W. 1999. Query by video clip. Multimedia Syst. 7, 5, 369--384. Google Scholar
Digital Library
- Joly, A., Buisson, O., and Frelicot, C. 2007. Content-based copy retrieval using distortion-based probabilistic similarity search. IEEE Trans. Multimedia 9, 2, 293--306. Google Scholar
Digital Library
- Kashino, K., Kurozumi, T., and Murase, H. 2003. A quick search method for audio and video signals based on histogram pruning. IEEE Trans. Multimedia 5, 3, 348--357. Google Scholar
Digital Library
- Ke, Y., Sukthankar, R., and Huston, L. 2004. An efficient parts-based near-duplicate and sub-image retrieval system. In Proceedings of the ACM International Conference on Multimedia (MM). 869--876. Google Scholar
Digital Library
- Kim, C. and Vasudev, B. 2005. Spatiotemporal sequence matching for efficient video copy detection. IEEE Trans. Circuits Syst. Video Technol. 15, 1, 127--132. Google Scholar
Digital Library
- Kim, H. S., Lee, J., Liu, H., and Lee, D. 2008. Video linkage: group based copied video detection. In Proceedings of the ACM International Conference on Content-based Image and Video Retrieval (CIVR). 397--406. Google Scholar
Digital Library
- Law-To, J., Buisson, O., Gouet-Brunet, V., and Boujemaa, N. 2006. Robust voting algorithm based on labels of behavior for video copy detection. In Proceedings of the ACM International Conference on Multimedia (MM). 835--844. Google Scholar
Digital Library
- Law-To, J., Chen, L., Joly, A., Laptev, I., Buission, O., Gouet-Brunet, V., Boujemaa, N., and Stentiford, F. 2007. Video copy detection: a comparative study. In Proceedings of the ACM International Conference on Image and Video Retrieval (CIVR). 371--378. Google Scholar
Digital Library
- Liu, T., Zhang, H. J., and Qi, F. 2003. A novel video key-frame extraction algorithm based on perceived motion energy model. IEEE Trans. Circuits Syst. Video Technol. 13, 10, 1006--1013. Google Scholar
Digital Library
- Lowe, D. G. Distinctive image features from scale-invariant keypoints. 2004. Inter. J. Comput. Vision 60, 2, 91--110. Google Scholar
Digital Library
- Massoudi, A., Lefebvre, F., Demarty, C. H., Oisel, L., and Chupeau, B. 2006. A video fingerprint based on visual digest and local fingerprints. In Proceedings of the IEEE International Conference on Image Processing (ICIP). 2297--2300.Google Scholar
- Naphade, M. R. and Huang, T. S. 2001. A probabilistic framework for semantic video indexing, filtering, and retrieval. IEEE Trans. Multimedia 3, 1, 141--151. Google Scholar
Digital Library
- Poullot, S., Crucianu, M., and Buisson, O. 2008. Scalable mining of large video databases using copy detection. In Proceedings of the ACM International Conference on Multimedia (MM). 61--70. Google Scholar
Digital Library
- Sayood, K. 1996. Introduction to Data Compression, Morgan Kaufmann, Los Altos, CA. Google Scholar
Digital Library
- Schmid, C. and Mohr, R. 1997. Local grayvalue invariants for image retrieval. IEEE Trans. Patt. Anal. Mach. Intell. 19, 5, 530--535. Google Scholar
Digital Library
- Shen, H. T., Ooi, B. C., Zhou, X., and Huang, Z. 2005. Towards effective indexing for very large video sequence database. In Proceedings of the ACM International Conference on Management of Data (SIGMOD). 730--741. Google Scholar
Digital Library
- Smoliar, S. W. and Zhang, H. 1994. Content-based video indexing and retrieval. IEEE Multimedia 1, 2, 62--72. Google Scholar
Digital Library
- Sonka, M., Hlavac, V., and Boyle, R. 1999. Image Processing, Analysis, and Machine Vision, Brooks/Cole Publishing, Pacific Grove, CA. Google Scholar
Digital Library
- Swain, M. J. and Ballard, D. H. 1991. Color indexing. Int. J. Comput. Vision 7. 1, 11--32. Google Scholar
Digital Library
- Willems, G., Tuytelaars, T., and Gool, L. V. 2008. Spatio-temporal features for robust content-based video copy detection. In Proceedings of the ACM International Conference on Multimedia Information Retrieval. 283--290. Google Scholar
Digital Library
- Wu, X., Hauptmann, A. G., and Ngo, C. W. 2007. Practical elimination of near-duplicates from Web video search. In Proceedings of the ACM International Conference on Multimedia (MM). 218--227. Google Scholar
Digital Library
- Yuan, J., Duan, L. Y., Tian, Q., and Xu, C. 2004. Fast and robust search short video clip search using an index structure. In Proceedings of the ACM International Workshop on Multimedia Information Retrieval (MIR). 61--68. Google Scholar
Digital Library
Index Terms
Fast min-hashing indexing and robust spatio-temporal matching for detecting video copies
Recommendations
Weber Binarized Statistical Image Features (WBSIF) based video copy detection
Display Omitted A novel Video Copy Detection (VCD) is proposed.The VCD is based on the Weber Binarized Statistical Image Features descriptor (WBSIF).The matching process is performed by using of the 2 test.Results outline the robustness and ...
A spatio-temporal pyramid matching for video retrieval
Highlights We introduce a content-based video retrieval system for a query video shot. The shot boundaries are found using a classifier learnt from a boosting algorithm. The similarity of video shots is calculated by spatio-temporal pyramid matching. ...
Accurate content-based video copy detection with efficient feature indexing
ICMR '11: Proceedings of the 1st ACM International Conference on Multimedia RetrievalWe describe an accurate content-based copy detection system that uses both local and global visual features to ensure robustness. Our system advances state-of-the-art techniques in four key directions. (1) Multiple-codebook-based product quantization: ...






Comments