Abstract
This article presents a novel content-based copy detection system for 3D videos. The system creates compact and robust depth and visual signatures from the 3D videos. Then, signature of a query video is compared against an indexed database of reference videos' signatures. The system returns a score, using both spatial and temporal characteristics of videos, indicating whether the query video matches any video in the reference video database, and in case of matching, which portion of the reference video matches the query video. Analysis shows that the system is efficient, both computationally and storage-wise. The system can be used, for example, by video content owners, video hosting sites, and third-party companies to find illegally copied 3D videos. We implemented Spider, a complete realization of the proposed system, and conducted rigorous experiments on it. Our experimental results show that the proposed system can achieve high accuracy in terms of precision and recall even if the 3D videos are subjected to several transformations at the same time. For example, the proposed system yields 100% precision and recall when copied videos are parts of original videos, and more than 90% precision and recall when copied videos are subjected to different individual transformations.
Supplemental Material
Available for Download
Supplemental movie, appendix, image and software files for, Spider: A system for finding 3D video copies
- Bino Free 3D Player. 2013. http://bino3d.org/Google Scholar
- FFmpeg. 2013. http://www.ffmpeg.org/Google Scholar
- Jopensurf. 2013. http://code.google.com/p/jopensurfGoogle Scholar
- Microsoft Research. 2013. MSR 3D video. http://research.microsoft.com/en-us/um/people/sbkang/3dvideodownload/Google Scholar
- Mobile 3DTV. 2013. http://sp.cs.tut.fi/mobile3dtv/video-plus-depth/Google Scholar
- Traixes DepthGate. 2013. http://doc.triaxes.tv/depthgate.Google Scholar
- ISO. 2008. ISO/IEC JTC1/SC29/WG11. Reference softwares for depth estimation and view synthesis. Doc. M15377.Google Scholar
- FLANN. 2011. FLANN - Fast library for approximate nearest neighbors. http://www.cs.ubc.ca/~mariusm/index.php/FLANN.Google Scholar
- Aly, M., Munich, M., and Perona, P. 2011a. Distributed kd-trees for retrieval from very large image collections. In Proceedings of the British Machine Vision Conference (BMVC).Google Scholar
- Aly, M., Munich, M., and Perona, P. 2011b. Indexing in large scale image collections: Scaling properties and benchmark. In Proceedings of the IEEE Workshop on Applications of Computer Vision (WACV'11). 418--425. Google Scholar
Digital Library
- Chen, W.-Y., Chang, Y.-L., Lin, S.-F., Ding, L.-F., and Chen, L.-G. 2005. Efficient depth image based rendering with edge dependent depth filter and interpolation. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'05). 1314--1317.Google Scholar
- Datar, M., Immorlica, N., Indyk, P., and Mirrokni, V. S. 2004. Locality-Sensitive hashing scheme based on p-stable distributions. In proceedings of the 20th Annual Symposium on Computational geometry (SCG'04).253--262. Google Scholar
Digital Library
- Friedman, J. H., Bentley, J. L., and Finkel, R. A. 1977. An algorithm for finding best matches in logarithmic expected time. ACM Trans. Math. Softw. 3, 3, 209--226. Google Scholar
Digital Library
- Gong, Y. and Liu, X. 2000. Video summarization using singular value decomposition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'00). 174--180.Google Scholar
- Hampapur, A., Hyun, K., and Bolle, R. M. 2002. Comparison of sequence matching techniques for video copy detection. In Proceedings of the SPIE Conference on Storage and Retrieval for Media Databases (SPIE'02). 194--201.Google Scholar
- Harvey, R. C. and Hefeeda, M. 2012. Spatio-temporal video copy detection. In proceedings of the ACM Multimedia Systems conference (MMSys'12). Chapel Hill, NC, USA. Google Scholar
Digital Library
- Kahng, A. B., Lach, J., Mangione-Smith, W. H., Mantik, S., Markov, I. L., Potkonjak, M., Tucker, P., Wang, H., and Wolfe, G. 1998. Watermarking techniques for intellectual property protection. In Proceedings of the 35th Annual Design Automation Conference (DAC'98). 776--781. Google Scholar
Digital Library
- Kauff, P., Atzpadin, N., Fehn, C., Mller, M., Schreer, O., Smolic, A., and Tanger, R. 2007. Depth map creation and image-based rendering for advanced 3DTV services providing interoperability and scalability. Signal Process. Image Comm. 22, 2, 217--234. Google Scholar
Digital Library
- Khodabakhshi, N. and Hefeeda, M. 2012. Copy detection of 3d videos. In Proceedings of the ACM Multimedia Systems Conference (MMSys'12). Chapel Hill, NC, USA. Google Scholar
Digital Library
- Koz, A., Cigla, C., and Alatan, A. 2010. Watermarking of free-view video. IEEE Trans. Image Process. 19, 7, 1785--1797. Google Scholar
Digital Library
- Li, Z. and Chen, J. 2010. Efficient compressed domain video copy detection. In Proceedings of the International Conference on Management and Service Science (MASS'10). 1--4.Google Scholar
- Liu, Z., Gibbon, D., Zavesky, E., Shahraray, B., and Haffner, P. 2007. A fast, comprehensive shot boundary determination system. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME'07). 1487--1490.Google Scholar
- Liu, Z., Liu, T., Gibbon, D. C., and Shahraray, B. 2010. Effective and scalable video copy detection. In Proceedings of the International Conference on Multimedia Information Retrieval (MIR'10). 119--128. Google Scholar
Digital Library
- Lowe, D. G. 2004. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60, 2, 91--110. Google Scholar
Digital Library
- Mahanti, A., Eager, D., Vernon, M., and Sundaram-Stukel, D. 2008. Speeded-up robust features (SURF). Comput. Vis. Image Understand. 110, 3, 346--359. Google Scholar
Digital Library
- Merkle, P., Muller, K., and Wiegand, T. 2010. 3D video: Acquisition, coding, and display. In Proceedings of the International Conference on Consumer Electronics Digest of Technical Papers (ICCE'10). 127--128.Google Scholar
- Ramachandra, V., Zwicker, M., and Nguyen, T. 2008. 3D video fingerprinting. In Proceedings of the 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV'08). 81--84.Google Scholar
- Roth, G., Laganie andre, R., Lambert, P., Lakhmiri, I., and Janati, T. 2010. A simple but effective approach to video copy detection. In Proceedings of the Canadian Conference on Computer and Robot Vision (CRV'10). 63--70. Google Scholar
Digital Library
- Shum, H. Y., Li, Y., and Kang, S. B. 2004. An introduction to image-based rendering. In Integrated Image and Graphics Technologies, D. Zhang, M. Kamel, and G. Baciu, Eds., The Kluwer International Series in Engineering and Computer Science, vol. 762, Springer. Google Scholar
Digital Library
- Silpa-Anan, C. and Hartley, R. 2008. Optimised kd-trees for fast image descriptor matching. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08). USA, 1--8.Google Scholar
- Smolic, A., Müller, K., Stefanoski, N., Ostermann, J., Gotchev, A., Akar, G. B., Triantafyllidis, G., and Koz, A. 2007. Coding algorithms for 3DTV-a survey. IEEE Tran. Circ. and Syst. Video Technol. 17, 11, 1606--1621. Google Scholar
Digital Library
- Szeliski, R.2013. Stereo correspondence. In Computer Vision, D. Gries and F. B. Schneider, Eds., Texts in Computer Science., Springer, 467--503.Google Scholar
- Tasdemir, K. and Cetin, A. 2010. Motion vector based features for content based video copy detection. In Proceedings of the International Conference on Pattern Recognition (ICPR'10). 3134--3137. Google Scholar
Digital Library
- Wang, L., Liao, M., Gong, M., Yang, R., and Nister, D. 2006. High-quality real-time stereo using adaptive cost aggregation and dynamic programming. In Proceedings of the 3rd International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06). 798--805. Google Scholar
Digital Library
- Zhang, Z. and Zou, J. 2010. copy detection based on edge analysis. In Proceedings of the IEEE International Conference on Information and Automation (ICIA'10). 2497--2501.Google Scholar
- Zitnick, C. L., Kang, S. B., Uyttendaele, M., Winder, S., and Szeliski, R. 2004. High-quality video view interpolation using a layered representation. ACM Trans. Graph. 23, 3, 600--608. Google Scholar
Digital Library
Index Terms
Spider: A system for finding 3D video copies
Recommendations
Copy detection of 3D videos
MMSys '12: Proceedings of the 3rd Multimedia Systems ConferenceWe present a novel system to detect copies of 3D videos. The system creates signatures from the depth signals of 3D videos. It also extracts visual features from video frames and creates compact spatial signatures for videos. The system then uses the ...
Spatio-temporal video copy detection
MMSys '12: Proceedings of the 3rd Multimedia Systems ConferenceVideo copy detection algorithms are used to find copies of original video content even if the content has been altered. Given the prevalence of video recording and copying devices as well as the availability of many Internet sites for hosting videos, ...
Video copy detection: sequence matching using hypothesis test
AST/UCMA/ISA/ACN'10: Proceedings of the 2010 international conference on Advances in computer science and information technologyvideo copy detection is intended for verifying whether a video sequence is copied from another or not. Such techniques can be used for protecting the copyright. A content-based video detection system extracts signature of the video from its visual ...






Comments