Abstract
Three-dimensional video (3DV) is attracting many interests with its enhanced viewing experience and more user driven features. 3DV has several unique characteristics different from 2D video: (1) It has a much larger amount of data captured and compressed, and corresponding video compression techniques can be much more complicated in order to explore data redundancy. This will lead to more constraints on users' network access and computational capability, (2) Most users only need part of the 3DV data at any given time, while the users' requirements exhibit large diversity, (3) Only a limited number of views are captured and transmitted for 3DV. View rendering is thus necessary to generate virtual views based on the received 3DV data. However, many terminal devices do not have the functionality to generate virtual views. To enable 3DV experience for the majority of users with limited capabilities, adaptive 3DV transmission is necessary to extract/generate the required data content and represent it with supported formats and bitrates for heterogeneous terminal devices. 3DV transcoding is an emerging and effective technique to achieve desired adaptive 3DV transmission. In this article, we propose the first efficient 3DV transcoding scheme that can obtain any desired view, either an encoded one or a virtual one, and compress it with more universal H.264/AVC. The key idea of the proposed scheme is to appropriately utilize motion information contained in the bitstream to generate candidate motion information. Original information of both the desired view and reference views are used to obtain this candidate information and a proper motion refinement process is carried out for certain blocks. Simulation results show that, compared to the straightforward cascade algorithm, the proposed scheme is able to output compressed bitstream of the required view with significantly reduced complexity while incurring negligible performance loss. Such a 3DV transcoding can be applied to most gateways that usually have constraints on computational complexity and time delay.
- Ahmand, I., Wei, X., Sun, Y., and Zhang, Y.-Q. 2005. Video transcoding: An Overview of various techniques and research issues. IEEE Trans. Multimed. 7, 5. Google Scholar
Digital Library
- Bai, B., Boulanger, P., and Harms, J. 2005. A multiview video transcoder. In Proceedings of the ACM Multimedia. 503--506. Google Scholar
Digital Library
- Cha, J., Eid, M., and Saddik, A. 2009. Touchable 3D video system. ACM Trans. Multimed. Comput. Commun. Appl. 5, 4. Google Scholar
Digital Library
- Cheung, G., Ortega, A., and Cheung, N.-M. 2011. Interactive streaming of stored multiview video using redundant frame structures. IEEE Trans. Image Process. 20, 3. Google Scholar
Digital Library
- Fehn, C. 2004. Depth-image-based-rendering (DIBR), compression and transmission for a new approach on 3D-TV. In SPIE Stereoscopic Displays and Virtual Reality Systems.Google Scholar
- Florêncio, D. and Zhang, C. 2009. Multiview video compression and streaming based on predicted viewer position. In Proceedings of ICASSP '09. Google Scholar
Digital Library
- JVT-AA209. 2008. Joint Draft 7.0 on multiview video coding.Google Scholar
- JVT-T207. 2006. Common Test Conditions for Multiview Video Coding. Klagenfurt, Austria.Google Scholar
- Kurutepe, E., Civanlar, M. R., and Tekalp, A. M. 2007. Client-driven selective streaming of multiview video for interactive 3DTV. IEEE Trans. Circuite Syst. Video. Technol. 17. Google Scholar
Digital Library
- Levoy, M. and Hanrahan, P. 1996. Light field rendering, In Proceedings of SIGGRAPH'96, ACM, pp. 31--42. Google Scholar
Digital Library
- Liu, S. and Chen, C. W. 2009. Multiview video transcoding: From multiple views to single view. In Proceedings of the Picture Coding Symposium (PCS'09). Google Scholar
Digital Library
- Liu, S. and Chen, C. W. 2010. 3D video transcoding for virtual views. In Proceedings of ACM Multimedia. Google Scholar
Digital Library
- Liu, Y., Huang, Q., Ma, S., Zhao, D., and Gao, W. 2009. Joint video/depth rate allocation for 3D video coding based on view synthesis distortion model. Signal Process. Image Commun. 24, 8. Google Scholar
Digital Library
- Liu, S., Lai, P., Tian, D., and Chen, C. W. 2011. New depth coding techniques with utilization of corresponding video. IEEE Trans. Broadcas. 57, 2.Google Scholar
Cross Ref
- Lou, J.-G., Cai, H., and Li, J. 2007. Interactive multiview video delivery based on IP multicast. In Advances in Multimedia. Google Scholar
Digital Library
- Maitre, M. and Do, M. N. 2008. Joint encoding of the depth image based representation using shape-adaptive wavelets. In Proceedings of ICIP. 1768--1771Google Scholar
- Microsoft 3D Video Test Sequences (available online: http://research.microsoft.com/en-us/um/people/sbkang/3dvideodownload/)Google Scholar
- MPEG Video and Requirements Subgroup. 2009. Applications and requirements on 3D video coding. Document w11061. MPEG.Google Scholar
- Müller, K., Smolic, A., Dix, K., Merkle, P., Kauff, P., and Wiegand, T. 2008. Reliability-based generation and view synthesis in layered depth video. In Proceedings of the IEEE International Workshop on Multimedia Signal Processing.Google Scholar
- Oh, H. and Ho, Y.-S. 2006. H.264-based depth map coding using motion information of correspondingtexture video. Adv. Image Video Tech. 4319. Google Scholar
Digital Library
- Pan, Z., Ikuta, Y., Bandai, M., and Watanabe, T. 2011. User dependent scheme for multi-view video transmission. In Proceedings of ICC.Google Scholar
- Shum, H.-Y., Chan, S.-C., and Kang, S. B. 2006. Image-Based Rendering. Springer. Google Scholar
Digital Library
- Smolic, A., Müller, K., Dix, K., Merkle, P., Kauff, P., and Wiegand, T. 2008. Intermediate View Interpolation Based on Multiview Video Plus Depth for Advanced 3D Video Systems. In Proceedings of the IEEE International Conference on Image Processing.Google Scholar
- Smolic, A., Müller, K., Merkele, P., Kauff, P., and Wiegand, T. 2009. An overview of available and emerging 3D video formats and depth enhanced stereo as efficient generic solution. In Proceedings of the Picture Coding Symposium. Google Scholar
Digital Library
- Tang, X.-L., Dai, S.-K., and Cai, C.-H. 2010. An analysis of TZSearch algorithm. In Proceedings of ICGCS.Google Scholar
- Tanimoto, M., Fuji, T., and Suzuki, K. 2009. View synthesis algorithm in view synthesis reference software 3.0 (VSRS3.0), Tech. rep. Document M16090, ISO/IEC JTC1/SC29/WG11.Google Scholar
- Vetro, A., Matusik, W., Pfister, H., and Xin, J. 2004. Coding approaches for end-to-end 3D TV systems. Proceedings of the Picture Coding Symposium.Google Scholar
- Yang, Z., Wu, W., Nahrstedt, K., Kurillo G., and Bajscy, R. 2010. Enabling Multi-Party 3D Tele-Immersive Environments with ViewCast. ACM Trans. Multimedia Comput. Commun. Appl. 6, 2. Google Scholar
Cross Ref
- Yang, Y., Yu, M., Jiang, G., and Peng, Z. 2007. A transmission and interaction oriented free-viewpoint video system. Int. J. Circ. Syst. Signal Process. 4, 1.Google Scholar
- Zhang, C. and Florêncio, D. 2010. Joint tracking and multiview video compression. In Proceedings of VCIP.Google Scholar
- Zitnick, L., Kang, S. B., Uyttendaele, M., Winder, S., and Szeliski, R. 2004. High-quality video view interpolation using a layered representation. ACM Trans. Graph. 23, 3, 600--608. Google Scholar
Digital Library
Index Terms
A novel 3D video transcoding scheme for adaptive 3D video transmission to heterogeneous terminals
Recommendations
3D video transcoding for virtual views
MM '10: Proceedings of the 18th ACM international conference on MultimediaRecent emerging development of three dimensional video (3DV) has been vigorously driving the Multiview Video Coding (MVC) standard developed by Joint Video Team as an amendment to H.264/AVC and the new 3DV standard developed by MPEG. It is expected that ...
Multiview video transcoding: from multiple views to single view
PCS'09: Proceedings of the 27th conference on Picture Coding SymposiumAs multiview video is gaining more and more attentions, Multiview Video Coding (MVC) standard has been under development by the Joint Video Team as an extension to H.264/AVC. There will be increasingly more multiview video sources for both high end and ...
Spatial transcoding from scalable video coding to H.264/AVC
ICME'09: Proceedings of the 2009 IEEE international conference on Multimedia and ExpoScalable Video Coding (SVC) is backwards compatible to H.264/AVC in the sense that the base layer sub-bitstream is decodable by an H.264/AVC decoder. However, there are applications wherein it is desirable for an H.264/AVC decoder to obtain a higher ...






Comments