Abstract
As a 3D extension of the High Efficiency Video Coding (HEVC) standard, 3D-HEVC was developed to improve the coding efficiency of multiview video. It inherits the prediction modes of HEVC, but coding dependent views requires both Motion Estimation (ME) and Disparity Estimation (DE). This improves coding efficiency at the cost of a large increase in computational complexity. In this article, an early Merge mode decision approach based on prior and posterior probability models is proposed for coding dependent texture views and dependent depth maps in 3D-HEVC. First, the prior probability model is established by exploiting the hierarchical and inter-view correlations of previously encoded blocks. Second, the posterior probability model is built using the Coded Block Flag (CBF) of the current coding block. Finally, the joint prior and posterior probability model is used to terminate the Merge mode decision early for both dependent texture views and dependent depth maps. Experimental results show that the proposed approach saves, on average, 45.2% of the encoding time for dependent texture views and 30.6% for dependent depth maps, with negligible loss of coding efficiency.
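The three steps named in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual models, so everything below is an assumption made for illustration: the `NeighborInfo` structure, the lookup table of prior probabilities, the CBF likelihood values, and the decision threshold are all hypothetical, and the Bayes update is a generic stand-in for the joint model rather than the authors' formulation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NeighborInfo:
    """Best-mode flags of previously encoded blocks (hypothetical structure)."""
    parent_is_merge: bool      # hierarchical: parent CU at the upper quadtree depth
    interview_is_merge: bool   # inter-view: corresponding block in the reference view


def prior_merge_probability(info: NeighborInfo, table: dict) -> float:
    """Step 1: look up P(Merge) for the neighbor context.

    `table` is assumed to hold frequencies gathered from earlier encoded blocks.
    """
    return table[(info.parent_is_merge, info.interview_is_merge)]


def posterior_merge_probability(prior: float, cbf_is_zero: bool,
                                p_cbf0_given_merge: float,
                                p_cbf0_given_other: float) -> float:
    """Step 2: update the prior with the CBF observation using Bayes' rule."""
    like_merge = p_cbf0_given_merge if cbf_is_zero else 1.0 - p_cbf0_given_merge
    like_other = p_cbf0_given_other if cbf_is_zero else 1.0 - p_cbf0_given_other
    evidence = like_merge * prior + like_other * (1.0 - prior)
    if evidence == 0.0:
        return prior  # observation carries no information; keep the prior
    return like_merge * prior / evidence


def terminate_early(posterior: float, threshold: float = 0.95) -> bool:
    """Step 3: skip the remaining mode checks when Merge is likely enough."""
    return posterior >= threshold


# Illustrative numbers only; none of these values come from the paper.
PRIOR_TABLE = {
    (True, True): 0.90,
    (True, False): 0.70,
    (False, True): 0.65,
    (False, False): 0.30,
}

info = NeighborInfo(parent_is_merge=True, interview_is_merge=True)
prior = prior_merge_probability(info, PRIOR_TABLE)
post_zero = posterior_merge_probability(prior, True, 0.95, 0.30)
post_nonzero = posterior_merge_probability(prior, False, 0.95, 0.30)
print(terminate_early(post_zero), terminate_early(post_nonzero))
```

In this sketch a zero CBF raises the Merge probability above the threshold, so the remaining ME and DE checks would be skipped, while a nonzero CBF lowers it and the full mode decision continues. A real encoder would estimate the table and likelihoods from training sequences and tune the threshold against the acceptable rate-distortion loss.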
Index Terms
Probability Model-Based Early Merge Mode Decision for Dependent Views Coding in 3D-HEVC