Abstract
Rate-distortion (R-D) models are functions that describe the relationship between the bitrate and expected level of distortion in the reconstructed video stream. R-D models enable optimization of the received video quality in different network conditions. Several R-D models have been proposed for the increasingly popular fine-grained scalable video sequences. However, the models' relative performance has not been thoroughly analyzed. Moreover, the time complexity of each model is not known, nor is the range of bitrates in which the model produces valid results. This lack of quantitative performance analysis makes it difficult to select the model that best suits a target streaming system. In this article, we classify, analyze, and rigorously evaluate all R-D models proposed for FGS coders in the literature. We classify R-D models into three categories: analytic, empirical, and semi-analytic. We describe the characteristics of each category. We analyze the R-D models by following their mathematical derivations, scrutinizing the assumptions made, and explaining when the assumptions fail and why. In addition, we implement all R-D models, a total of eight, and evaluate them using a diverse set of video sequences. In our evaluation, we consider various source characteristics, diverse channel conditions, different encoding/decoding parameters, different frame types, and several performance metrics including accuracy, range of applicability, and time complexity of each model. We also present clear systematic ways (pseudo codes) for constructing various R-D models from a given video sequence. Based on our experimental results, we present a justified list of recommendations on selecting the best R-D models for video-on-demand, video conferencing, real-time, and peer-to-peer streaming systems.
- Adjeroh, D. and Lee, M. 2004. Scene-adaptive transform domain video partitioning. IEEE Trans. Multimedia 6, 1, 58--69. Google Scholar
Digital Library
- Center for Image Processing Research. 2006. http://www.cipr.rpi.edu/resource.Google Scholar
- Chiang, T. and Zhang, Y. 1997. A new rate control scheme using quadratic rate distortion model. IEEE Trans. Circ. Syst. Video Techn. 7, 1, 246--250. Google Scholar
Digital Library
- Dai, M. 2004. Rate-distortion analysis and traffic modeling of scalable video coders. Ph.D. thesis, Department of Electrical Engineering, Texas A&M University. Google Scholar
Digital Library
- Dai, M. and Loguinov, D. 2003. Analysis of rate-distortion functions and congestion control in scalable Internet video streaming. In Proceedings of ACM International Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV'03). Monterey, CA. Google Scholar
Digital Library
- Dai, M., Loguinov, D., and Radha, H. 2003. Statistical analysis and distortion modeling of MPEG-4 FGS. In Proceedings of IEEE International Conference on Image Processing (ICIP'03). Barcelona, Spain.Google Scholar
- Dai, M., Loguinov, D., and Radha, H. 2004. Rate-distortion modeling of scalable video coders. In Proceedings of IEEE International Conference on Image Processing (ICIP'04). Singapore.Google Scholar
- Dai, M., Loguinov, D., and Radha, H. 2006. Rate-distortion analysis and quality control in scalable Internet streaming. IEEE Trans. Multimedia 8, 6, 1135--1146. Google Scholar
Digital Library
- Ding, W. and Liu, B. 1996. Rate control of MPEG video coding and recording by rate-quantization modeling. IEEE Trans. Circ. Syst. Video Techn. 6, 1, 12--20. Google Scholar
Digital Library
- Hang, H. and Chen, J. 1997. Source model for transform video coder and its application I: Fundamental theory. IEEE Trans. Circ. Syst. Video Techn. 7, 2, 287--298. Google Scholar
Digital Library
- He, Z., Kim, Y., and Mitra, S. 2001. Low-delay rate control for DCT video coding via ρ-domain source modeling. IEEE Trans. Circ. Syst. Video Techn. 11, 8, 928--940. Google Scholar
Digital Library
- He, Z. and Mitra, S. 2001. A unified rate-distortion analysis framework for transform coding. IEEE Trans. Circ. Syst. Video Techn. 11, 12, 1221--1236. Google Scholar
Digital Library
- He, Z. and Mitra, S. 2002. A linear source model and a unified rate control algorithm for DCT video coding. IEEE Trans. Circ. Syst. Video Techn. 12, 11, 970--982. Google Scholar
Digital Library
- Hsu, C. and Hefeeda, M. 2006a. On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences. Tech. rep. TR 2006-12, Simon Fraser University. http://nsl.cs.surrey.sfu.ca/projects/fgs/.Google Scholar
- Hsu, C. and Hefeeda, M. 2006b. Source models for fine-grained scalable video sequences. Tech. rep., Simon Fraser University.Google Scholar
- ISO/IEC 14496-2. 2004. Coding of audio-visual objects—part 2: Visual.Google Scholar
- ISO/IEC 14496-5. 2004. MPEG-4 Visual reference software.Google Scholar
- Joshi, R. and Fischer, T. 1995. Comparison of generalized Gaussian and Laplacian modeling in DCT image coding. IEEE Signal Proces. Lett. 2, 5, 81--82.Google Scholar
Cross Ref
- Li, W. 2001. Overview of fine-granularity scalability in MPEG-4 video standard. IEEE Trans. Circ. Syst. Video Techn. 11, 3, 301--317. Google Scholar
Digital Library
- Lin, L. and Ortega, A. 1998. Bit-rate control using piecewise approximated rate-distortion characteristics. IEEE Trans. Circ. Syst. Video Techn. 8, 4, 446--459. Google Scholar
Digital Library
- Mallet, S. and Falzon, F. 1998. Analysis of low bit rate image transform coding. IEEE Trans. Signal Process. 46, 4, 1027--1042. Google Scholar
Digital Library
- Martinez, W. and Martinez, A. 2002. Computational Statistics Handbook with Matlab, 1st ed. Chapman and Hall, Upper Saddle River, NJ. Google Scholar
Digital Library
- Muller, F. 1993. Distribution shape of two-dimensional DCT coefficients of natural images. IEEE Electron. Lett. 29, 22, 1935--1936.Google Scholar
Cross Ref
- Park, H. J. and Lee, T. W. 2004. Modeling nonlinear dependencies in natural images using mixture of Laplacian distribution. In Proceedings of Advances in Neural Information Processing Systems (NIPS'04). Vancouver, Canada.Google Scholar
- Radha, H., Schaar, M., and Chen, Y. 2001. The MPEG-4 fine-grained scalable video coding method for multimedia streaming over IP. IEEE Trans. Multimedia 3, 1, 53--68. Google Scholar
Digital Library
- Sullivan, G. and Wiegand, T. 1998. Rate-distortion optimization for video compression. IEEE Signal Process. Magazine 15, 6, 74--90.Google Scholar
Cross Ref
- Sun, J., Gao, W., Zhao, D., and Huang, Q. 2005. Statistical model, analysis and approximation of rate-distortion function in MPEG-4 FGS videos. In Proceedings of SPIE International Conference on Visual Communication and Image Processing (VCIP'05). Beijing, China.Google Scholar
- Varanasi, M. and Aazhang, B. 1989. Parametric generalized Gaussian density estimation. J. Acoust. Soc. Amer. 86, 4, 1404--1415.Google Scholar
Cross Ref
- Video Traces Research Group. 2006. http://trace.eas.asu.edu/yuv.Google Scholar
- Zhang, X., Vetro, A., Shi, Y., and Sun, H. 2003. Constant quality constrained rate allocation for FGS-coded video. IEEE Trans. Circ. Syst. Video Techn. 13, 2, 121--130. Google Scholar
Digital Library
Index Terms
On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences
Recommendations
On rate-distortion modeling and extraction of H.264/SVC fine-granular scalable video
Fine-granular scalable (FGS) technologies in H.264/ AVC-based scalable video coding (SVC) provide a flexible foundation to accommodate different network capacities. To support efficient quality extraction, it is important to obtain the rate-distortion (...
Comparative Rate-Distortion-Complexity Analysis of HEVC and AVC Video Codecs
This paper analyzes the rate-distortion-complexity of High Efficiency Video Coding (HEVC) reference video codec (HM) and compares the results with AVC reference codec (JM). The examined software codecs are HM 6.0 using Main Profile (MP) and JM 18.0 ...
Low-Complexity Rate-Distortion Optimization Algorithms for HEVC Intra Prediction
MMM 2014: Proceedings of the 20th Anniversary International Conference on MultiMedia Modeling - Volume 8325HEVC achieves a better coding efficiency relative to prior standards, but also involves dramatically increased complexity. The complexity increase for intra prediction is especially intensive due to a highly flexible quad-tree coding structure and a ...






Comments