skip to main content
research-article

On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences

Published:16 May 2008Publication History
Skip Abstract Section

Abstract

Rate-distortion (R-D) models are functions that describe the relationship between the bitrate and expected level of distortion in the reconstructed video stream. R-D models enable optimization of the received video quality in different network conditions. Several R-D models have been proposed for the increasingly popular fine-grained scalable video sequences. However, the models' relative performance has not been thoroughly analyzed. Moreover, the time complexity of each model is not known, nor is the range of bitrates in which the model produces valid results. This lack of quantitative performance analysis makes it difficult to select the model that best suits a target streaming system. In this article, we classify, analyze, and rigorously evaluate all R-D models proposed for FGS coders in the literature. We classify R-D models into three categories: analytic, empirical, and semi-analytic. We describe the characteristics of each category. We analyze the R-D models by following their mathematical derivations, scrutinizing the assumptions made, and explaining when the assumptions fail and why. In addition, we implement all R-D models, a total of eight, and evaluate them using a diverse set of video sequences. In our evaluation, we consider various source characteristics, diverse channel conditions, different encoding/decoding parameters, different frame types, and several performance metrics including accuracy, range of applicability, and time complexity of each model. We also present clear systematic ways (pseudo codes) for constructing various R-D models from a given video sequence. Based on our experimental results, we present a justified list of recommendations on selecting the best R-D models for video-on-demand, video conferencing, real-time, and peer-to-peer streaming systems.

References

  1. Adjeroh, D. and Lee, M. 2004. Scene-adaptive transform domain video partitioning. IEEE Trans. Multimedia 6, 1, 58--69. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Center for Image Processing Research. 2006. http://www.cipr.rpi.edu/resource.Google ScholarGoogle Scholar
  3. Chiang, T. and Zhang, Y. 1997. A new rate control scheme using quadratic rate distortion model. IEEE Trans. Circ. Syst. Video Techn. 7, 1, 246--250. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Dai, M. 2004. Rate-distortion analysis and traffic modeling of scalable video coders. Ph.D. thesis, Department of Electrical Engineering, Texas A&M University. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Dai, M. and Loguinov, D. 2003. Analysis of rate-distortion functions and congestion control in scalable Internet video streaming. In Proceedings of ACM International Workshop on Network and Operating Systems Support for Digital Audio and Video (NOSSDAV'03). Monterey, CA. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Dai, M., Loguinov, D., and Radha, H. 2003. Statistical analysis and distortion modeling of MPEG-4 FGS. In Proceedings of IEEE International Conference on Image Processing (ICIP'03). Barcelona, Spain.Google ScholarGoogle Scholar
  7. Dai, M., Loguinov, D., and Radha, H. 2004. Rate-distortion modeling of scalable video coders. In Proceedings of IEEE International Conference on Image Processing (ICIP'04). Singapore.Google ScholarGoogle Scholar
  8. Dai, M., Loguinov, D., and Radha, H. 2006. Rate-distortion analysis and quality control in scalable Internet streaming. IEEE Trans. Multimedia 8, 6, 1135--1146. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Ding, W. and Liu, B. 1996. Rate control of MPEG video coding and recording by rate-quantization modeling. IEEE Trans. Circ. Syst. Video Techn. 6, 1, 12--20. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Hang, H. and Chen, J. 1997. Source model for transform video coder and its application I: Fundamental theory. IEEE Trans. Circ. Syst. Video Techn. 7, 2, 287--298. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. He, Z., Kim, Y., and Mitra, S. 2001. Low-delay rate control for DCT video coding via ρ-domain source modeling. IEEE Trans. Circ. Syst. Video Techn. 11, 8, 928--940. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. He, Z. and Mitra, S. 2001. A unified rate-distortion analysis framework for transform coding. IEEE Trans. Circ. Syst. Video Techn. 11, 12, 1221--1236. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. He, Z. and Mitra, S. 2002. A linear source model and a unified rate control algorithm for DCT video coding. IEEE Trans. Circ. Syst. Video Techn. 12, 11, 970--982. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Hsu, C. and Hefeeda, M. 2006a. On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences. Tech. rep. TR 2006-12, Simon Fraser University. http://nsl.cs.surrey.sfu.ca/projects/fgs/.Google ScholarGoogle Scholar
  15. Hsu, C. and Hefeeda, M. 2006b. Source models for fine-grained scalable video sequences. Tech. rep., Simon Fraser University.Google ScholarGoogle Scholar
  16. ISO/IEC 14496-2. 2004. Coding of audio-visual objects—part 2: Visual.Google ScholarGoogle Scholar
  17. ISO/IEC 14496-5. 2004. MPEG-4 Visual reference software.Google ScholarGoogle Scholar
  18. Joshi, R. and Fischer, T. 1995. Comparison of generalized Gaussian and Laplacian modeling in DCT image coding. IEEE Signal Proces. Lett. 2, 5, 81--82.Google ScholarGoogle ScholarCross RefCross Ref
  19. Li, W. 2001. Overview of fine-granularity scalability in MPEG-4 video standard. IEEE Trans. Circ. Syst. Video Techn. 11, 3, 301--317. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Lin, L. and Ortega, A. 1998. Bit-rate control using piecewise approximated rate-distortion characteristics. IEEE Trans. Circ. Syst. Video Techn. 8, 4, 446--459. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Mallet, S. and Falzon, F. 1998. Analysis of low bit rate image transform coding. IEEE Trans. Signal Process. 46, 4, 1027--1042. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Martinez, W. and Martinez, A. 2002. Computational Statistics Handbook with Matlab, 1st ed. Chapman and Hall, Upper Saddle River, NJ. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Muller, F. 1993. Distribution shape of two-dimensional DCT coefficients of natural images. IEEE Electron. Lett. 29, 22, 1935--1936.Google ScholarGoogle ScholarCross RefCross Ref
  24. Park, H. J. and Lee, T. W. 2004. Modeling nonlinear dependencies in natural images using mixture of Laplacian distribution. In Proceedings of Advances in Neural Information Processing Systems (NIPS'04). Vancouver, Canada.Google ScholarGoogle Scholar
  25. Radha, H., Schaar, M., and Chen, Y. 2001. The MPEG-4 fine-grained scalable video coding method for multimedia streaming over IP. IEEE Trans. Multimedia 3, 1, 53--68. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Sullivan, G. and Wiegand, T. 1998. Rate-distortion optimization for video compression. IEEE Signal Process. Magazine 15, 6, 74--90.Google ScholarGoogle ScholarCross RefCross Ref
  27. Sun, J., Gao, W., Zhao, D., and Huang, Q. 2005. Statistical model, analysis and approximation of rate-distortion function in MPEG-4 FGS videos. In Proceedings of SPIE International Conference on Visual Communication and Image Processing (VCIP'05). Beijing, China.Google ScholarGoogle Scholar
  28. Varanasi, M. and Aazhang, B. 1989. Parametric generalized Gaussian density estimation. J. Acoust. Soc. Amer. 86, 4, 1404--1415.Google ScholarGoogle ScholarCross RefCross Ref
  29. Video Traces Research Group. 2006. http://trace.eas.asu.edu/yuv.Google ScholarGoogle Scholar
  30. Zhang, X., Vetro, A., Shi, Y., and Sun, H. 2003. Constant quality constrained rate allocation for FGS-coded video. IEEE Trans. Circ. Syst. Video Techn. 13, 2, 121--130. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. On the accuracy and complexity of rate-distortion models for fine-grained scalable video sequences

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Multimedia Computing, Communications, and Applications
        ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 4, Issue 2
        May 2008
        197 pages
        ISSN:1551-6857
        EISSN:1551-6865
        DOI:10.1145/1352012
        Issue’s Table of Contents

        Copyright © 2008 ACM

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 16 May 2008
        • Accepted: 1 February 2007
        • Revised: 1 December 2006
        • Received: 1 August 2006
        Published in tomm Volume 4, Issue 2

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader
      About Cookies On This Site

      We use cookies to ensure that we give you the best experience on our website.

      Learn more

      Got it!