skip to main content
research-article

A cognitive approach for effective coding and transmission of 3D video

Published:04 November 2011Publication History
Skip Abstract Section

Abstract

Future multimedia applications will rely on the transmission of 3D video contents within heterogeneous fruition scenarios, and as a matter of fact, the reliable delivery of 3D video signals proves to be a crucial issue in such communications. To this purpose, multimedia communication experts have been designing cross-layer strategies to improve the quality of the perceived 3D experience. This article presents a new cross-layer strategy, called Cognitive Source Coding (CSC), that defines a new 3D video system able to identify the different elements of the 3D scene and choose the most appropriate coding strategy.

References

  1. Aaron, A., Zhang, R., and Girod, B. 2002. Wyner-ziv coding for motion video. In Proceedings of the Asilomar Conference on Signals, Systems and Computers. Vol. 1. 240--244.Google ScholarGoogle Scholar
  2. Adikari, A. B. B., Fernando, W. A. C., Weerakkody, W. A. R. J., Kondoz, A., Martínez, J. L., and Cuenca, P. 2008. DVC based stereoscopic video transmission in a mobile communication system. In Proceedings of the IEEE Future Multimedia Networking (FMN). (co-located with NGMAST'08). 439--443. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Aksay, A., Bilen, C., Kurutepe, E., Ozcelebi, T., Akar, G. B., Civanlar, R., and Tekalp, M. 2006. Temporal and spatial scaling for stereoscopic video compression. In Proceedings of the European Signal Processing Conference (EUSIPCO).Google ScholarGoogle Scholar
  4. Alregib, G., Altunbasak, Y., and Rossignac, J. 2005. Error-resilient transmission of 3d models. ACM Trans. Graph. 24, 2, 182--208. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Artigas, X., Ascenso, J., Dalai, M., Klomp, S., Kubasov, D., and Ouaret, M. 2007. The DISCOVER codec: Architecture, techniques and evaluation. In Proceedings of the Picture Coding Symposium (PCS).Google ScholarGoogle Scholar
  6. Balter, R., Gioia, P., and Morin, L. 2006. Scalable and efficient coding using 3D modeling. IEEE Trans. Multimedia 8, 6, 1147--1155. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Benoit, A., Callet, P. L., Campisi, P., and Cousseau, R. 2008. Quality assessment of stereoscopic images. EURASIP J. Image Video Process. ID 659024.Google ScholarGoogle Scholar
  8. Boser, B. E., Guyon, I. M., and Vapnik, V. N. 1992. A training algorithm for optimal margin classifiers. In Proceedings of the 5th Annual ACM Workshop on Computational Learning Theory (COLT). 144--152. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Boughorbel, S., Tarel, J. P., and Boujemaa, N. 2005. Conditionally positive definite kernels for SVM based image recognition. In Proceedings of the IEEE International Conference on Multimedia and Expo (ICME). 113--116.Google ScholarGoogle Scholar
  10. Bremond, R., Petit, J., and Tarel, J.-P. 2010. Saliency maps of high dynamic range images. In Proceedings of the Media Retargeting Workshop in conjunction with ECCV'10. http://perso.lcpc.fr/tarel.jean-philippe/publis/weccv10.html. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Crave, O., Guillemot, C., Pesquet-Popescu, B., and Tillier, C. 2008. Multiple description source coding with side information. In Proceedings of the European Signal Processing Conference (EUSIPCO).Google ScholarGoogle Scholar
  12. Fan, Y., Wang, J., Sun, J., Wang, P., and Yu, S. 2003. A novel multiple description video codec based on Slepian-Wolf coding. In Proceedings of the Data Compression Conference (DCC). 515. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Färber, N., Stuhlmuller, K., and Girod, B. 1999. Analysis of error propagation in hybrid video coding with application to error resilience. In Proceedings of the International Conference on Image Processing, (ICIP). 550--554.Google ScholarGoogle Scholar
  14. Fehn, C. 2004. 3D-TV Using depth-image-based rendering (DIBR). In Proceedings of the Picture Coding Symposium (PCS).Google ScholarGoogle Scholar
  15. Felzenszwalb, P. F. and Huttenlocher, D. P. 2004. Efficient graph-based image segmentation. Int. J. Computer Vision 59, 2, 167--181. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Fraunhofer HHI. 2011. Repository of FHG HHI on 3DTV NoE. https://www.3dtv-research.org/3dav/3DAV_Demos/FHG_HHI/Sequences/.Google ScholarGoogle Scholar
  17. Goel, S., Ismael, Y., and Boyoumi, M. A. 2005. Adaptive search window size algorithm for fast motion estimation in H.264/AVC standard. In Proceedings of the IEEE International Midwest Symposium on Circuits and Systems (MWSCAS). 1557--1560.Google ScholarGoogle Scholar
  18. Goyal, V. K. 2001. Multiple description coding: compression meets the network. IEEE Signal Process. Mag. 8, 5, 74--93.Google ScholarGoogle ScholarCross RefCross Ref
  19. Haykin, S. 2005. Cognitive radio: Brain-empowered wireless communications. IEEE J. Sel. Areas Comm. 23, 2, 201--220. (Invited). Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. ISO/IEC JTC1. 2001. Coding of audio-visual objects - Part 2: Visual. ISO/IEC 14 496-2 (MPEG-4 Visual version 1), 4/99; Amendment 1 (version 2), 2/00; Amendment 4 (streaming profile), 1/01.Google ScholarGoogle Scholar
  21. ITU-T. 1995. Video coding for low bitrate communications, Version 1. ITU-T Recommendation H.263.Google ScholarGoogle Scholar
  22. ITU-T and ISO/IEC JTC1. 1994. Generic coding of moving pictures and associated audio information - Part 2: Video. ITU-T Recommendation H.262-ISO/IEC 13 818-2 (MPEG-2).Google ScholarGoogle Scholar
  23. Jagmohan, A. and Ahuja, N. 2003. Wyner-Ziv encoded predictive multiple descriptions. In Proceedings of the Data Compression Conference (DCC). 213--222. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Karim, H. A., Hewage, C. T. E. R., Yu, A. C., Worral, S., Dogan, S., and Kondoz, A. M. 2007. Scalable multiple description 3D video coding based on even and odd frame. In Proceedings of the Pretante Coding Symposium (PCS).Google ScholarGoogle Scholar
  25. Katsaggelos, A. K., Eisenberg, Y., Zhai, F., Berry, R., and Pappas, T. N. 2005. Advances in efficient resource allocation for packet-based real-time video transmission. Proc. IEEE 93, 1, 135--147.Google ScholarGoogle ScholarCross RefCross Ref
  26. Liao, J. and Villasenor, J. 2000. Adaptive intra block update for robust transmission of H.263. IEEE Trans. Circuits Syst. Video Technol. 10, 1, 30--35. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Microsoft Research. 2011. MSR 3D video. http://research.microsoft.com/en-us/um/people/sbkang/3dvideodownload.Google ScholarGoogle Scholar
  28. Milani, S. 2010. Simone Milani's Homepage. http://www.dei.unipd.it/~sim1mil/downloads.html.Google ScholarGoogle Scholar
  29. Milani, S. 2011. Simone Milani's Homepage. http://www.dei.unipd.it/~sim1mil/publications.html♯CSCDemo.Google ScholarGoogle Scholar
  30. Milani, S. and Calvagno, G. 2009. A distributed video coding approach for multiple description video transmission over lossy channels. In Proceedings of the European Signal Processing Conference (EUSIPCO). 1824--1828.Google ScholarGoogle Scholar
  31. Milani, S. and Calvagno, G. 2010a. A cognitive approach for effective coding and transmission of 3D video. In Proceedings of the ACM Multimedia 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Milani, S. and Calvagno, G. 2010b. A cognitive source coding scheme for multiple description 3DTV transmission. In Proceedings of the 11th International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS'10).Google ScholarGoogle Scholar
  33. Milani, S. and Calvagno, G. 2010c. Multiple description distributed video coding using redundant slices and lossy syndromes. IEEE Sig. Process. Lett. 17, 1, 51--54.Google ScholarGoogle ScholarCross RefCross Ref
  34. Mobile3DTV project. 2011. 3D Video database. http://sp.cs.tut.fi/mobile3dtv/stereo-video/.Google ScholarGoogle Scholar
  35. Norkin, A., Aksay, A., Bilen, C., Akar, G. B., Gotchev, A., and Astola, J. 2006. Schemes for multiple description coding of stereoscopic 3D. In Proceedings of the Symposium on Multimedia Content Representation, Classification and Security. Lecture Notes in Computer Science, vol. 4105/2006. Springer, 730--737. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Puri, R. and Ramchandran, K. 2002. PRISM: A new robust video coding architecture based on distributed compression principles. In Proceedings of the Allerton Conference 2002. 402--408.Google ScholarGoogle Scholar
  37. Reusens, E., Castagno, R., Buhan, C. L., Piron, L., Ebrahimi, T., and Kunt, M. 1996. Dynamic video coding—an overview. In Proceedings of the IEEE International Conference on Image Processing (ICIP). 377--380.Google ScholarGoogle Scholar
  38. Rosenberg, J. and Schulzrinne, H. 1999. An RTP payload format for generic forward error correction (RFC2733). Internet Draft, Network Working Group. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Saxena, A., Sun, M., and Ng, A. Y. 2009. Make3D: Learning 3D scene structure from a single still image. IEEE Trans. Pattern Anal. Mach. Intell. 30, 5, 824--840. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. Schierl, T., Stockhammer, T., and Wiegand, T. 2007. Compression of multiple depth maps for DIBR. IEEE Trans. Circuits Syst. Video Technol. 17, 9, 1204--1217. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. Schulzrinne, H., Casner, S., Frederick, R., and Jacobson, V. 1996. RTP: A transport protocol for real-time applications (RFC1889). In Network Working Group.Google ScholarGoogle Scholar
  42. Shi, S., Jeon, W., Nahrsted, K., and Campbell, R. 2009. M-TEEVE: Real-Time 3D video interaction and broadcasting framework for mobile devices. In Proceedings of the 2nd International Conference on Immersive Telecommunications (IMMERSCOM'09). Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Wang, A., Zhao, Y., and Bai, H. 2009. Robust description distributed video coding using optimized zero-padding. Sci. China Ser. F-Inf. Sci. 52, 2, 206--214.Google ScholarGoogle ScholarCross RefCross Ref
  44. Wang, J., Wu, X., Yu, S., and Sun, J. 2006. Multiple descriptions in the Wyner-Ziv setting. In Proceedings of the IEEE Internet Symposium on Information Theory (ISIT). 1584--1588.Google ScholarGoogle Scholar
  45. Wang, Z., Bovik, A. C., Sheikh, H. R., and Simoncelli, E. P. 2004. Image Quality Assessment: From Error Visibility to Structural Similarity. IEEE Trans. Image Process. 13, 4, 600--612. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Wiegand, T. 2004. Version 3 of H.264/AVC. In Proceedings of the 12th JVT Meeting.Google ScholarGoogle Scholar
  47. Wu, M., Vetro, A., and Chen, C. W. 2004. Multiple Description Image Coding with Distributed Source Coding and Side Information. In Proceedings of SPIE: Multimedia Systems and Applications VII. Vol. 5600. 120--127.Google ScholarGoogle Scholar
  48. Yeo, C. and Ramchandran, K. 2007. Robust distributed multiview video compression for wireless camera networks. In Proceedings of the IEEE Visual Communications and Image Processing (VCIP 2007). Vol. 6508. 65080P-1--65080P-9.Google ScholarGoogle Scholar

Index Terms

  1. A cognitive approach for effective coding and transmission of 3D video

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Multimedia Computing, Communications, and Applications
          ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 7S, Issue 1
          Special section on ACM multimedia 2010 best paper candidates, and issue on social media
          October 2011
          246 pages
          ISSN:1551-6857
          EISSN:1551-6865
          DOI:10.1145/2037676
          Issue’s Table of Contents

          Copyright © 2011 ACM

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 4 November 2011
          • Revised: 1 May 2011
          • Accepted: 1 May 2011
          • Received: 1 January 2011
          Published in tomm Volume 7S, Issue 1

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader
        About Cookies On This Site

        We use cookies to ensure that we give you the best experience on our website.

        Learn more

        Got it!