Abstract
Future multimedia applications will rely on the transmission of 3D video content across heterogeneous usage scenarios, which makes the reliable delivery of 3D video signals a crucial issue. To this end, multimedia communication experts have been designing cross-layer strategies to improve the perceived quality of the 3D experience. This article presents a new cross-layer strategy, called Cognitive Source Coding (CSC), that defines a 3D video system able to identify the different elements of the 3D scene and choose the most appropriate coding strategy.