Abstract
The ability to create super high-resolution video is becoming relative easy to do either through a single high-definition video camera or panoramic video that automatically stitches multiple views together. As an example of the former, the motion picture industry now has 6000 × 4000 pixel full-rate video cameras available. This means that supporting region-of-interest cropping will become more important in the future. In this article, we propose a mechanism to support region-of-interest adaptation of stored video. The proposed approach creates a compression-compliant stream (e.g., MPEG-2), while still allowing it to be cropped. Fortunately, video standards like MPEG-2 specify the format of a compliant stream, and not the algorithm to get there. As a result, there is an opportunity to allow system researchers and implementers ways to optimize for applications. We show various fundamental tradeoffs that are made in order to support region-of-interest cropping with super high-resolution video which we received from a local motion-picture firm.
- Agarwal A., Feng, Wu-chi, and Wolfe, C. 2000. A multi-differential video coding algorithm for robust video conferencing. In Proceedings of the SPIE Voice, Video, and Data Communications Conference.Google Scholar
- Ahmad, I., Wei, X., and Sun, Y. 2005. Video transcoding: an overview of various techniques and research issues. IEEE Trans. Multimedia 7, 5. Google Scholar
Digital Library
- Augustine, J., Rao, S. K., Jouppi, N., and Iyer, S. 2004. Region of interest editing of MPEG-2 video streams in the compressed domain. In Proceedings of the IEEE International Conference on Multimedia and Expo. IEEE, Los Alamitos, CA, 559--562.Google Scholar
- Bae, T. M., Thang, T. C., Kim, D. Y., Ro, Y. M., Kang, J. W., and Kim, J. G. 2006. Multiple region-of-interest support in scalable video coding. ETRI J. 28, 2, 239--242.Google Scholar
Cross Ref
- Dugad, R. and Ahuja, N. 2003. A scheme for spatial scalability using nonscalable encoders. IEEE Trans. Circuits Syst. Video Technol. 13, 10. Google Scholar
Digital Library
- El-Alfy, H., Jacobs, J., and Davis, L. 2007. Multi-scale video cropping. In Proceedings of the ACM Multimedia. ACM, New York, 97--106. Google Scholar
Digital Library
- Fan, X., Xie, X., Zhou, H. Q., and Ma, W. Y. 2003. Looking into video frames on small displays. In Proceedings of the ACM Multimedia. ACM, New York, 247--250. Google Scholar
Digital Library
- Feng, W., Dang, T., Kassebaum, J., and Bauman, T. 2008. Supporting region-of-interest cropping through constrained compression. In Proceedings of the ACM Multimedia. ACM, New York. Google Scholar
Digital Library
- Haskell, B., Puri, A., and Netravali, A. 1996. Digital Video Compression Standard: An Introduction to MPEG-2. Chapman & Hall. http://ffmpeg.mplayerhq.hu.Google Scholar
- Huang, J., Feng, W., and Walpole, J. 2006. An experimental analysis of DCT-based approaches for fine-grained multiresolution video. ACM Multimedia Syst. J. 11, 6, 513--531.Google Scholar
Digital Library
- Lambert, P., De Schrijver, D., Van Deursen, D., De Neve., W., Dhondt, Y., and Van de Walle, R. 2006. A real-time content adaptation framework for exploiting ROI scalability in H.264/AVC. In Advanced Concepts for Intelligent Vision Systems, vol. 4179, 442--453. Google Scholar
Digital Library
- Le Gall, D. 1991. MPEG: A video compression standard for multimedia applications. Comm. ACM 34, 4, 46--58. Google Scholar
Digital Library
- Lin, C. W., Chen, Y. C., and Sun, M. T. 2003. Dynamic region of interest transcoding for multipoint video conferencing. IEEE Trans. Circuits Syst. Video Technol. 13, 10, 982--992. Google Scholar
Digital Library
- Mavlankar, A., Baccichet, P., Varodayan, D., and Girod, B. 2007. Optimal slice size for streaming regions of high resolution video with virtual pan/tilt/zoom functionality. In Proceedings of the 15th European Signal Processing Conference (EUSIPCO).Google Scholar
- Rehan, M. and Agathokilis, P. 2007. Frame accurate video cropping in compressed MPEG domain. In Proceedings of the IEEE Pacific Rim Conference on Communications, Computers, and Signal Processing. IEEE, Los Alamitos, CA, 573--576.Google Scholar
- Sinha, A., Agarwal, G., and Anbu, A. 2004. Region-of-interest based compressed domain video transcoding. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. vol. 3, IEEE, Los Alamitos, CA, 161--164.Google Scholar
- Sun, X., Foote, J., Kimber, D., and Manjunath, B. S. 2005. Region of interest extraction and virtual camera control based on panoramic video capturing. IEEE Trans. Multimedia 7, 5, 981--990. Google Scholar
Digital Library
- Wang, H., El-Maleh, K., and Liang, Y. J. 2006. Real-time region-of-interest video coding using content-adaptive background skipping with dynamic bit reallocation. In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. IEEE, Los Alamitos, CA.Google Scholar
Index Terms
Supporting region-of-interest cropping through constrained compression
Recommendations
Supporting zoomable video streams with dynamic region-of-interest cropping
MMSys '10: Proceedings of the first annual ACM SIGMM conference on Multimedia systemsStreaming of an arbitrary region of interest (RoI) from a high resolution video is essential to supporting zooming and panning within a video stream. This paper explores two methods for RoI-based streaming, referring to them as tiled streaming and ...
Supporting region-of-interest cropping through constrained compression
MM '08: Proceedings of the 16th ACM international conference on MultimediaThe ability to create very high-resolution video is becoming relative easy to do today, either through a single high definition video camera or panoramic video stitched from multiple cameras. This means that supporting region-of-interest cropping will ...
Saturation-aware human attention region of interest algorithm for efficient video compression
AbstractWe propose a saturation-aware human attention region-of-interest (SA-HAROI) video compression method that performs a perceptual adaptive quantization algorithm on video frames as a function of the distribution of their luminance, motion vector, ...






Comments