Very Low Bit Rate Video Coding Based on Statistical Spatio-Temporal Prediction of Motion, Segmentation and Intensity Fields

Özc̣elik, Taner; Katsaggelos, Aggelos K.

doi:10.1007/978-1-4615-6239-9_9

Taner Özc̣elik⁴ &
Aggelos K. Katsaggelos⁵

Part of the book series: The Springer International Series in Engineering and Computer Science ((SECS,volume 378))

131 Accesses
1 Citations

Abstract

There are a large number of applications requiring the compression of video at Very Low Bit Rates (VLBR). Such applications include wireless video conferencing, video over the internet, multimedia database retrieval and remote sensing and monitoring. Recently, the MPEG-4 standardization effort has been a motivating factor to find a solution to this challenging problem. The existing approaches to this problem can generally be grouped into block-based, model-based, and object-oriented. Block-based approaches follow the traditional strategy of decoupling the image sequence into blocks, model-based approaches rely on complex 3-D models for specific objects that are encoded, and object-oriented approaches rely on analyzing the scene into differently moving objects. All three approaches exhibit potential problems. Block-based approaches tend to generate artifacts at the boundaries of the blocks, as well as to limit the minimum achievable bit rate due to the fixed analysis structure of the scene. Model-based codecs are limited by the complex 3-D models of the objects to be encoded. On the other hand, object-oriented codecs can generate a significant overhead due to the analysis of the scene which needs to be transmitted, which in turn can be the limiting factor in achieving the target bit rates. In this chapter, we propose a hybrid object-oriented codec in which the correlations among the three information fields, e.g., motion, segmentation and intensity fields, are exploited both spatially and temporally. In the proposed method, additional intelligence is given to the decoder, resulting in a reduction of the required bandwidth. The residual information is analyzed in three different categories, i.e., occlusion, model failures, and global refinement. With other side information, it is encoded and transmitted across the channel. Experimental results are presented which demonstrate the effectiveness of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

K. Aizawa, H. Harashima and T. Saito, “Model-based synthesis image coding (MBASIC) system for a person’s face,” Signal Processing: Image Communication, vol. 1, no. 2, pp. 139–152, Oct. 1989.
Article Google Scholar
H. G. Mussmann, M. Hotter and J. Ostermann, “Object-oriented analysis-synthesis coding of moving images,” Signal Processing: Image Communication, pp. 117–138, vol. 1, no. 2, 1989.
Article Google Scholar
M. Hotter, “Object-oriented analysis-synthesis coder based on the model of flexible 2-D objects,” Signal Processing: Image Communication, vol. 2, no. 4, pp. 409–428, Dec. 1990.
Article Google Scholar
T. özc̣elik, “A very low bit rate video codec,” Ph.D. Dissertation, Northwestern University, Dept. of EECS, Dec. 1994.
Google Scholar
T. özc̣elik, J. C. Brailean, and A. K. Katsaggelos, “Image and video coding algorithms based on recovery techniques using mean field annealing,” IEEE Proceedings, vol. 83, no. 2, pp. 304–316, Feb. 1995.
Article Google Scholar
M. R. Banham, J. C. Brailean, C. Chan and A. K. Katsaggelos, “Low bit rate video coding using robust motion vector regeneration in the decoder,” IEEE Trans. Image Processing, vol. 3, no. 5, pp. 652–665, Sept. 1994.
Article Google Scholar
J. C. Brailean and A. K. Katsaggelos, “A recursive nonstationary MAP displacement vector field estimation algorithm,” IEEE Trans. Image Processing, vol. 4, no. 4, pp. 416–429, Apr. 1995.
Article Google Scholar
T. özcelik and A. K. Katsaggelos, “Robust displacement vector field prediction algorithms with application to very low bit rate video coding,” Proc. SPIE Visual Communications and Image Processing, Cambridge, MA, vol. 2094, pp. 1378–1389, Nov. 1993.
Google Scholar
M. Hotter, “Predictive contour coding for an object-oriented analysis-synthesis coder,” Proc. IEEE International Symposium on Information Theory, pp. 75, San Diego, CA, Jan. 1990.
Google Scholar
H. H. Chen, M. R. Civanlar and B. G. Haskell, “A block transform coder for arbitrarily shaped image segments,” Proc. Very Low Bit Rate Video Workshop, paper no. 1.1, Essex, UK, 1994.
Google Scholar
M. Hotter and R. Thoma, “Image segmentation based on object-oriented mapping parameter estimation,” Signal Processing, vol. 15, no.3, pp. 315–334, Oct. 1988.
Article Google Scholar
N. Diehl, “Object-oriented motion estimation and segmentation in image sequences”, Signal Processing: Image Communication, pp. 23-56, Dec. 1991.
Google Scholar
(invited) A. K. Katsaggelos and T. özcelik, “A hybrid object-oriented approach to very low bit rate coding of video,” Proc. 1995 International Conf. on Digital Signal Processing, pp. 2-7, Limassol, Cyprus, June 1995.
Google Scholar
(invited) T. özc̣elik and A. K. Katsaggelos, “Exploitation of spatio-temporal inter-correlation among motion, segmentation and intensity fields for very low bit rate coding of video,” Proc. Symposium on Multimedia Communications and Video Coding, New York, NY, Oct. 1995.
Google Scholar
T. özc̣elik and A. K. Katsaggelos, “Detection and encoding of model failures in very low bit rate video coding”, Proc. SPIE Visual Communications and Image Processing, Orlando, FL, March 1996.
Google Scholar
J. C. Brailean and A. K. Katsaggelos, “Displacement field estimation in noisy image sequences”, Proc. EUSIPCO-92, pp. 1319-1322, Brussels, Belgium, Aug. 1992.
Google Scholar
J. C. Brailean, T. özc̣elik and A. K. Katsaggelos, “Restoration of low bit rate compressed images using mean field annealing,” Proc. IEEE ICASSP’ 94, Adelaide, Australia, vol. V, pp. 237–240, April 1994.
Google Scholar
M. Kunt, A. Ikonomopoulos, and M. Kocher, “Second generation image coding techniques,” IEEE Proceedings, vol. 73, no. 4, pp. 549–575, April 1985.
Article Google Scholar
G. M. Schuster and A. K. Katsaggelos, “Optimal lossy segmentation encoding scheme,” Proc. SPIE Conf. Visual Commun, and Image Processing, Orlando, FL, March 1996.
Google Scholar
T. özc̣elik and A. K. Katsaggelos, “Detection and encoding of occluded areas in very low bit rate video coding”, Proc. ICASSP-96, Atlanta, GA, May, 1996.
Google Scholar
S. Efstratiadis, Y. Huang, Z. Xiong, N. P. Galatsanos and A. K. Katsaggelos, “Motion compensated priority discrete cosine transform coding of image sequences”, Proc. SPIE Con. Visual Commun. and Image Processing, pp. 16-25, Boston, MA, Nov. 1991.
Google Scholar

Download references

Author information

Authors and Affiliations

Digital Audio and Video Group, SONY Electronics, Inc., USA
Taner Özc̣elik
Department of EECS, Northwestern University, USA
Aggelos K. Katsaggelos

Authors

Taner Özc̣elik
View author publications
You can also search for this author in PubMed Google Scholar
Aggelos K. Katsaggelos
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Texas Tech University, USA
Hua Harry Li & Shan Sun &
University of Massachusetts, USA
Haluk Derin

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Özc̣elik, T., Katsaggelos, A.K. (1997). Very Low Bit Rate Video Coding Based on Statistical Spatio-Temporal Prediction of Motion, Segmentation and Intensity Fields. In: Li, H.H., Sun, S., Derin, H. (eds) Video Data Compression for Multimedia Computing. The Springer International Series in Engineering and Computer Science, vol 378. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-6239-9_9

Download citation

DOI: https://doi.org/10.1007/978-1-4615-6239-9_9
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-7862-4
Online ISBN: 978-1-4615-6239-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics