Skip to main content

Very Low Bit Rate Video Coding Based on Statistical Spatio-Temporal Prediction of Motion, Segmentation and Intensity Fields

  • Chapter
Video Data Compression for Multimedia Computing

Abstract

There are a large number of applications requiring the compression of video at Very Low Bit Rates (VLBR). Such applications include wireless video conferencing, video over the internet, multimedia database retrieval and remote sensing and monitoring. Recently, the MPEG-4 standardization effort has been a motivating factor to find a solution to this challenging problem. The existing approaches to this problem can generally be grouped into block-based, model-based, and object-oriented. Block-based approaches follow the traditional strategy of decoupling the image sequence into blocks, model-based approaches rely on complex 3-D models for specific objects that are encoded, and object-oriented approaches rely on analyzing the scene into differently moving objects. All three approaches exhibit potential problems. Block-based approaches tend to generate artifacts at the boundaries of the blocks, as well as to limit the minimum achievable bit rate due to the fixed analysis structure of the scene. Model-based codecs are limited by the complex 3-D models of the objects to be encoded. On the other hand, object-oriented codecs can generate a significant overhead due to the analysis of the scene which needs to be transmitted, which in turn can be the limiting factor in achieving the target bit rates. In this chapter, we propose a hybrid object-oriented codec in which the correlations among the three information fields, e.g., motion, segmentation and intensity fields, are exploited both spatially and temporally. In the proposed method, additional intelligence is given to the decoder, resulting in a reduction of the required bandwidth. The residual information is analyzed in three different categories, i.e., occlusion, model failures, and global refinement. With other side information, it is encoded and transmitted across the channel. Experimental results are presented which demonstrate the effectiveness of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. K. Aizawa, H. Harashima and T. Saito, “Model-based synthesis image coding (MBASIC) system for a person’s face,” Signal Processing: Image Communication, vol. 1, no. 2, pp. 139–152, Oct. 1989.

    Article  Google Scholar 

  2. H. G. Mussmann, M. Hotter and J. Ostermann, “Object-oriented analysis-synthesis coding of moving images,” Signal Processing: Image Communication, pp. 117–138, vol. 1, no. 2, 1989.

    Article  Google Scholar 

  3. M. Hotter, “Object-oriented analysis-synthesis coder based on the model of flexible 2-D objects,” Signal Processing: Image Communication, vol. 2, no. 4, pp. 409–428, Dec. 1990.

    Article  Google Scholar 

  4. T. özc̣elik, “A very low bit rate video codec,” Ph.D. Dissertation, Northwestern University, Dept. of EECS, Dec. 1994.

    Google Scholar 

  5. T. özc̣elik, J. C. Brailean, and A. K. Katsaggelos, “Image and video coding algorithms based on recovery techniques using mean field annealing,” IEEE Proceedings, vol. 83, no. 2, pp. 304–316, Feb. 1995.

    Article  Google Scholar 

  6. M. R. Banham, J. C. Brailean, C. Chan and A. K. Katsaggelos, “Low bit rate video coding using robust motion vector regeneration in the decoder,” IEEE Trans. Image Processing, vol. 3, no. 5, pp. 652–665, Sept. 1994.

    Article  Google Scholar 

  7. J. C. Brailean and A. K. Katsaggelos, “A recursive nonstationary MAP displacement vector field estimation algorithm,” IEEE Trans. Image Processing, vol. 4, no. 4, pp. 416–429, Apr. 1995.

    Article  Google Scholar 

  8. T. özcelik and A. K. Katsaggelos, “Robust displacement vector field prediction algorithms with application to very low bit rate video coding,” Proc. SPIE Visual Communications and Image Processing, Cambridge, MA, vol. 2094, pp. 1378–1389, Nov. 1993.

    Google Scholar 

  9. M. Hotter, “Predictive contour coding for an object-oriented analysis-synthesis coder,” Proc. IEEE International Symposium on Information Theory, pp. 75, San Diego, CA, Jan. 1990.

    Google Scholar 

  10. H. H. Chen, M. R. Civanlar and B. G. Haskell, “A block transform coder for arbitrarily shaped image segments,” Proc. Very Low Bit Rate Video Workshop, paper no. 1.1, Essex, UK, 1994.

    Google Scholar 

  11. M. Hotter and R. Thoma, “Image segmentation based on object-oriented mapping parameter estimation,” Signal Processing, vol. 15, no.3, pp. 315–334, Oct. 1988.

    Article  Google Scholar 

  12. N. Diehl, “Object-oriented motion estimation and segmentation in image sequences”, Signal Processing: Image Communication, pp. 23-56, Dec. 1991.

    Google Scholar 

  13. (invited) A. K. Katsaggelos and T. özcelik, “A hybrid object-oriented approach to very low bit rate coding of video,” Proc. 1995 International Conf. on Digital Signal Processing, pp. 2-7, Limassol, Cyprus, June 1995.

    Google Scholar 

  14. (invited) T. özc̣elik and A. K. Katsaggelos, “Exploitation of spatio-temporal inter-correlation among motion, segmentation and intensity fields for very low bit rate coding of video,” Proc. Symposium on Multimedia Communications and Video Coding, New York, NY, Oct. 1995.

    Google Scholar 

  15. T. özc̣elik and A. K. Katsaggelos, “Detection and encoding of model failures in very low bit rate video coding”, Proc. SPIE Visual Communications and Image Processing, Orlando, FL, March 1996.

    Google Scholar 

  16. J. C. Brailean and A. K. Katsaggelos, “Displacement field estimation in noisy image sequences”, Proc. EUSIPCO-92, pp. 1319-1322, Brussels, Belgium, Aug. 1992.

    Google Scholar 

  17. J. C. Brailean, T. özc̣elik and A. K. Katsaggelos, “Restoration of low bit rate compressed images using mean field annealing,” Proc. IEEE ICASSP’ 94, Adelaide, Australia, vol. V, pp. 237–240, April 1994.

    Google Scholar 

  18. M. Kunt, A. Ikonomopoulos, and M. Kocher, “Second generation image coding techniques,” IEEE Proceedings, vol. 73, no. 4, pp. 549–575, April 1985.

    Article  Google Scholar 

  19. G. M. Schuster and A. K. Katsaggelos, “Optimal lossy segmentation encoding scheme,” Proc. SPIE Conf. Visual Commun, and Image Processing, Orlando, FL, March 1996.

    Google Scholar 

  20. T. özc̣elik and A. K. Katsaggelos, “Detection and encoding of occluded areas in very low bit rate video coding”, Proc. ICASSP-96, Atlanta, GA, May, 1996.

    Google Scholar 

  21. S. Efstratiadis, Y. Huang, Z. Xiong, N. P. Galatsanos and A. K. Katsaggelos, “Motion compensated priority discrete cosine transform coding of image sequences”, Proc. SPIE Con. Visual Commun. and Image Processing, pp. 16-25, Boston, MA, Nov. 1991.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer Science+Business Media New York

About this chapter

Cite this chapter

Özc̣elik, T., Katsaggelos, A.K. (1997). Very Low Bit Rate Video Coding Based on Statistical Spatio-Temporal Prediction of Motion, Segmentation and Intensity Fields. In: Li, H.H., Sun, S., Derin, H. (eds) Video Data Compression for Multimedia Computing. The Springer International Series in Engineering and Computer Science, vol 378. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-6239-9_9

Download citation

  • DOI: https://doi.org/10.1007/978-1-4615-6239-9_9

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-7862-4

  • Online ISBN: 978-1-4615-6239-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics