Abstract
There are a large number of applications requiring the compression of video at Very Low Bit Rates (VLBR). Such applications include wireless video conferencing, video over the internet, multimedia database retrieval and remote sensing and monitoring. Recently, the MPEG-4 standardization effort has been a motivating factor to find a solution to this challenging problem. The existing approaches to this problem can generally be grouped into block-based, model-based, and object-oriented. Block-based approaches follow the traditional strategy of decoupling the image sequence into blocks, model-based approaches rely on complex 3-D models for specific objects that are encoded, and object-oriented approaches rely on analyzing the scene into differently moving objects. All three approaches exhibit potential problems. Block-based approaches tend to generate artifacts at the boundaries of the blocks, as well as to limit the minimum achievable bit rate due to the fixed analysis structure of the scene. Model-based codecs are limited by the complex 3-D models of the objects to be encoded. On the other hand, object-oriented codecs can generate a significant overhead due to the analysis of the scene which needs to be transmitted, which in turn can be the limiting factor in achieving the target bit rates. In this chapter, we propose a hybrid object-oriented codec in which the correlations among the three information fields, e.g., motion, segmentation and intensity fields, are exploited both spatially and temporally. In the proposed method, additional intelligence is given to the decoder, resulting in a reduction of the required bandwidth. The residual information is analyzed in three different categories, i.e., occlusion, model failures, and global refinement. With other side information, it is encoded and transmitted across the channel. Experimental results are presented which demonstrate the effectiveness of the proposed approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
K. Aizawa, H. Harashima and T. Saito, “Model-based synthesis image coding (MBASIC) system for a person’s face,” Signal Processing: Image Communication, vol. 1, no. 2, pp. 139–152, Oct. 1989.
H. G. Mussmann, M. Hotter and J. Ostermann, “Object-oriented analysis-synthesis coding of moving images,” Signal Processing: Image Communication, pp. 117–138, vol. 1, no. 2, 1989.
M. Hotter, “Object-oriented analysis-synthesis coder based on the model of flexible 2-D objects,” Signal Processing: Image Communication, vol. 2, no. 4, pp. 409–428, Dec. 1990.
T. özc̣elik, “A very low bit rate video codec,” Ph.D. Dissertation, Northwestern University, Dept. of EECS, Dec. 1994.
T. özc̣elik, J. C. Brailean, and A. K. Katsaggelos, “Image and video coding algorithms based on recovery techniques using mean field annealing,” IEEE Proceedings, vol. 83, no. 2, pp. 304–316, Feb. 1995.
M. R. Banham, J. C. Brailean, C. Chan and A. K. Katsaggelos, “Low bit rate video coding using robust motion vector regeneration in the decoder,” IEEE Trans. Image Processing, vol. 3, no. 5, pp. 652–665, Sept. 1994.
J. C. Brailean and A. K. Katsaggelos, “A recursive nonstationary MAP displacement vector field estimation algorithm,” IEEE Trans. Image Processing, vol. 4, no. 4, pp. 416–429, Apr. 1995.
T. özcelik and A. K. Katsaggelos, “Robust displacement vector field prediction algorithms with application to very low bit rate video coding,” Proc. SPIE Visual Communications and Image Processing, Cambridge, MA, vol. 2094, pp. 1378–1389, Nov. 1993.
M. Hotter, “Predictive contour coding for an object-oriented analysis-synthesis coder,” Proc. IEEE International Symposium on Information Theory, pp. 75, San Diego, CA, Jan. 1990.
H. H. Chen, M. R. Civanlar and B. G. Haskell, “A block transform coder for arbitrarily shaped image segments,” Proc. Very Low Bit Rate Video Workshop, paper no. 1.1, Essex, UK, 1994.
M. Hotter and R. Thoma, “Image segmentation based on object-oriented mapping parameter estimation,” Signal Processing, vol. 15, no.3, pp. 315–334, Oct. 1988.
N. Diehl, “Object-oriented motion estimation and segmentation in image sequences”, Signal Processing: Image Communication, pp. 23-56, Dec. 1991.
(invited) A. K. Katsaggelos and T. özcelik, “A hybrid object-oriented approach to very low bit rate coding of video,” Proc. 1995 International Conf. on Digital Signal Processing, pp. 2-7, Limassol, Cyprus, June 1995.
(invited) T. özc̣elik and A. K. Katsaggelos, “Exploitation of spatio-temporal inter-correlation among motion, segmentation and intensity fields for very low bit rate coding of video,” Proc. Symposium on Multimedia Communications and Video Coding, New York, NY, Oct. 1995.
T. özc̣elik and A. K. Katsaggelos, “Detection and encoding of model failures in very low bit rate video coding”, Proc. SPIE Visual Communications and Image Processing, Orlando, FL, March 1996.
J. C. Brailean and A. K. Katsaggelos, “Displacement field estimation in noisy image sequences”, Proc. EUSIPCO-92, pp. 1319-1322, Brussels, Belgium, Aug. 1992.
J. C. Brailean, T. özc̣elik and A. K. Katsaggelos, “Restoration of low bit rate compressed images using mean field annealing,” Proc. IEEE ICASSP’ 94, Adelaide, Australia, vol. V, pp. 237–240, April 1994.
M. Kunt, A. Ikonomopoulos, and M. Kocher, “Second generation image coding techniques,” IEEE Proceedings, vol. 73, no. 4, pp. 549–575, April 1985.
G. M. Schuster and A. K. Katsaggelos, “Optimal lossy segmentation encoding scheme,” Proc. SPIE Conf. Visual Commun, and Image Processing, Orlando, FL, March 1996.
T. özc̣elik and A. K. Katsaggelos, “Detection and encoding of occluded areas in very low bit rate video coding”, Proc. ICASSP-96, Atlanta, GA, May, 1996.
S. Efstratiadis, Y. Huang, Z. Xiong, N. P. Galatsanos and A. K. Katsaggelos, “Motion compensated priority discrete cosine transform coding of image sequences”, Proc. SPIE Con. Visual Commun. and Image Processing, pp. 16-25, Boston, MA, Nov. 1991.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1997 Springer Science+Business Media New York
About this chapter
Cite this chapter
Özc̣elik, T., Katsaggelos, A.K. (1997). Very Low Bit Rate Video Coding Based on Statistical Spatio-Temporal Prediction of Motion, Segmentation and Intensity Fields. In: Li, H.H., Sun, S., Derin, H. (eds) Video Data Compression for Multimedia Computing. The Springer International Series in Engineering and Computer Science, vol 378. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-6239-9_9
Download citation
DOI: https://doi.org/10.1007/978-1-4615-6239-9_9
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-7862-4
Online ISBN: 978-1-4615-6239-9
eBook Packages: Springer Book Archive