Object-Based Subband/Wavelet Video Compression

  • Soo-Chul Han
  • John W. Woods
Part of The International Series in Engineering and Computer Science book series (SECS, volume 450)


This chapter presents a subband/wavelet video coder that uses an object-based spatiotemporal segmentation. The moving objects in a video are extracted by a joint motion estimation and segmentation algorithm based on a compound Markov random field (MRF) model. Two important features of our technique are the temporal linking of the objects and the guidance of the motion segmentation by spatial color information. This results in spatiotemporal (3-D) objects that are stable in time, and leads to a new motion-compensated temporal updating and contour coding scheme that greatly reduces the bit rate needed to transmit the object boundaries. The object interiors can be encoded by either 2-D or 3-D subband/wavelet coding. Simulations at very low bit rates yield reconstructed PSNR comparable to that of the H.263 coder. The object-based coder, however, produces visually more pleasing video, with less blurriness and no block artifacts.
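
As a point of reference for the PSNR comparison mentioned above, the following is a minimal Python sketch of the conventional PSNR definition for a single frame. The function name `psnr`, the nested-list frame representation, and the peak value of 255 (8-bit samples) are assumptions made for illustration; the chapter's exact evaluation setup is not stated in this abstract.

```python
import math


def psnr(reference, reconstructed, peak=255.0):
    """Return the PSNR in dB between two equally sized frames.

    Frames are nested lists of pixel values (rows of samples).
    `peak` is the maximum possible sample value; 255 assumes 8-bit video.
    """
    if len(reference) != len(reconstructed):
        raise ValueError("frames must have the same number of rows")

    squared_error = 0.0
    count = 0
    for ref_row, rec_row in zip(reference, reconstructed):
        if len(ref_row) != len(rec_row):
            raise ValueError("rows must have the same length")
        for ref_px, rec_px in zip(ref_row, rec_row):
            squared_error += (ref_px - rec_px) ** 2
            count += 1

    if count == 0:
        raise ValueError("frames must not be empty")

    mse = squared_error / count  # mean squared error over all samples
    if mse == 0:
        return math.inf  # identical frames: PSNR is unbounded
    return 10.0 * math.log10(peak * peak / mse)
```

For a sequence, a common convention is to average the per-frame values, though other conventions exist. PSNR does not capture blurriness or block artifacts directly, which is why the abstract reports visual quality separately from the PSNR comparison.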


Keywords: Motion Vector, Motion Estimation, Object Boundary, Markov Random Field, Video Compression
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



Copyright information

© Kluwer Academic Publishers 2002
