Skip to main content
Log in

A wavelet video coding algorithm with balanced significance probability tree based on energy weighting

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

This work presents a 3-D wavelet video coding algorithm. By analyzing the contribution of each biorthogonal wavelet basis to reconstructed signal’s energy, we weight each wavelet subband according to its basis energy. Based on distribution of weighted coefficients, we further discuss a 3-D wavelet tree structure named balanced significance probability tree, which places the coefficients with similar probabilities of being significant on the same layer. It is implemented by using hybrid spatial orientation tree and temporal-domain block tree. Subsequently, a novel 3-D wavelet video coding algorithm is proposed based on the energy-weighted balanced significance probability tree. Experimental results illustrate that our algorithm always achieves good reconstruction quality for different classes of video sequences. Compared with asymmetric 3-D orientation tree, the average peak signal-to-noise ratio (PSNR) gain of our algorithm are 1.24dB, 2.54dB and 2.57dB for luminance (Y) and chrominance (U,V) components, respectively. Compared with temporal-spatial orientation tree algorithm, our algorithm gains 0.38dB, 2.92dB and 2.39dB higher PSNR separately for Y, U, and V components. In addition, the proposed algorithm requires lower computation cost than those of the above two algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  1. Bjelopera A, Grgic S (2012) Scalable video coding extension of h.264/AVC. In: Proceedings of the international symposium ELMAR, vol 1, Zadar, Croatia, pp 7–12

  2. Campisi P, Gentile M, Neri A (1999) Three-dimensional wavelet-based approach for a scalable video conference system. In: Proceedings of the IEEE international conference on image processing (ICIP’99), vol 3, Kobe, Japan, pp 802–806

  3. Chai B, Vass J, Zhuang X (1999) Significance link connected component analysis for wavelet image coding. IEEE Trans Image Process 8(6):774–784

    Google Scholar 

  4. Chang Z, Zhuo L, Shen L (2006) An improved motion-compensated three dimension wavelet video coding method. J Circuits Syst 11(1):113–117,121

    Google Scholar 

  5. Chen Y, Pearlman WA (1996) Three-dimensional subband coding of video using the zerotree method. In: Proceedings of the SPIE visual communications and image processing, vol 2727, Snowbird, USA, pp 1302–1312

  6. Chen P, Woods JW (2002) Improved MC-EZBC with quarter-pixel motion vectors, Tech. rep., ISO/IEC JTC 1/SC 29/WG 11, Fairfax VA

  7. Chen PS, Woods JW (2004) Bidirectional MC-EZBC with lifting implementation. IEEE Trans Circuits Syst Video Technol 14(10):1183–1194

    Google Scholar 

  8. Cheng CC, Peng GJ, Hwang WL (2009) Subband weighting with pixel connectivity for 3-D wavelet coding. IEEE Trans Image Process 18(1):52–62

    MathSciNet  MATH  Google Scholar 

  9. Cho Y, Pearlman WA (2007) Quantifying the coding performance of zerotrees of wavelet coefficients: degree-k zerotree. IEEE Trans Signal Process 55(6):2425–2431

    MathSciNet  MATH  Google Scholar 

  10. Daubechies I (1992) Ten lectures on wavelets, 1st Edition, Society for industrial and applied mathematics

  11. Ding W, Hu J, Zhang L (2005) Optimal 3d-SPIHT video coding method by reducing redundancy between trees. J Comput Aided Des Comput Graph 17(3):563–569

    Google Scholar 

  12. Fang S, Zhong Y (2005) 3D Subband codec with full scalability. Mini-Micro Systems 26(7):1260–1263

    Google Scholar 

  13. Fowler JE, Pesquet-Popescu B (2007) An overview on wavelets in source coding, communications, and networks. EURASIP J Image Video Process 2007(1):1–27

    Google Scholar 

  14. He C, Dong J, Zheng Y, Gao Z (2003) Optimal 3-D coefficient tree structure for 3-D wavelet video coding. IEEE Trans Circuits Syst Video Technol 13(10):961–972

    Google Scholar 

  15. Hsiang ST, Woods JW (2000) Embedded image coding using zeroblocks of subband/wavelet coefficient and context modeling. In: Proceedings of the IEEE international symposium on circuit and systems (ISCAS’00), vol 3, Geneva, Switzerland, pp 662–665

  16. ISO/IEC JTC1/SC29/WG11 (2005) SVC requirements specified by MPEG, JVT-N026, Tech. rep., ISO/IEC JTC1/SC29/WG11 Hong Kong

  17. ISO/IEC JTC1/SC29/WG11 (2005) Wavelet codec reference document and software manual, Tech. Rep. ISO/MPEG Video, Tech. Rep, N7334 ISO/IEC JTC1/SC29/WG11

  18. ISO/IEC JTC 1/SC 29/WG 11 (2006) Wavelet video coding - an overview, Doc. W7824, ISO/IEC JTC1/SC29/WG11, Bangkok, Thailand

  19. ISO/IEC JTC 1/SC 29/WG 11 (2006) Status report - version 1 on wavelet video coding exploration, Doc. N7822, ISO/IEC JTC1/SC29/WG11, Bangkok, Thailand

  20. ISO/IEC JTC 1/SC 29/WG 11, ITU-T SG 16 WP 3 (2012) Joint call for proposals on scalable video coding extensions of high efficiency video coding (HEVC), Tech. Rep. N12957, ISO/IEC JTC1/SC29/WG11,Stockholm, Sweden

  21. Khalil H, Atiya AF, Shaheen SI (1999) Lowering frame-buffering requirements of 3-D wavelet transform coding of interactive video. In: Proceedings of the IEEE international conference on image processing (ICIP’99), vol 3, Kobe, Japan, pp 852–856

  22. Khan E, Ghanbari M (2001) Very low bit rate video coding using virtual spiht. Electron Lett 37(1):40–41

    Google Scholar 

  23. Khan E, Ghanbari M (2004) An efficient and scalable low bit-rate video coding with virtual SPIHT. Signal Process Image Commun 19(3):267–283

    Google Scholar 

  24. Kim BJ, Pearlman WA (1997) An embedded wavelet video coder using three-dimensional set partitioning in hierarchical trees (SPIHT). In: Proceedings of the data compression conference, vol 1, Snowbird, USA, pp 251–260

  25. Kim B, Xiong Z, Pearlman WA (2000) Low bit rate, scalable video coding with 3-D set partitioning in hierarchical trees (3-D SPIHT). IEEE Trans Circuits Syst Video Technol 10(12):1374–1387

    Google Scholar 

  26. Lasserre S, Leannec FL, Taquet J, Nassor E Low-complexity intra coding for scalable extension of HEVC based on content statistics, IEEE Transactions on Circuits and Systems for Video Technology 24 (2014) (to be published)

    Google Scholar 

  27. Liu Y, Lee JYB (2017) Post-streaming rate analysisa new approach to mobile video streaming with predictable performance. IEEE Trans Mob Comput 16(12):3488–3501

    Google Scholar 

  28. Lòpez MF, Ruiz VG, García I (2007) Efficiency of closed and open-loop scalable wavelet based video coding, Advanced concepts for intelligent vision systems. Lect Notes Comput Sci 4678(10):800–809

    Google Scholar 

  29. Lu X, Martin GR (2013) Performance comparison of the SVC, WSVC, and Motion JPEG2000 advanced scalable video coding schemes. In: Proceedings of the IET, Intelligent Signal Processing, vol 1, London, pp 1–6

  30. Marpe D, Cycon HL (1999) Very low bit-rate video coding using wavelet-based techniques. IEEE Trans Circuits Syst Video Technol 9(1):85–94

    Google Scholar 

  31. Minami G, Xiong Z, Wang A, Chou PA, Mehrotra S (2001) 3-D Wavelet coding of video with arbitrary regions of support. IEEE Trans Circuits Syst Video Technol 11(9):1063–1068

    Google Scholar 

  32. Moinuddin AA, Khan E, Ghanbari M (2008) Efficient and embedded 3-D wavelet video coding. In: Proceedings TENCON 2008, vol 1, Hyderabad, pp 1–4

  33. Moinuddin AA, Khan E, Ghanbari M (2010) The impact of tree structures on the performance of zerotree-based wavelet video codecs. Signal Process Image Commun 25(3):179–195

    Google Scholar 

  34. Park U-K, Choi H, Kang JW, Kim J-G (2012) Scalable video coding with large block for UHD video. IEEE Trans Consum Electron 28(3):932–940

    Google Scholar 

  35. Pearlman WA, Islam A, Nagaraj N, Said A (2004) Efficient low-complexity image coding with set-partitioning embedded block coder. IEEE Trans Circuits Syst Video Technol 14(11):1219–1235

    Google Scholar 

  36. Said A, Pearlman WA (1996) A new, fast, and efficient image codec based on set partitioning in hierarchical trees. IEEE Trans Circuits Syst Video Technol 6(3):243–250

    Google Scholar 

  37. Schwarz H, Marpe D, Wiegand T (2007) Overview of the scalable video coding extension of the H.264/AVC standard. IEEE Trans Circuits Syst Video Technol 17(9):1103–1120

    Google Scholar 

  38. Shah R, Narayanan PJ (2013) Interactive video manipulation using object trajectories and scene backgrounds. IEEE Trans Circuits Syst Video Technol 23(9):1565–1576

    Google Scholar 

  39. Shapiro J (1993) Embedded image coding using zerotree of wavelet coefficients. IEEE Trans Signal Process 41(12):3445–3462

    MATH  Google Scholar 

  40. Shi Z, Sun X, Wu F (2012) Spatially scalable video coding for HEVC. IEEE Trans Circuits Syst Video Technol 22(12):1813–1826

    Google Scholar 

  41. Smith MJT, Barnwell TP (1986) Exact reconstruction techniques for tree-structured subband coders. IEEE Trans Acoust Speech Sig Process 34(3):434–441

    Google Scholar 

  42. Song C-M, Wang X-H, Zhang F (2009) Visually lossless accuracy of motion vector in overcomplete wavelet-based scalable video coding. J Comput 4(9):821–828

    Google Scholar 

  43. Tao J, Wang H, Zhang J, Jiang Z (2005) Research on the scalability of 3D wavelet video coding. Mini-Micro Systems 26(2):285–288

    Google Scholar 

  44. Taubman D (2000) High performance scalable image compression with EBCOT. IEEE Trans Image Process 9(7):1158–1170

    MathSciNet  Google Scholar 

  45. Usevitch B (1996) Optimal bit allocation for biorthogonal wavelet coding. In: Proceedings of the data compression conference, vol 1, Snowbird, USA, pp 387–395

  46. Usevitch BE (2001) A tutorial on modern lossy wavelet image compression: foundations of JPEG 2000. IEEE Signal Proc Mag 18(5):22–35

    Google Scholar 

  47. Vass J, Chai B, Zhuang X (1998) 3-D SLCCA—a highly scalable very low bit-rate software-only wavelet video codec. In: Proceedings of the IEEE second workshop multimedia signal processing, vol 3, Redondo Beach, CA, pp 474–479

  48. Wang Y, Lin X, Wu L, Zhang W, Zhang Q, Huang X (2015) Robust subspace clustering for multi-view data by exploiting correlation consensus. IEEE Trans Image Process 24(11):3939–3949

    MathSciNet  MATH  Google Scholar 

  49. Wang Y, Zhang W, Wu L, Lin X, Fang M, Pan S (2016) Iterative views agreement: an iterative low-rank based structured optimization method to multi-view spectral clustering. In: Proceedings of the international joint conference on artificial intelligence, vol 1, New York, USA, pp 2153– 2159

  50. Wang Y, Zhang W, Wu L, Lin X, Zhao X (2017) Unsupervised metric fusion over multiview data by graph random walk-based cross-view diffusion. IEEE Trans Neural Netw Learn Syst 28(1):57–70

    Google Scholar 

  51. Wang Y, Lin X, Wu L, Zhang W (2017) Effective multi-query expansions: collaborative deep networks for robust landmark retrieval. IEEE Trans Image Process 26(3):1393–1404

    MathSciNet  MATH  Google Scholar 

  52. Wang Y, Wu L (2018) Beyond low-rank representations: Orthogonal clustering basis reconstruction with optimized graph structure for multi-view spectral clustering. Neural Netw 103(1):1–8

    MathSciNet  Google Scholar 

  53. Wiegand T, Sullivan GJ, Reichel J, Schwarz H, Wien H (2007) Joint draft 11 of SVC amendment, Doc. JVT-x201, Tech. rep., ISO/IEC JTC1/SC29/WG11, Geneva, Switzerland

  54. Wu L, Wang Y, Li X, Gao J Deep attention-based spatially recursive networks for fine-grained visual recognition. IEEE Transactions on Cybernetics, vol 99 (PP). https://doi.org/10.1109/TCYB.2018.2813971

    Google Scholar 

  55. Wang Y, Wu L, Lin X, Gao J Multiview spectral clustering via structured low-rank matrix factorization, IEEE Transactions on Neural Networks and Learning Systems 99 (PP). https://doi.org/10.1109/TNNLS.2017.2777489

    Google Scholar 

  56. Wu G, Ding W, Shi Y, Yin B (2013) Adaptive weighted prediction for scalable video coding based on HEVC. In: Proceedings Pacific Rim Conference on Multimedia (PCM), vol 1, London, pp 110– 121

  57. Wu L, Wang Y, Gao J, Li X (2018) Deep adaptive feature embedding with local sample distributions for person re-identification. Pattern Recogn 73(1):275–288

    Google Scholar 

  58. Wu L, Wang Y, Li X, Gao J (2018) What-and-where to match: Deep spatially multiplicative integration networks for person re-identification. Pattern Recogn 76(1):727–738

    Google Scholar 

  59. Wu L, Wang Y, Ge Z, Hu Q, Li X (2018) Structured deep hashing with convolutional neural networks for fast person re-identification. Computer Vision and Image Undersanding 167(1):63–73

    Google Scholar 

  60. Xiong R, Xu J, Wu F, Li S (2007) Barbell-lifting based 3-D wavelet coding scheme. IEEE Trans Circuits Syst Video Technol 17(9):1256–1269

    Google Scholar 

  61. Xu J, Xiong Z, Li S, Zhang Y-Q (2001) Three-dimensional embedded subband coding with optimal truncation (3-D ESCOT). Appl Comput Harmon Anal 10(3):290–315

    MathSciNet  MATH  Google Scholar 

  62. Zhang L, Wang D, Vincent A (2008) Decoupled 3-D zerotree structure for wavelet-based video coding. IEEE Trans Broadcast 54(3):430–436

    Google Scholar 

  63. Zhong MS, Ghanbari M (2004) Motion compensation based on wavelet coefficient blocks. Acta Automatica Sinica 30(1):64–69

    Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (NSFC) under Grant nos. 61402214, 41671439, and 61702246, the Provincial Natural Science Foundation of Liaoning under Grant no. 20180550570, the Program for Liaoning Innovative Research Talents in University, the Open Foundation of State Key Laboratory for Novel Software Technology of Nanjing University under Grant no. KFKT2018B07, and the Dalian Foundation for Youth Science and Technology Star under Grant no. 2015R069.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiang-Hai Wang.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Song, CM., Fu, B., Wang, XH. et al. A wavelet video coding algorithm with balanced significance probability tree based on energy weighting. Multimed Tools Appl 78, 30877–30893 (2019). https://doi.org/10.1007/s11042-018-7133-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-018-7133-8

Keywords

Navigation