Abstract
This work presents a 3-D wavelet video coding algorithm. By analyzing the contribution of each biorthogonal wavelet basis to reconstructed signal’s energy, we weight each wavelet subband according to its basis energy. Based on distribution of weighted coefficients, we further discuss a 3-D wavelet tree structure named balanced significance probability tree, which places the coefficients with similar probabilities of being significant on the same layer. It is implemented by using hybrid spatial orientation tree and temporal-domain block tree. Subsequently, a novel 3-D wavelet video coding algorithm is proposed based on the energy-weighted balanced significance probability tree. Experimental results illustrate that our algorithm always achieves good reconstruction quality for different classes of video sequences. Compared with asymmetric 3-D orientation tree, the average peak signal-to-noise ratio (PSNR) gain of our algorithm are 1.24dB, 2.54dB and 2.57dB for luminance (Y) and chrominance (U,V) components, respectively. Compared with temporal-spatial orientation tree algorithm, our algorithm gains 0.38dB, 2.92dB and 2.39dB higher PSNR separately for Y, U, and V components. In addition, the proposed algorithm requires lower computation cost than those of the above two algorithms.
Similar content being viewed by others
References
Bjelopera A, Grgic S (2012) Scalable video coding extension of h.264/AVC. In: Proceedings of the international symposium ELMAR, vol 1, Zadar, Croatia, pp 7–12
Campisi P, Gentile M, Neri A (1999) Three-dimensional wavelet-based approach for a scalable video conference system. In: Proceedings of the IEEE international conference on image processing (ICIP’99), vol 3, Kobe, Japan, pp 802–806
Chai B, Vass J, Zhuang X (1999) Significance link connected component analysis for wavelet image coding. IEEE Trans Image Process 8(6):774–784
Chang Z, Zhuo L, Shen L (2006) An improved motion-compensated three dimension wavelet video coding method. J Circuits Syst 11(1):113–117,121
Chen Y, Pearlman WA (1996) Three-dimensional subband coding of video using the zerotree method. In: Proceedings of the SPIE visual communications and image processing, vol 2727, Snowbird, USA, pp 1302–1312
Chen P, Woods JW (2002) Improved MC-EZBC with quarter-pixel motion vectors, Tech. rep., ISO/IEC JTC 1/SC 29/WG 11, Fairfax VA
Chen PS, Woods JW (2004) Bidirectional MC-EZBC with lifting implementation. IEEE Trans Circuits Syst Video Technol 14(10):1183–1194
Cheng CC, Peng GJ, Hwang WL (2009) Subband weighting with pixel connectivity for 3-D wavelet coding. IEEE Trans Image Process 18(1):52–62
Cho Y, Pearlman WA (2007) Quantifying the coding performance of zerotrees of wavelet coefficients: degree-k zerotree. IEEE Trans Signal Process 55(6):2425–2431
Daubechies I (1992) Ten lectures on wavelets, 1st Edition, Society for industrial and applied mathematics
Ding W, Hu J, Zhang L (2005) Optimal 3d-SPIHT video coding method by reducing redundancy between trees. J Comput Aided Des Comput Graph 17(3):563–569
Fang S, Zhong Y (2005) 3D Subband codec with full scalability. Mini-Micro Systems 26(7):1260–1263
Fowler JE, Pesquet-Popescu B (2007) An overview on wavelets in source coding, communications, and networks. EURASIP J Image Video Process 2007(1):1–27
He C, Dong J, Zheng Y, Gao Z (2003) Optimal 3-D coefficient tree structure for 3-D wavelet video coding. IEEE Trans Circuits Syst Video Technol 13(10):961–972
Hsiang ST, Woods JW (2000) Embedded image coding using zeroblocks of subband/wavelet coefficient and context modeling. In: Proceedings of the IEEE international symposium on circuit and systems (ISCAS’00), vol 3, Geneva, Switzerland, pp 662–665
ISO/IEC JTC1/SC29/WG11 (2005) SVC requirements specified by MPEG, JVT-N026, Tech. rep., ISO/IEC JTC1/SC29/WG11 Hong Kong
ISO/IEC JTC1/SC29/WG11 (2005) Wavelet codec reference document and software manual, Tech. Rep. ISO/MPEG Video, Tech. Rep, N7334 ISO/IEC JTC1/SC29/WG11
ISO/IEC JTC 1/SC 29/WG 11 (2006) Wavelet video coding - an overview, Doc. W7824, ISO/IEC JTC1/SC29/WG11, Bangkok, Thailand
ISO/IEC JTC 1/SC 29/WG 11 (2006) Status report - version 1 on wavelet video coding exploration, Doc. N7822, ISO/IEC JTC1/SC29/WG11, Bangkok, Thailand
ISO/IEC JTC 1/SC 29/WG 11, ITU-T SG 16 WP 3 (2012) Joint call for proposals on scalable video coding extensions of high efficiency video coding (HEVC), Tech. Rep. N12957, ISO/IEC JTC1/SC29/WG11,Stockholm, Sweden
Khalil H, Atiya AF, Shaheen SI (1999) Lowering frame-buffering requirements of 3-D wavelet transform coding of interactive video. In: Proceedings of the IEEE international conference on image processing (ICIP’99), vol 3, Kobe, Japan, pp 852–856
Khan E, Ghanbari M (2001) Very low bit rate video coding using virtual spiht. Electron Lett 37(1):40–41
Khan E, Ghanbari M (2004) An efficient and scalable low bit-rate video coding with virtual SPIHT. Signal Process Image Commun 19(3):267–283
Kim BJ, Pearlman WA (1997) An embedded wavelet video coder using three-dimensional set partitioning in hierarchical trees (SPIHT). In: Proceedings of the data compression conference, vol 1, Snowbird, USA, pp 251–260
Kim B, Xiong Z, Pearlman WA (2000) Low bit rate, scalable video coding with 3-D set partitioning in hierarchical trees (3-D SPIHT). IEEE Trans Circuits Syst Video Technol 10(12):1374–1387
Lasserre S, Leannec FL, Taquet J, Nassor E Low-complexity intra coding for scalable extension of HEVC based on content statistics, IEEE Transactions on Circuits and Systems for Video Technology 24 (2014) (to be published)
Liu Y, Lee JYB (2017) Post-streaming rate analysisa new approach to mobile video streaming with predictable performance. IEEE Trans Mob Comput 16(12):3488–3501
Lòpez MF, Ruiz VG, García I (2007) Efficiency of closed and open-loop scalable wavelet based video coding, Advanced concepts for intelligent vision systems. Lect Notes Comput Sci 4678(10):800–809
Lu X, Martin GR (2013) Performance comparison of the SVC, WSVC, and Motion JPEG2000 advanced scalable video coding schemes. In: Proceedings of the IET, Intelligent Signal Processing, vol 1, London, pp 1–6
Marpe D, Cycon HL (1999) Very low bit-rate video coding using wavelet-based techniques. IEEE Trans Circuits Syst Video Technol 9(1):85–94
Minami G, Xiong Z, Wang A, Chou PA, Mehrotra S (2001) 3-D Wavelet coding of video with arbitrary regions of support. IEEE Trans Circuits Syst Video Technol 11(9):1063–1068
Moinuddin AA, Khan E, Ghanbari M (2008) Efficient and embedded 3-D wavelet video coding. In: Proceedings TENCON 2008, vol 1, Hyderabad, pp 1–4
Moinuddin AA, Khan E, Ghanbari M (2010) The impact of tree structures on the performance of zerotree-based wavelet video codecs. Signal Process Image Commun 25(3):179–195
Park U-K, Choi H, Kang JW, Kim J-G (2012) Scalable video coding with large block for UHD video. IEEE Trans Consum Electron 28(3):932–940
Pearlman WA, Islam A, Nagaraj N, Said A (2004) Efficient low-complexity image coding with set-partitioning embedded block coder. IEEE Trans Circuits Syst Video Technol 14(11):1219–1235
Said A, Pearlman WA (1996) A new, fast, and efficient image codec based on set partitioning in hierarchical trees. IEEE Trans Circuits Syst Video Technol 6(3):243–250
Schwarz H, Marpe D, Wiegand T (2007) Overview of the scalable video coding extension of the H.264/AVC standard. IEEE Trans Circuits Syst Video Technol 17(9):1103–1120
Shah R, Narayanan PJ (2013) Interactive video manipulation using object trajectories and scene backgrounds. IEEE Trans Circuits Syst Video Technol 23(9):1565–1576
Shapiro J (1993) Embedded image coding using zerotree of wavelet coefficients. IEEE Trans Signal Process 41(12):3445–3462
Shi Z, Sun X, Wu F (2012) Spatially scalable video coding for HEVC. IEEE Trans Circuits Syst Video Technol 22(12):1813–1826
Smith MJT, Barnwell TP (1986) Exact reconstruction techniques for tree-structured subband coders. IEEE Trans Acoust Speech Sig Process 34(3):434–441
Song C-M, Wang X-H, Zhang F (2009) Visually lossless accuracy of motion vector in overcomplete wavelet-based scalable video coding. J Comput 4(9):821–828
Tao J, Wang H, Zhang J, Jiang Z (2005) Research on the scalability of 3D wavelet video coding. Mini-Micro Systems 26(2):285–288
Taubman D (2000) High performance scalable image compression with EBCOT. IEEE Trans Image Process 9(7):1158–1170
Usevitch B (1996) Optimal bit allocation for biorthogonal wavelet coding. In: Proceedings of the data compression conference, vol 1, Snowbird, USA, pp 387–395
Usevitch BE (2001) A tutorial on modern lossy wavelet image compression: foundations of JPEG 2000. IEEE Signal Proc Mag 18(5):22–35
Vass J, Chai B, Zhuang X (1998) 3-D SLCCA—a highly scalable very low bit-rate software-only wavelet video codec. In: Proceedings of the IEEE second workshop multimedia signal processing, vol 3, Redondo Beach, CA, pp 474–479
Wang Y, Lin X, Wu L, Zhang W, Zhang Q, Huang X (2015) Robust subspace clustering for multi-view data by exploiting correlation consensus. IEEE Trans Image Process 24(11):3939–3949
Wang Y, Zhang W, Wu L, Lin X, Fang M, Pan S (2016) Iterative views agreement: an iterative low-rank based structured optimization method to multi-view spectral clustering. In: Proceedings of the international joint conference on artificial intelligence, vol 1, New York, USA, pp 2153– 2159
Wang Y, Zhang W, Wu L, Lin X, Zhao X (2017) Unsupervised metric fusion over multiview data by graph random walk-based cross-view diffusion. IEEE Trans Neural Netw Learn Syst 28(1):57–70
Wang Y, Lin X, Wu L, Zhang W (2017) Effective multi-query expansions: collaborative deep networks for robust landmark retrieval. IEEE Trans Image Process 26(3):1393–1404
Wang Y, Wu L (2018) Beyond low-rank representations: Orthogonal clustering basis reconstruction with optimized graph structure for multi-view spectral clustering. Neural Netw 103(1):1–8
Wiegand T, Sullivan GJ, Reichel J, Schwarz H, Wien H (2007) Joint draft 11 of SVC amendment, Doc. JVT-x201, Tech. rep., ISO/IEC JTC1/SC29/WG11, Geneva, Switzerland
Wu L, Wang Y, Li X, Gao J Deep attention-based spatially recursive networks for fine-grained visual recognition. IEEE Transactions on Cybernetics, vol 99 (PP). https://doi.org/10.1109/TCYB.2018.2813971
Wang Y, Wu L, Lin X, Gao J Multiview spectral clustering via structured low-rank matrix factorization, IEEE Transactions on Neural Networks and Learning Systems 99 (PP). https://doi.org/10.1109/TNNLS.2017.2777489
Wu G, Ding W, Shi Y, Yin B (2013) Adaptive weighted prediction for scalable video coding based on HEVC. In: Proceedings Pacific Rim Conference on Multimedia (PCM), vol 1, London, pp 110– 121
Wu L, Wang Y, Gao J, Li X (2018) Deep adaptive feature embedding with local sample distributions for person re-identification. Pattern Recogn 73(1):275–288
Wu L, Wang Y, Li X, Gao J (2018) What-and-where to match: Deep spatially multiplicative integration networks for person re-identification. Pattern Recogn 76(1):727–738
Wu L, Wang Y, Ge Z, Hu Q, Li X (2018) Structured deep hashing with convolutional neural networks for fast person re-identification. Computer Vision and Image Undersanding 167(1):63–73
Xiong R, Xu J, Wu F, Li S (2007) Barbell-lifting based 3-D wavelet coding scheme. IEEE Trans Circuits Syst Video Technol 17(9):1256–1269
Xu J, Xiong Z, Li S, Zhang Y-Q (2001) Three-dimensional embedded subband coding with optimal truncation (3-D ESCOT). Appl Comput Harmon Anal 10(3):290–315
Zhang L, Wang D, Vincent A (2008) Decoupled 3-D zerotree structure for wavelet-based video coding. IEEE Trans Broadcast 54(3):430–436
Zhong MS, Ghanbari M (2004) Motion compensation based on wavelet coefficient blocks. Acta Automatica Sinica 30(1):64–69
Acknowledgements
This work is supported by the National Natural Science Foundation of China (NSFC) under Grant nos. 61402214, 41671439, and 61702246, the Provincial Natural Science Foundation of Liaoning under Grant no. 20180550570, the Program for Liaoning Innovative Research Talents in University, the Open Foundation of State Key Laboratory for Novel Software Technology of Nanjing University under Grant no. KFKT2018B07, and the Dalian Foundation for Youth Science and Technology Star under Grant no. 2015R069.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Song, CM., Fu, B., Wang, XH. et al. A wavelet video coding algorithm with balanced significance probability tree based on energy weighting. Multimed Tools Appl 78, 30877–30893 (2019). https://doi.org/10.1007/s11042-018-7133-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-018-7133-8