Skip to main content
Log in

An enhanced fast mode decision model for spatial enhancement layers in scalable video coding

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Recently, the H.264/AVC standard has been extended to incorporate Scalable Video Coding (SVC). SVC offers the advantage of scalable (layered) coding, but has the disadvantage of a highly increased computational complexity at the encoder side when dealing with spatial scalability. To restrict the increase in required processing power, fast mode decision models for spatial enhancement layers have been proposed in literature. We propose a novel generic fast mode decision model for spatial enhancement layers for both P and B frames based on both the quantization of the enhancement layer and the correlation between the macroblock type in the enhancement layer and the co-located macroblock in the reference layer. In this paper, an evaluation of the proposed model and comparison with a state-of-the-art model is given. Results show that the proposed technique performs exceptionally well for spatial scalability. For both dyadic and non-dyadic spatial scalability, we achieve an average time saving of 75%, while only a slight bit rate increase of 2.23% and a minor PSNR decrease of 0.46 dB are measured. Compared with state of the art techniques, we further halve the complexity while having comparable rate-distortion results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Akyol E, Mukherjee D, Lui Y (2007) Complexity control for real-time video coding. In: Proceedings of the international conference on image processing (ICIP). USA

  2. Ates HF, Altunbasak Y (2008) Rate-distortion and complexity optimized motion estimation for H.264 video coding. IEEE Trans Circuits Syst Video Technol 18(2):159–171

    Article  Google Scholar 

  3. Bjøntegaard G (2001) Doc. VCEG-M33: calculation of average PSNR differences between RD-curves. ITU-T SG16 Q.6, Tech. Rep., Austin, Texas

  4. Bjøntegaard G (2008) Doc. VCEG-AI11: improvements of the BD-PSNR model. ITU-T SG16 Q.6, Tech. Rep., Berlin, Germany

  5. De Wolf K, De Schrijver D, De Neve W, De Zutter S, Lambert P, Van de Walle R (2007) Analysis of prediction mode decision in spatial enhancement layers in H.264/AVC SVC. Computer analysis of images and patterns, vol 4673/2007, pp 848–855

  6. De Wolf K, De Schrijver D, De Zutter S, Van de Walle R (2007) Scalable Video coding: analysis and coding performance of inter-layer prediction. In: Proc. of the 9th international symposium on signal processing and its applications (ISSPA)

  7. Goh G, Kang J, Cho M, Chung K (2009) Fast mode decision for scalable video coding based on neighboring macroblock analysis. In: SAC. ACM, pp 1845–1846

  8. Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (2007) Advanced Video Coding for Generic Audiovisual Services, ITU-T Rec. H.264 and ISO/IEC 14496-10 (MPEG-4 AVC), Version 8 (including SVC extension). MPEG / ITU-T, Tech. Rep

  9. Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (2007) Doc. JVT-W203: joint scalable video model 10. MPEG / ITU-T, Tech. Rep

  10. Jung S-W, Baek S-J, Park C-S, Ko S-J (2010) Fast mode decision using all-zero block detection for fidelity and spatial scalable video coding. IEEE Trans Circuits Syst Video Technol 20(2):201–206

    Article  Google Scholar 

  11. Kim S-T, reddy Konda K, Cho C-S (2009) Fast mode decision algorithm for spatial and snr scalable video coding. In: Proceedings of the IEEE international symposium on circuits and systems (ISCAS), pp 872–875

  12. Kim S-T, reddy Konda K, su Park C, Cho C-S, Ko S-J (2009) Fast mode decision algorithm for inter-layer coding in scalable video coding. IEEE Trans Consum Electron 55(3):1572–1580

    Article  Google Scholar 

  13. Li H, Li ZG, Wen C (2006) Fast mode decision algorithm for inter frame coding in fully scalable video coding. IEEE Trans Circuits Syst Video Technol 16(7):889–895

    Article  Google Scholar 

  14. Li H, Li Z, Wen C, Chau L-P (2006) Fast mode decision for spatial scalable video coding. In: International symposium on circuits and systems (ISCAS), pp 3005–3008

  15. Li H, Li Z, Wen C, Xie S (2007) Fast mode decision for coarse granular scalability via switched candidate mode set. In: Proceedings of the IEEE international conference on multimedia and expo

  16. Lin H, Peng W, Hang H (2007) Low-complexity macroblock mode decision algorithm for combined cgs and temporal scalability. MPEG/ITU-T, Tech. Rep

  17. Park C-S, Baek S-J, Yoon M-S, Kim H-K, Ko S-J (2009) Selective Inter-layer residual prediction for SVC-based video streaming. IEEE Trans Consum Electron 55(1):235–239

    Article  Google Scholar 

  18. Park C-S, Dan DB-K, Haechul C, Ko S-J (2009) A statistical approach for fast mode decision in scalable video coding. IEEE Trans Circuits Syst Video Technol 19(12):1915–1920

    Article  Google Scholar 

  19. Ren J, Kehtarnavaz ND (2008) Fast adaptive early termination for mode selection in H.264 scalable video coding. In: Proceedings of the international conference on image processing (ICIP). USA, pp 2464–2467

  20. Ren J, Kehtarnavaz ND (2009) Fast adaptive early termination for mode selection in H.264 scalable video coding. JRTIP 4(1):13–21

    Google Scholar 

  21. Schwarz H, Marpe D, Wiegand T (2007) Overview of the scalable video coding extension of the H.264/AVC standard. IEEE Trans Circuits Syst Video Technol 17(9):1103–1120

    Article  Google Scholar 

  22. Segall C, Sullivan G (2007) Spatial scalability within the H.264/AVC scalable video coding extension. IEEE Trans Circuits Syst Video Technol 17(9):1121–1135

    Article  Google Scholar 

  23. Van Leuven S, De Wolf K, Lambert P, Van de Walle R (2009) Probability analysis for macroblock types in spatial enhancement layers for SVC. In: Proceedings of the 11th IASTED international conference on signal and image processing

  24. Wiegand T, Schwarz H, Joch A, Kossentini F, Sullivan GJ (2003) Rate-constrained coder control and comparison of video coding standards. IEEE Trans Circuits Syst Video Technol 13(7):688–703

    Article  Google Scholar 

  25. Wiegand T, Sullivan GJ, Bjøntegaard G, Luthra A (2003) Overview of the H.264/AVC video coding standard. IEEE Trans Circuits Syst Video Technol 13(7):560–576

    Article  Google Scholar 

  26. Yeh C-H, Fan K-J, Chen M-J, Li G-L (2010) Fast mode decision algorithm for scalable video coding using bayesian theorem detection and Markov process. IEEE Trans Circuits Syst Video Technol 20(4):563–574

    Article  Google Scholar 

Download references

Acknowledgements

The research activities as described in this paper were funded by Ghent University, the Interdisciplinary Institute for Broadband Technology (IBBT), Ph.D. and postdoctoral fellow grants of the Agency for Innovation by Science and Technology (IWT), the Fund for Scientific Research-Flanders (FWO-Flanders), and the European Union.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sebastiaan Van Leuven.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Van Leuven, S., Van Wallendael, G., De Wolf, K. et al. An enhanced fast mode decision model for spatial enhancement layers in scalable video coding. Multimed Tools Appl 58, 215–237 (2012). https://doi.org/10.1007/s11042-010-0716-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-010-0716-7

Keywords

Navigation