Transformation of Video Signal Processing Techniques from 2D to 3D: A Survey

  • Conference paper

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 570))

Abstract

This paper presents an advanced depth intra-coding approach for 3D video coding based on the High Efficiency Video Coding (HEVC) standard and the multiview video plus depth (MVD) representation. The work is motivated by the fact that depth signals have specific characteristics that differ from those of natural signals, i.e., camera-view video. Our approach replaces conventional intra-picture coding for the depth component, targeting consistent and efficient support for 3D video applications that use depth maps, polygon meshes, or both. The goal is high depth coding efficiency, measured as minimal artifacts in rendered views and meshes with a minimal number of triangles for a given bit rate. For this purpose, we introduce intra-picture prediction modes based on geometric primitives, along with a residual coding method in the spatial domain; these substitute for conventional intra-prediction modes and transform coding, respectively. The results show that our solution achieves the same quality of rendered or synthesized views at about the same bit rate as MVD coding with the 3D video extension of HEVC (3D-HEVC) for high-quality depth maps, and at about 8% lower overall bit rate than 3D-HEVC without the related depth tools. At the same time, the combination of 3D video with 3D computer graphics content is substantially simplified, because the geometry-based depth intra signals can be represented as a surface mesh with about 85% fewer triangles, generated directly in the decoding process as an alternative decoder output.
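To make the idea of geometry-based intra prediction with a spatial-domain residual more concrete, the following is a minimal sketch and not the method described in the abstract. It assumes the simplest possible geometric primitive: a straight line that splits a depth block into two regions, each predicted by one constant value, with the residual taken sample by sample and no transform. All function names (line_partition, predict_two_regions, spatial_residual, reconstruct) and the 4x4 test block are hypothetical and chosen only for illustration.

```python
from typing import List, Tuple

Block = List[List[int]]
Mask = List[List[bool]]


def line_partition(size: int, start: Tuple[int, int], end: Tuple[int, int]) -> Mask:
    """Split a size x size block into two regions along the line start -> end.

    Assumption: a straight line is the only primitive; a sample is marked
    True when it lies strictly on one side of the line (positive cross product).
    """
    (x0, y0), (x1, y1) = start, end
    return [
        [(x1 - x0) * (y - y0) - (y1 - y0) * (x - x0) > 0 for x in range(size)]
        for y in range(size)
    ]


def predict_two_regions(block: Block, mask: Mask) -> Block:
    """Predict each region by the rounded mean of its original depth samples."""
    sums, counts = [0, 0], [0, 0]
    for y, row in enumerate(block):
        for x, value in enumerate(row):
            region = int(mask[y][x])
            sums[region] += value
            counts[region] += 1
    means = [round(sums[r] / counts[r]) if counts[r] else 0 for r in (0, 1)]
    return [[means[int(mask[y][x])] for x in range(len(row))]
            for y, row in enumerate(block)]


def spatial_residual(block: Block, pred: Block) -> Block:
    """Residual in the spatial domain: plain sample differences, no transform."""
    return [[b - p for b, p in zip(b_row, p_row)]
            for b_row, p_row in zip(block, pred)]


def reconstruct(pred: Block, residual: Block) -> Block:
    """Decoder side: add the residual back onto the prediction."""
    return [[p + r for p, r in zip(p_row, r_row)]
            for p_row, r_row in zip(pred, residual)]


# Hypothetical 4x4 depth block with a sharp vertical edge between columns 1 and 2.
block = [[50, 50, 200, 200] for _ in range(4)]
mask = line_partition(4, start=(2, 0), end=(2, 3))
pred = predict_two_regions(block, mask)
residual = spatial_residual(block, pred)
```

For a block like this one, where the edge coincides with the partition line, the two-constant prediction already matches the depth samples and the residual is all zeros. That is the intuition behind treating depth maps, which consist mostly of smooth areas separated by sharp edges, differently from camera-view video.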

Author information

Corresponding author

Correspondence to Sanjay Koli.

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Koli, S., Shamalik, R. (2020). Transformation of Video Signal Processing Techniques from 2D to 3D: A Survey. In: Kumar, A., Mozar, S. (eds) ICCCE 2019. Lecture Notes in Electrical Engineering, vol 570. Springer, Singapore. https://doi.org/10.1007/978-981-13-8715-9_8

  • DOI: https://doi.org/10.1007/978-981-13-8715-9_8

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-8714-2

  • Online ISBN: 978-981-13-8715-9

  • eBook Packages: Engineering, Engineering (R0)
