Skip to main content

Multi-channel Object-Based Spatial Parameter Compression Approach for 3D Audio

  • Conference paper
  • First Online:
Book cover Advances in Multimedia Information Processing -- PCM 2015 (PCM 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9314))

Included in the following conference series:

Abstract

To improve the spatial precision of three-dimensional (3D) audio, the bit rates of spatial parameters are increased sharply. This paper presents a spatial parameters compression approach to decrease the bit rates of spatial parameters for 3D audio. Based on spatial direction filtering and spatial side information clustering, new multi-channel object-based spatial parameters compression approach (MOSPCA) is presented, through which the spatial parameters of intra-frame different frequency bands belonging to the same sound source can be compressed to one spatial parameter. In an experiment it is shown that the compression ratio of spatial parameter can reach 7:1 compared with the 1.4:1 of MPEG Surround and S3AC (spatial squeeze surround audio coding), while transparent spatial perception is maintained.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Faller, C., Baumgarte, F.: Binaural cue coding-part II: schemes and applications. IEEE Trans. Speech Audio Process. 11(6), 520–531 (2003)

    Article  Google Scholar 

  2. Pulkki, V.: Spatial sound reproduction with directional audio coding. J. Audio Eng. Soc. 55(6), 503–516 (2007)

    Google Scholar 

  3. Cheng, B., Ritz, C., Burnett, I., et al.: A general compression approach to multi-channel three-dimensional audio. IEEE Trans. Audio Speech Lang. Process. 21(8), 1676–1688 (2013)

    Article  Google Scholar 

  4. Cheng, B., Ritz, C.H., Burnett, I.S.: Psychoacoustic-based quantisation of spatial audio cues. Electron. Lett. 44(18), 1098–1099 (2008)

    Article  Google Scholar 

  5. Yang, C., Hu, R., et al.: A 3D audio coding technique based on extracting the distance parameter. In: IEEE International Conference on Multimedia and Expo (ICME) (2014)

    Google Scholar 

  6. Kallinger, M., Del, Galdo G., Kuech, F., Mahne, D., Schultz-Amling, R.: Spatial filtering using directional audio coding parameters. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 217–220 (2009)

    Google Scholar 

  7. Zheng, X.G., Ritz, C.H., Xi, J.T.: Collaborative blind source separation using location informed spatial microphones. IEEE Signal Proc. Lett. 20(1), 83–86 (2013)

    Article  Google Scholar 

  8. Gorlow, S., Marchand, S.: Informed audio source separation using linearly constrained spatial filters. IEEE Trans. Audio Speech Lang. Process. 21(1), 3–13 (2013)

    Article  Google Scholar 

  9. Blauert, J.: Spatial Hearing: The Psychophysics of Human Sound Localization. MIT Press, Cambridge (1997)

    Google Scholar 

  10. Qu, T.S., Xiao, Z., Gong, M., Huang, Y., Li, X.D., Wu, X.H.: Distance-dependent head-related transfer functions measured with high spatial resolution using a spark gap. IEEE Trans. Audio Speech Lang. Process. 17(6), 1124–1132 (2009)

    Article  Google Scholar 

Download references

Acknowledgement

This work is supported by National High Technology Research and Development Program of China (863 Program, No. 2015AA016306), National Nature Science Foundation of China (No. 61231015, 61102127, 61201340, 61201169, 61471271, U1404618), Science and Technology Plan Projects of Shenzhen (No. ZDSYS2014050916575763), Guangdong-Hongkong Key Domain Breakthrough Project of China (No. 2012A090200007), Science and Technology Foundation of Guizhou Province (No. LKS[2011]1).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ruimin Hu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Yang, C., Hu, R., Su, L., Wang, X., Zhang, M., Qu, S. (2015). Multi-channel Object-Based Spatial Parameter Compression Approach for 3D Audio. In: Ho, YS., Sang, J., Ro, Y., Kim, J., Wu, F. (eds) Advances in Multimedia Information Processing -- PCM 2015. PCM 2015. Lecture Notes in Computer Science(), vol 9314. Springer, Cham. https://doi.org/10.1007/978-3-319-24075-6_34

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-24075-6_34

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-24074-9

  • Online ISBN: 978-3-319-24075-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics