Abstract
To improve the spatial precision of three-dimensional (3D) audio, the bit rates of spatial parameters are increased sharply. This paper presents a spatial parameters compression approach to decrease the bit rates of spatial parameters for 3D audio. Based on spatial direction filtering and spatial side information clustering, new multi-channel object-based spatial parameters compression approach (MOSPCA) is presented, through which the spatial parameters of intra-frame different frequency bands belonging to the same sound source can be compressed to one spatial parameter. In an experiment it is shown that the compression ratio of spatial parameter can reach 7:1 compared with the 1.4:1 of MPEG Surround and S3AC (spatial squeeze surround audio coding), while transparent spatial perception is maintained.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Faller, C., Baumgarte, F.: Binaural cue coding-part II: schemes and applications. IEEE Trans. Speech Audio Process. 11(6), 520–531 (2003)
Pulkki, V.: Spatial sound reproduction with directional audio coding. J. Audio Eng. Soc. 55(6), 503–516 (2007)
Cheng, B., Ritz, C., Burnett, I., et al.: A general compression approach to multi-channel three-dimensional audio. IEEE Trans. Audio Speech Lang. Process. 21(8), 1676–1688 (2013)
Cheng, B., Ritz, C.H., Burnett, I.S.: Psychoacoustic-based quantisation of spatial audio cues. Electron. Lett. 44(18), 1098–1099 (2008)
Yang, C., Hu, R., et al.: A 3D audio coding technique based on extracting the distance parameter. In: IEEE International Conference on Multimedia and Expo (ICME) (2014)
Kallinger, M., Del, Galdo G., Kuech, F., Mahne, D., Schultz-Amling, R.: Spatial filtering using directional audio coding parameters. In: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 217–220 (2009)
Zheng, X.G., Ritz, C.H., Xi, J.T.: Collaborative blind source separation using location informed spatial microphones. IEEE Signal Proc. Lett. 20(1), 83–86 (2013)
Gorlow, S., Marchand, S.: Informed audio source separation using linearly constrained spatial filters. IEEE Trans. Audio Speech Lang. Process. 21(1), 3–13 (2013)
Blauert, J.: Spatial Hearing: The Psychophysics of Human Sound Localization. MIT Press, Cambridge (1997)
Qu, T.S., Xiao, Z., Gong, M., Huang, Y., Li, X.D., Wu, X.H.: Distance-dependent head-related transfer functions measured with high spatial resolution using a spark gap. IEEE Trans. Audio Speech Lang. Process. 17(6), 1124–1132 (2009)
Acknowledgement
This work is supported by National High Technology Research and Development Program of China (863 Program, No. 2015AA016306), National Nature Science Foundation of China (No. 61231015, 61102127, 61201340, 61201169, 61471271, U1404618), Science and Technology Plan Projects of Shenzhen (No. ZDSYS2014050916575763), Guangdong-Hongkong Key Domain Breakthrough Project of China (No. 2012A090200007), Science and Technology Foundation of Guizhou Province (No. LKS[2011]1).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Yang, C., Hu, R., Su, L., Wang, X., Zhang, M., Qu, S. (2015). Multi-channel Object-Based Spatial Parameter Compression Approach for 3D Audio. In: Ho, YS., Sang, J., Ro, Y., Kim, J., Wu, F. (eds) Advances in Multimedia Information Processing -- PCM 2015. PCM 2015. Lecture Notes in Computer Science(), vol 9314. Springer, Cham. https://doi.org/10.1007/978-3-319-24075-6_34
Download citation
DOI: https://doi.org/10.1007/978-3-319-24075-6_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24074-9
Online ISBN: 978-3-319-24075-6
eBook Packages: Computer ScienceComputer Science (R0)