Frequency Domain Coding

Bäckström, Tom

doi:10.1007/978-3-319-50204-5_10

Part of the book series: Signals and Communication Technology (SCT)

Abstract

Signals which are sufficiently stationary permit highly efficient coding in the frequency domain. Such signals include speech signals such as sustained vowels and prolonged fricatives, as well as generic audio signals such as music and mixed material. The main components of frequency domain coding methods include windowing, a time-frequency transform, perceptual modelling and entropy coding of the spectral components. This chapter gives an overview of such transform domain coding methods.
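
The first two components named above, windowing and a time-frequency transform, can be illustrated with a minimal sketch. It assumes a sine window and the modified discrete cosine transform (MDCT) with 50% overlap, which is one common choice and is not stated in the abstract. A plain uniform quantizer stands in for the coding stage; perceptual modelling and entropy coding are left out. All function names are hypothetical.

```python
import math

def sine_window(N):
    """Sine window of length 2N; satisfies w[n]^2 + w[n+N]^2 = 1."""
    return [math.sin(math.pi * (n + 0.5) / (2 * N)) for n in range(2 * N)]

def _basis(N):
    """MDCT cosine basis: N rows (frequency bins) by 2N columns (samples)."""
    n0 = 0.5 + N / 2
    return [[math.cos(math.pi / N * (n + n0) * (k + 0.5)) for n in range(2 * N)]
            for k in range(N)]

def analyze(signal, N):
    """Window overlapping frames (hop N, length 2N) and apply the MDCT."""
    w, C = sine_window(N), _basis(N)
    pad = (-len(signal)) % N
    # Zero-pad so every signal sample is covered by exactly two frames.
    x = [0.0] * N + list(signal) + [0.0] * (pad + N)
    frames = []
    for start in range(0, len(x) - N, N):
        seg = [w[n] * x[start + n] for n in range(2 * N)]
        frames.append([sum(c * s for c, s in zip(C[k], seg)) for k in range(N)])
    return frames

def synthesize(frames, N, length):
    """Inverse MDCT, synthesis windowing and overlap-add."""
    w, C = sine_window(N), _basis(N)
    y = [0.0] * ((len(frames) + 1) * N)
    for i, X in enumerate(frames):
        for n in range(2 * N):
            v = (2.0 / N) * sum(X[k] * C[k][n] for k in range(N))
            y[i * N + n] += w[n] * v  # aliasing cancels between neighbours
    return y[N:N + length]

def quantize(frames, step):
    """Uniform scalar quantization of the spectral coefficients."""
    return [[round(c / step) for c in X] for X in frames]

def dequantize(indices, step):
    return [[q * step for q in Q] for Q in indices]

if __name__ == "__main__":
    N = 16
    signal = [math.sin(2 * math.pi * 0.05 * t) for t in range(100)]
    frames = analyze(signal, N)
    rec = synthesize(frames, N, len(signal))
    print("max error without quantization:",
          max(abs(a - b) for a, b in zip(signal, rec)))
    coded = dequantize(quantize(frames, 0.1), 0.1)
    rec_q = synthesize(coded, N, len(signal))
    print("max error with step 0.1:",
          max(abs(a - b) for a, b in zip(signal, rec_q)))
```

Without quantization the overlap-add output matches the input up to floating-point rounding, because the window satisfies the power-complementarity condition and the time-domain aliasing of adjacent frames cancels. In a real codec the quantizer step would be steered by a perceptual model and the indices would be entropy coded.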

Author information

Authors and Affiliations

International Audio Laboratories Erlangen (AudioLabs), Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany
Tom Bäckström

Corresponding author

Correspondence to Tom Bäckström.

About this chapter

Cite this chapter

Bäckström, T. (2017). Frequency Domain Coding. In: Speech Coding. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-319-50204-5_10

DOI: https://doi.org/10.1007/978-3-319-50204-5_10
Published: 30 March 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50202-1
Online ISBN: 978-3-319-50204-5
eBook Packages: Engineering, Engineering (R0)