Skip to main content

Frequency Domain Coding

  • Chapter
  • First Online:
Speech Coding

Part of the book series: Signals and Communication Technology ((SCT))

  • 1051 Accesses

Abstract

Signals which are sufficiently stationary permit highly efficient coding in the frequency domain. Such signals include speech signals such as sustained vowels and prolonged fricatives, as well as generic audio signals such as music and mixed material. The main components of frequency domain coding methods include windowing, a time-frequency transform, perceptual modelling and entropy coding of the spectral components. This chapter gives an overview of such transform domain coding methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. 3GPP. TS 26.445, EVS Codec Detailed Algorithmic Description; 3GPP Technical Specification (Release 12), 2014

    Google Scholar 

  2. Allen, J.: Short-term spectral analysis, and modification by discrete fourier transform. IEEE Trans. Acoust. Speech Signal Process. 25, 235–238 (1977)

    Article  MATH  Google Scholar 

  3. Bosi, M., Goldberg, R.E.: Introduction to Digital Audio Coding and Standards. Kluwer Academic Publishers, Dordrecht (2003)

    Book  Google Scholar 

  4. Bäckström, T.: Comparison of windowing in speech and audio coding. In: Proceedings of WASPAA, New Paltz, USA (2013)

    Google Scholar 

  5. Bäckström, T.: Vandermonde factorization of Toeplitz matrices and applications in filtering and warping. IEEE Trans. Signal Process. 61(24), 6257–6263 (2013)

    Article  MathSciNet  Google Scholar 

  6. Bäckström, T., Helmrich, C.R.: Decorrelated innovative codebooks for ACELP using factorization of autocorrelation matrix. In: Proceedings of Interspeech, pp. 2794–2798 (2014)

    Google Scholar 

  7. Bäckström, T., Helmrich, C.R.: Arithmetic coding of speech and audio spectra using TCX based on linear predictive spectral envelopes. In: Proceedings of ICASSP, pp. 5127–5131 (2015)

    Google Scholar 

  8. Edler, B.: Codierung von audiosignalen mit Überlappender transformation und adaptiven fensterfunktionen. Frequenz 43(9), 252–256 (1989)

    Article  Google Scholar 

  9. Eksler, V., Jelínek, M., Salami, R.: Efficient handling of mode switching and speech transitions in the EVS codec. In: Proceedings of ICASSP, Brisbane, Australia, IEEE (2015)

    Google Scholar 

  10. Fischer, T.: A pyramid vector quantizer. IEEE Trans. Inf. Theory, IT-32(4), 568–583 (1986)

    Google Scholar 

  11. Fuchs, G., Subbaraman, V., Multrus, M.: Efficient context adaptive entropy coding for real-time applications. In: Proceedings of ICASSP, IEEE, pp. 493–496 (2011)

    Google Scholar 

  12. Gersho, A., Gray, R.M.: Vector Quantization and Signal Compression. Springer, Berlin (1992)

    Google Scholar 

  13. Harris, F.J.: On the use of windows for harmonic analysis with the discrete fourier transform. Proc. IEEE 66(1), 51–83 (1978)

    Article  Google Scholar 

  14. Huffman, D.A.: A method for the construction of minimum redundancy codes. Proc. IRE 40(9), 1098–1101 (1952)

    Article  MATH  Google Scholar 

  15. ISO/IEC 23003–3:2012. MPEG-D (MPEG audio technologies), Part 3: Unified speech and audio coding (2012)

    Google Scholar 

  16. Malvar, H.S.: Lapped transforms for efficient transform/subband coding. IEEE Trans. Acoust. Speech Signal Process. 38(6), 969–978 (1990)

    Article  Google Scholar 

  17. Malvar, H.S.: Signal Processing with Lapped Transforms. Artech House, Inc. (1992)

    Google Scholar 

  18. Mitra, S.K.: Digital Signal Processing: A Computer-Based Approach. McGraw-Hill, New York (1998)

    Google Scholar 

  19. Mäkinen, J., Bessette, B., Bruhn, S., Ojala, P., Salami, R., Taleb, A.: AMR-WB+: a new audio coding standard for 3rd generation mobile audio services. Proc. ICASSP 2, 1109–1112 (2005)

    Google Scholar 

  20. Rissanen, J., Langdon, G.G.: Arithmetic coding. IBM J. Res. Dev. 23(2), 149–162 (1979)

    Article  MathSciNet  MATH  Google Scholar 

  21. Sanchez, V.E., Adoul, J.-P.: Low-delay wideband speech coding using a new frequency domain approach. In: Proceedings of ICASSP, IEEE, vol. 2, pp. 415–418 (1993)

    Google Scholar 

  22. Svedberg, J., Grancharov, V., Sverrisson, S., Norvell, E., Toftgård, T., Pobloth, H., Bruhn, S.: MDCT audio coding with pulse vector quantizers. In: Proceedings of ICASSP, pp. 5937–5941 (2015)

    Google Scholar 

  23. Valin, J.-M., Maxwell, G., Terriberry, T.B., Vos, K.: High-quality, low-delay music coding in the OPUS codec. In: Audio Engineering Society Convention 135. Audio Engineering Society (2013)

    Google Scholar 

  24. Witten, I.H., Neal, R.M., Cleary, J.G.: Arithmetic coding for data compression. Commun. ACM 30(6), 520–540 (1987)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tom Bäckström .

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this chapter

Cite this chapter

Bäckström, T. (2017). Frequency Domain Coding. In: Speech Coding. Signals and Communication Technology. Springer, Cham. https://doi.org/10.1007/978-3-319-50204-5_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-50204-5_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-50202-1

  • Online ISBN: 978-3-319-50204-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics