Skip to main content
  • 989 Accesses

Abstract

In this chapter, the principles of audio coding will be described, with emphasis on low delay audio coding. Audio coding is based on psycho-acoustic masking effects, as computed by psycho-acoustic models. To use the masking effects and to obtain a good compression ratio, filter banks are used. The principles of psycho-acoustics and of the design of filter banks are presented. Further a new low delay audio coding scheme based on prediction is shown.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Technical Council of the AES: CD “Perceptual audio coders: what to listen for,” Audio Engineering Society, New York.

    Google Scholar 

  2. N. Kitawaki and K. Itoh, “Pure delay effects on speech quality in telecommunications,” IEEE J. Sel. Areas in Comm., vol. 9, pp. 586–593, May 1991.

    Google Scholar 

  3. J.-H. Chen, R. V. Cox, Y.-C. Lin, N. Jayant, and M. J. Melchner, “A low-delay CELP coder for the CCITT 16 kb/s speech coding standard,” IEEE J. Sel Areas in Comm., vol. 10, pp. 830–849, June 1992.

    Google Scholar 

  4. B. Edler and G. Schuller, “Audio coding using a psychoacoustic pre-and post-filter,” ICASSP 2000, Istanbul, Turkey, pp. 11–881–884.

    Google Scholar 

  5. S. Haykin, Adaptive Filter Theory. Englewood Cliffs, N.J.: Prentice Hall, 1999.

    Google Scholar 

  6. A. Härmä, U. K. Laine, and M. Karjalainen, “Backward adaptive warped lattice for wideband stereo coding,” in Proc. of EUSIPCO’98, (Greece), 1998.

    Google Scholar 

  7. B. Edler, C. Faller, G. Schuller, “Perceptual Audio Coding Using a Time-Varying Linear Pre-and Post-filter,” AES Symposium, Los Angeles, CA, Sept. 2000

    Google Scholar 

  8. G. Schuller, B. Yu, D. Huang, “Lossless coding of audio signals using cascaded prediction,” in Proc. ICASSP, Salt Lake City, Utah, May 2001

    Google Scholar 

  9. S. Dorward, D. Huang, S. A. Savari, G. Schuller, and B. Yu, “Low Delay Perceptually Lossless Coding of Audio Signals,” Data Compression Conference, Snowbird, UT, March 2001, pp. 312–320

    Google Scholar 

  10. V. Madisetti, D. B. Williams, eds., The Digital Signal Processing Handbook, Chapter 42, D. Sinha et al., “The Perceptual Audio Coder (PAC),” CRC Press, Boca Raton, Fl., 1998.

    Google Scholar 

  11. ITU-R, “Methods for the subjective assessment of small impairments in audio systems including multichannel sound systems,” Rec. ITU-R BS. 1116-1, Geneva, 1997

    Google Scholar 

  12. U. Zölzer, Digital Audio Signal Processing, John Wiley & Sons, 1997.

    Google Scholar 

  13. J. D. Johnston, “Estimation of perceptual entropy using noise masking criteria,” in Proc. ICASSP, pp. 2524–2527, Apr. 1988.

    Google Scholar 

  14. M. Bosi and R. E. Goldberg, Introduction to Digital Audio Coding and Standards, Kluwer Academic Publishers, 2002.

    Google Scholar 

  15. E. Zwicker, H. Fastl, and H. Frater, Psychoacoustics: Facts and Models, Springer Verlag; 2nd edition, 1999.

    Google Scholar 

  16. P. P. Vaidyanathan, Multirate Systems and Filter Banks, Prentice Hall, 1993.

    Google Scholar 

  17. G. Schuller and T. Karp, “Modulated Filter Banks with Arbitrary system delay: efficient implementations and the time-varying Case,” IEEE Transactions on Signal Processing, pp. 737–748, Mar. 2000.

    Google Scholar 

  18. G. Schuller, “Time-varying filter banks with low delay for audio coding,” 105th AES Convention, San Francisco, CA, Sept. 26–29, 1998.

    Google Scholar 

  19. G. Schuller and M. J. T. Smith, “New framework for modulated perfect reconstruction filter banks,” IEEE Transactions on Signal Processing, vol.44, pp. 1941–1954, Aug. 1996.

    Google Scholar 

  20. V. Madisetti and D. B. Williams (Editors) The Digital Signal Processing Handbook, by CRC Press, Book and CD-ROM edition, 1997.

    Google Scholar 

  21. G. M. Phillips, “Echo and its effects on the telephone user,” Bell Laboratories Record, vol. 32, pp. 281–284, Aug. 1954.

    Google Scholar 

  22. G. Schuller and A. Harma, “Low delay audio compression using predictive coding,” in Proc. ICASSP, Orlando, FL, May 13–17, 2002.

    Google Scholar 

  23. J. Herre, “Temporal noise shaping, quantization and coding methods in perceptual audio coding: a tutorial introduction,” in AES 17th International Conference, Florence, Italy, Sept. 2–5, 1999.

    Google Scholar 

  24. E. Allamanche, R. Geiger, J. Herre, and T. Sporer, “MPEG-4 Low Delay Audio Coding based on the AAC Codec,” 106th AES Convention, Munich, Germany, May, 1999.

    Google Scholar 

  25. N. S. Jayant and P. Noll, Digital Coding of Waveforms, Prentice Hall, Englewood Cliffs, New Jersey, 1984.

    Google Scholar 

  26. F. K. Soong and B.-H. Juang, “Line spectrum pair (LSP) and speech data compression,” in Proc. ICASSP, 1984, pp. 1.10.1–1.10.4.

    Google Scholar 

  27. G. Schuller, B. Yu, D. Huang, and B. Edler, “Perceptual audio coding using adaptive Pre-and post-filters and lossless compression,” IEEE Trans. Speech Audio Processing, pp. 379–390, Sept. 2002.

    Google Scholar 

  28. A. Gelman, H. Stein, and D. Rubin, Bayesian Data Analysis, New York: Chapman & Hall, 1995.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Kluwer Academic Publishers

About this chapter

Cite this chapter

Schuler, G. (2004). Audio Coding. In: Huang, Y., Benesty, J. (eds) Audio Signal Processing for Next-Generation Multimedia Communication Systems. Springer, Boston, MA. https://doi.org/10.1007/1-4020-7769-6_11

Download citation

  • DOI: https://doi.org/10.1007/1-4020-7769-6_11

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4020-7768-5

  • Online ISBN: 978-1-4020-7769-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics