Digital Audio Compression

  • Mrinal Kr. Mandal
Part of The Springer International Series in Engineering and Computer Science book series (SECS, volume 716)


Audio data requires a large number of bits to represent. For example, CD-quality stereo audio requires a data rate of 176.4 Kbytes/s for transmission or storage. This bandwidth is too large for many applications, such as voice transmission over the Internet. Even when there is no live audio transmission, the storage cost may be high: an audio CD typically holds only up to 74 minutes of audio. It has been found that if audio data is compressed carefully, excellent-quality audio can be stored or transmitted at a much lower bit rate. In this chapter, we present the basic principles of audio compression techniques, followed by brief discussions of a few selected audio compression standards.
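
The 176.4 Kbytes/s figure can be reproduced with a short calculation. The sketch below is illustrative only: it assumes the usual CD audio parameters (44.1 kHz sampling, 16 bits per sample, two channels), which the abstract does not state, and the function names and the 128 kbit/s target are hypothetical choices made for this example.

```python
# Assumed CD audio parameters (not stated in the abstract).
SAMPLE_RATE_HZ = 44_100   # samples per second, per channel
BITS_PER_SAMPLE = 16      # linear PCM word length
CHANNELS = 2              # stereo


def pcm_bytes_per_second(sample_rate_hz: int, bits_per_sample: int, channels: int) -> int:
    """Uncompressed PCM data rate in bytes per second."""
    return sample_rate_hz * bits_per_sample * channels // 8


def storage_bytes(bytes_per_second: int, minutes: int) -> int:
    """Bytes needed to store the given duration at a constant data rate."""
    return bytes_per_second * minutes * 60


def compression_ratio(source_bits_per_second: int, target_bits_per_second: int) -> float:
    """Ratio of the uncompressed bit rate to the compressed bit rate."""
    return source_bits_per_second / target_bits_per_second


cd_rate = pcm_bytes_per_second(SAMPLE_RATE_HZ, BITS_PER_SAMPLE, CHANNELS)
cd_capacity = storage_bytes(cd_rate, 74)

# Hypothetical compressed target of 128 kbit/s, chosen only for illustration.
ratio = compression_ratio(cd_rate * 8, 128_000)

print(f"Data rate: {cd_rate / 1000:.1f} Kbytes/s")
print(f"74 minutes of audio: {cd_capacity / 1e6:.1f} Mbytes")
print(f"Ratio needed for 128 kbit/s: {ratio:.1f}:1")
```

Under these assumptions, 74 minutes of uncompressed audio occupies roughly 783 Mbytes, which gives a sense of why both transmission and storage benefit from compression.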


Keywords: Audio Signal; Audio Data; Linear Predictive Code; Pulse Code Modulation; Audio Sample
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer Science+Business Media New York 2003

Authors and Affiliations

  • Mrinal Kr. Mandal, University of Alberta, Canada
