Efficient Implementations of Perfect Reconstruction Low Delay Cosine-Modulated Filter Banks in the MPEG-4 AAC-ELD

  • Vladimir Britanak
  • K. R. Rao


The MPEG committee has recently completed development of a new audio coding standard, the MPEG-4 Advanced Audio Coding-Enhanced Low Delay (AAC-ELD). State-of-the-art MPEG audio coding standards, such as MPEG-4 AAC Low Complexity (AAC-LC), High Efficiency AAC (HE-AAC), and AAC Low Delay (AAC-LD), utilize the time-to-frequency transformation of an audio block and vice versa, the well-known time domain aliasing cancellation modified discrete cosine transform (TDAC-MDCT). In order to achieve low algorithmic delay, the AAC-ELD has adopted a perfect reconstruction low delay cosine-modulated filter bank, called the low delay MDCT (LD-MDCT). Although the use of LD-MDCT substantially reduces the algorithmic delays, the transform operations in the AAC-ELD codec are still computationally intensive and the LD-MDCT filter banks need to have fast algorithms. Therefore, this chapter is concentrated on the analysis/synthesis LD-MDCT filter banks used the AAC-ELD codec and mainly on their efficient implementations. This chapter presents: Definitions of the analysis/synthesis LD-MDCT (and TDAC-MDCT) filter banks, general symmetry properties of LD-MDCT transforms both in the time and frequency domains, relations between the LD-MDCT and TDAC-MDCT transforms in the analytical forms as well as in the equivalent matrix representations, and efficient implementations of the even-length analysis/synthesis LD-MDCT filter banks. For each fast LD-MDCT algorithm the complete formulae are derived. All the fast even-length LD-MDCT algorithms are investigated and compared in terms of arithmetic complexity and structural simplicity.


  1. 1.
    Information Technology – Coding of Audio-Visual Objects – Part 3: Audio, Amendment 9: Enhanced Low Delay AAC, ISO/IEC 14496-3:2005/FDAM 9:2007(E), N9499, Shenzhen, October 2007Google Scholar

MPEG-4 AAC-LC and HE-AAC Audio Coding Standards

  1. 2.
    Information Technology – Coding of Audio-Visual Objects – Part 3: Audio, Subpart 4: General Audio Coding (GA)-AAC, TwinVQ, BSAC, ISO/IEC 14496–3:2005(E) (2005)Google Scholar
  2. 3.
    M. Wolters, K. Kjörling, D. Homm, H. Purnhagen, A closer look into MPEG-4 high efficiency AAC, in 115th AES Convention, New York, NY, October 2003. Preprint #5871Google Scholar

MPEG-4 AAC-LD Audio Coding Standard

  1. 4.
    E. Allamanche, R. Geiger, J. Herre, T. Sporer, MPEG-4 low delay audio coding based on the AAC codec, in 106th AES Convention, Munich, May 1999. Preprint #4929Google Scholar
  2. 5.
    J. Hilpert et al., Real-time implementation of the MPEG-4 low delay advanced audio coding algorithm (AAC-LD) on Motorola DSP56300, in 108th AES Convention, Paris, February 2000. Preprint #5081Google Scholar
  3. 6.
    M. Lutzky, M. Schnell, M. Schmidt, R. Geiger, Structural analysis of low latency audio coding schemes, in 119th AES Convention, New York, NY, October 2005. Preprint #6601Google Scholar

MPEG-4 AAC-ELD Audio Coding Standard

  1. 7.
    M. Lutzky, M.L. Valero, M. Schnell, J. Hilpert, AAC-ELD v2 – the new state of the art in high quality communication audio coding, in 131st AES Convention, New York, NY, October 2011. Preprint #8516Google Scholar
  2. 8.
    M. Schnell et al., Enhanced MPEG-4 low delay AAC – low bitrate high quality communication, in 122nd AES Convention, Vienna, May 2007. Preprint #6998Google Scholar
  3. 9.
    M. Schnell et al., Low delay filter banks for enhanced low delay audio coding, in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, October 2007, pp. 235–238Google Scholar
  4. 10.
    M. Schnell et al., MPEG-4 enhanced low delay AAC – a new standard for high quality communication, in 125th AES Convention, San Francisco, CA, October 2008. Preprint #7503Google Scholar
  5. 11.
    M.L. Valero et al., A new parametric stereo and multichannel extension for MPEG-4 enhanced low delay AAC (AAC-ELD), in 128th AES Convention, London, May 2010. Preprint #8099Google Scholar

Efficient Implementations of LD-MDCT Filter Banks

  1. 12.
    V. Britanak, New fast algorithms for the low delay MDCT computation in the MPEG-4 AAC enhanced low delay audio coding standard. Signal Process. 105(12), 410–418 (2014)CrossRefGoogle Scholar
  2. 13.
    R.K. Chivukula, Y.A. Reznik, V. Devarajan, Efficient algorithms for MPEG-4 AAC-ELD, AAC-LD and AAC-LC filter banks, in Proceedings of the IEEE International Conference on Audio, Language and Image Processing, (ICALIP’2008), Shanghai, July 2008, pp. 1629–1634Google Scholar
  3. 14.
    R.K. Chivukula, Y.A. Reznik, Y. Hu, V. Devarajan, Fast algorithms for low-delay TDAC filter banks in MPEG-4 AAC-ELD. IEEE Trans. Audio Speech Lang. Process. 22(12), 1701–1712 (2014)CrossRefGoogle Scholar
  4. 15.
    Z.G. Gui, Y. Ge, D.Y. Zhang, J.S. Wu, Generalized fast mixed-radix algorithm for the computation of forward and inverse MDCTs. Signal Process. 92(2), 363–373 (2012)CrossRefGoogle Scholar
  5. 16.
    J. Wu, H. Shu, L. Senhadji, L. Luo, Mixed-radix algorithm for the computation of forward and inverse MDCTs. IEEE Trans. Circuits Syst. I: Regul. Pap. 56(4), 784–794 (2009)MathSciNetCrossRefGoogle Scholar

Existing Audio Broadcasting and Speech Communication Codecs

  1. 17.
    3GGP2 C.S0014-C v1.0, Enhanced variable rate codec, speech service Option 3, 68 and 70 for wide-band spread spectrum digital systems, 2007Google Scholar
  2. 18.
    R.K. Chivukula, Y.A. Reznik, Efficient implementation of a class of MDCT/IMDCT filterbanks for speech and audio coding, in Proceedings of the IEEE ICASSP’2008, Las Vegas, NV, March–April 2008, pp. 213–216Google Scholar
  3. 19.
    Digital Radio Mondiale (DRM): System Specification, ETSI ES 201 980 v3.1.1 (2009–08), ETSI Standard, August 2009 (available at website
  4. 20.
    W. Hoeg, T. Lauterbach (eds.), Digital Audio Broadcasting: Principles and Applications of DAB, DAB+ and DMB, chap. 3, 3rd edn. (Wiley, Chichester, 2009), pp. 93–165Google Scholar
  5. 21.
    ITU-T Recommendation G.722.1 Annex C, Low-complexity coding at 24 and 32 kbit/s for hands-free operation in systems with low frame loss, Annex C: 14 kHz at 24, 32 and 48 kbit/s, May 2005Google Scholar
  6. 22.
    ITU-T SG16 Q9 – Contribution 199: extended high-level description of the Q9 EV-VBR baseline codec, June 2007Google Scholar
  7. 23.
    L. Laaksonen et al., Super wide–band extension of G.718 and G.729.1 speech codec, in Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, Makuhari, September 2010Google Scholar
  8. 24.
    T. Vaillancourt et al., ITU-T EV-VBR: a robust 8–32 kbit/s scalable coder for error prone telecommunication channels, in Proceedings of the 16th European Signal Processing Conference, Lausanne, August 2008Google Scholar
  9. 25.
    M. Xie, D. Lindbergh, P. Chu, From ITU-T G.722.1 to ITU-T G.722.1 Annex C: a new low-complexity 14 kHz bandwidth audio coding standard, in Proceedings of the IEEE ICASSP’2006, vol. 5, Toulouse, May 2006, pp. 173–176, also published in J. Multimed. 2(2), 65–76 (2007)Google Scholar
  10. 26.
    M. Xie, P. Chu, A. Taleb, M. Briand, ITU-T G.719: a new low-complexity full-band (20 kHz) audio coding standard for high quality conversational applications, in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA’2009), New Paltz, NY, October 2009, pp. 265–268Google Scholar

Supporting Literature

  1. 27.
    V. Britanak, Survey of efficient MDCT implementations in MP3 audio coding standard: retrospective and state-of-the-art. Signal Process. 91(4), 1379–1394 (2011)CrossRefGoogle Scholar
  2. 28.
    V. Britanak, H.J. Lincklaen Arriëns, Fast computational structures for an efficient implementation of the complete TDAC analysis/synthesis MDCT/MDST filter banks. Signal Process. 89(7), 1379–1394 (2009)CrossRefGoogle Scholar
  3. 29.
    V. Britanak, P. Yip, K.R. Rao, Discrete Cosine and Sine Transforms: General Properties, Fast Algorithms and Integer Approximations (Academic Press Inc., Elsevier Science, Amsterdam, 2007)Google Scholar
  4. 30.
    R. Gluth, Regular FFT-related transform kernels for DCT/DST-based polyphase filter banks, in Proceedings of the IEEE ICASSP’91, Toronto, May 1991, pp. 2205–2208Google Scholar
  5. 31.
    R. Gluth, A unified approach to transform-based FIR filter banks with special regard to perfect reconstruction systems, in Proceedings of the IEEE ICASSP’93, vol. III, Minneaopolis, MN, April 1993, pp. 157–160Google Scholar
  6. 32.
    C.W. Kok, Fast algorithm for computing discrete cosine transform. IEEE Trans. Signal Process. 45(3), 757–760 (1997)CrossRefGoogle Scholar
  7. 33.
    H.S. Malvar, Signal Processing with Lapped Transforms, chap. 2 (Artech House, Norwood, MA, 1992), pp. 71–75Google Scholar
  8. 34.
    J.P. Princen, A.W. Johnson, A.B. Bradley, Subband/transform coding using filter bank designs based on time domain aliasing cancellation, in Proceedings of the IEEE ICASSP’87, Dallas, TX, April 1987, pp. 2161–2164Google Scholar
  9. 35.
    G.D.T. Schuller, M.J.T. Smith, New framework for modulated perfect reconstruction filter banks. IEEE Trans. Signal Process. 44(8), 1941–1954 (1996)CrossRefGoogle Scholar
  10. 36.
    G.D.T. Schuller, T. Karp, Modulated filter banks with arbitrary system delay: efficient implementations and time-varying case. IEEE Trans. Signal Process. 48(3), 737–748 (2000)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  • Vladimir Britanak
    • 1
  • K. R. Rao
    • 2
  1. 1.Institute of InformaticsSlovak Academy of SciencesBratislavaSlovakia
  2. 2.The University of Texas at ArlingtonArlingtonUSA

Personalised recommendations