Efficient Implementations of Perfect Reconstruction Low Delay Cosine-Modulated Filter Banks in the MPEG-4 AAC-ELD

Cosine-/Sine-Modulated Filter Banks

Abstract

The MPEG committee has recently completed development of a new audio coding standard, MPEG-4 Advanced Audio Coding-Enhanced Low Delay (AAC-ELD). State-of-the-art MPEG audio coding standards, such as MPEG-4 AAC Low Complexity (AAC-LC), High Efficiency AAC (HE-AAC), and AAC Low Delay (AAC-LD), use the well-known time domain aliasing cancellation modified discrete cosine transform (TDAC-MDCT) to transform an audio block from the time domain to the frequency domain and back. To achieve low algorithmic delay, AAC-ELD has adopted a perfect reconstruction low delay cosine-modulated filter bank, called the low delay MDCT (LD-MDCT). Although the LD-MDCT substantially reduces the algorithmic delay, the transform operations in the AAC-ELD codec are still computationally intensive, so the LD-MDCT filter banks need fast algorithms. This chapter therefore concentrates on the analysis/synthesis LD-MDCT filter banks used in the AAC-ELD codec, and mainly on their efficient implementations. It presents definitions of the analysis/synthesis LD-MDCT (and TDAC-MDCT) filter banks, general symmetry properties of the LD-MDCT transforms in both the time and frequency domains, relations between the LD-MDCT and TDAC-MDCT transforms in analytical form as well as in equivalent matrix representations, and efficient implementations of the even-length analysis/synthesis LD-MDCT filter banks. For each fast LD-MDCT algorithm, the complete formulae are derived. All the fast even-length LD-MDCT algorithms are investigated and compared in terms of arithmetic complexity and structural simplicity.
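As an illustration of the TDAC-MDCT analysis/synthesis principle mentioned in the abstract, the following is a minimal sketch of a direct (not fast) MDCT/IMDCT pair with overlap-add. It is not taken from the chapter: the function names, the choice of a sine window, and the zero-padding at the signal edges are assumptions made for this example, and the LD-MDCT itself, with its AAC-ELD window, is not reproduced here.

```python
import math


def sine_window(N):
    """Sine window of length 2N (assumed here); satisfies w[n]^2 + w[n+N]^2 = 1."""
    return [math.sin(math.pi * (n + 0.5) / (2 * N)) for n in range(2 * N)]


def mdct(block, window):
    """Direct forward MDCT: 2N windowed samples -> N coefficients, O(N^2)."""
    N = len(block) // 2
    return [
        sum(
            window[n] * block[n]
            * math.cos(math.pi / N * (n + 0.5 + N / 2) * (k + 0.5))
            for n in range(2 * N)
        )
        for k in range(N)
    ]


def imdct(coeffs, window):
    """Direct inverse MDCT: N coefficients -> 2N windowed, time-aliased samples."""
    N = len(coeffs)
    return [
        (2.0 / N) * window[n]
        * sum(
            coeffs[k] * math.cos(math.pi / N * (n + 0.5 + N / 2) * (k + 0.5))
            for k in range(N)
        )
        for n in range(2 * N)
    ]


def analysis_synthesis(signal, N):
    """Run 50%-overlapped MDCT analysis and IMDCT synthesis with overlap-add.

    Assumes len(signal) is a multiple of N. The signal is zero-padded by N
    samples on each side so that every input sample is covered by two blocks.
    """
    w = sine_window(N)
    padded = [0.0] * N + list(signal) + [0.0] * N
    out = [0.0] * len(padded)
    for start in range(0, len(padded) - 2 * N + 1, N):
        y = imdct(mdct(padded[start:start + 2 * N], w), w)
        for n in range(2 * N):
            out[start + n] += y[n]  # aliasing terms cancel between neighbours
    return out[N:N + len(signal)]


if __name__ == "__main__":
    x = [math.sin(0.3 * i) + 0.1 * i for i in range(16)]
    x_rec = analysis_synthesis(x, N=4)
    print(max(abs(a - b) for a, b in zip(x, x_rec)))  # reconstruction error
```

Each block on its own is not invertible, since 2N samples map to only N coefficients. The time-domain aliasing introduced in one block is cancelled by its overlapping neighbours, which is why the reconstruction error printed above should be at the level of floating-point rounding. The direct O(N^2) sums are used only for clarity.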

Copyright information

© 2018 Springer International Publishing AG

About this chapter

Cite this chapter

Britanak, V., Rao, K.R. (2018). Efficient Implementations of Perfect Reconstruction Low Delay Cosine-Modulated Filter Banks in the MPEG-4 AAC-ELD. In: Cosine-/Sine-Modulated Filter Banks. Springer, Cham. https://doi.org/10.1007/978-3-319-61080-1_8

  • DOI: https://doi.org/10.1007/978-3-319-61080-1_8

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-61078-8

  • Online ISBN: 978-3-319-61080-1

  • eBook Packages: Engineering, Engineering (R0)
