Advertisement

Spectral Band Replication Compression Technology: Efficient Implementations of Complex Exponential- and Cosine-Modulated QMF Banks

  • Vladimir Britanak
  • K. R. Rao
Chapter

Abstract

Spectral Band Replication (SBR) is an enhancement compression technology. The SBR is a bandwidth extension method which significantly improves the compression efficiency of perceptual audio and speech coding schemes. There are two versions of the SBR technology: Standard SBR and Low Delay SBR (LD-BR). Central to the operation of standard SBR and LD-SBR are dedicated complex exponential-modulated and real-valued cosine-modulated quadrature mirror filter (QMF) banks as the basic mathematical tools to analyze and synthesize audio signals. This chapter presents the complete unified efficient implementations of complex exponential-modulated and real-valued cosine-modulated QMF banks used both in the standard SBR and LD-SBR encoder and decoder. In general, for each QMF bank is presented: Definition in its equivalent block transform with a common parameter M representing the number of sub-bands, its general symmetry property in the frequency or time domain, and the derivation of a fast algorithm for its efficient implementation. All the fast algorithms are analyzed in detail in terms of the arithmetic complexity, regularity, and structural simplicity for a potential real-time low-cost implementation in hardware or software.

References

  1. 1.
    M. Dietz, L. Liljeryd, K. Kjörling, O. Kunz, Spectral band replication, a novel approach in audio coding, in 112th AES Convention, Munich, May 2002. Preprint #5553Google Scholar
  2. 2.
    P. Ekstrand, Bandwidth extension of audio signals by Spectral Band Replication, in Proceedings of the 1st IEEE Benelux Workshop on Model based Processing and Coding of Audio (MPCA’2002), Leuven, November 2002, pp. 53–58Google Scholar
  3. 3.
    H.-W. Hsu, W.-C. Lee, C.-M. Liu, H.-Y. Tseng, C.-H. Yang, High quality, low power QMF bank design for SBR, parametric coding, and MPEG surround decoders, in 122nd AES Convention, Vienna, May 2007. Preprint #7000Google Scholar
  4. 4.
    L. Liljeryd, P. Ekstrand, L. Henn, K. Kjörling, Source coding enhancement using spectral-band replication, US Patent Application #7283955 B2, Coding Technology Ab, Stockholm, October 2003Google Scholar
  5. 5.
    O. Shimada et al., A low power SBR algorithm for the MPEG-4 audio standard and its DSP implementation, in 116th AES Convention, Berlin, May 2004. Preprint #6048Google Scholar

Standard SBR Integrated into MPEG-1 Layer 2, MP3 and MPEG-4 AAC

  1. 6.
    Digital Radio Mondiale, available on web site: http://www.drm.org/
  2. 7.
    A. Ehret, M. Dietz, K. Kjörling, State-of-the-art audio coding for broadcasting and mobile applications, in 114th AES Convention, Amsterdam, March 2003. Preprint #5834Google Scholar
  3. 8.
    W. Hoeg, T. Lauterbach (eds.), Digital Audio Broadcasting: Principles and Applications of DAB, DAB+, 3rd edn. (Wiley, Chichester, 2009), pp. 93–165. Chapter 3: Audio services and applicationsGoogle Scholar
  4. 9.
    S. Meltzer, R. Böhm, F. Henn, SBR enhanced audio codecs for Digital Broadcasting such as Digital Radio Mondiale (DRM), in 112th AES Convention, Munich, April 2002. Preprint #5559Google Scholar
  5. 10.
    M. Schug, A. Gröschel, M. Beer, F. Henn, Enhancing audio coding efficiency of MPEG layer-2 with Spectral Band Replication (SBR) for DigitalRadio (EUREKA 147/DAB) in a backwards compatible way, in 114th AES Convention, Amsterdam, March 2003. Preprint #5850Google Scholar
  6. 11.
    XM Satellite Radio, available on web site: http://www.xmradio.com/
  7. 12.
    T. Ziegler, A. Ehret, P. Ekstrand, M. Lutzky, Enhancing mp3 with SBR: features and capabilities of the new mp3PRO algorithm, in 112th AES Convention, Munich, May 2002. Preprint #5560Google Scholar

Standard SBR in HE-AAC (aacPlus) and USAC

  1. 13.
    A.C. den Brinker et al., An overview of the coding standard MPEG-4 audio Amendments 1 and 2: HE-AAC, SSC and HE-AAC v2. EURASIP J. Audio Speech Music Process. (2009). Article ID 468971, 21 pp.Google Scholar
  2. 14.
    M. Dietz, S. Meltzer, CT-aacPlus – a state-of-the-art audio coding scheme, in European Broadcasting Union Technical Review, vol. 291, July 2002, 7 pp.Google Scholar
  3. 15.
    D. Frerichs, New MPEG-4 High-Efficiency AAC audio: enabling new applications, Document of Coding Technologies, Inc., April 2003, 8 pp.Google Scholar
  4. 16.
    Y. Gao, Audio coding standard overview: MPEG-4 AAC, HE-AAC, and HE-AAC v2, in Mobile Multimedia Broadcasting Standards: Technology and Practice, chap. 21, ed. by F.-L. Luo (Springer Science + Business Media LLC, New York, 2009), pp. 607–627CrossRefGoogle Scholar
  5. 17.
    J. Herre, M. Dietz, MPEG-4 high-efficiency AAC coding. IEEE Signal Process. Mag. 25(3), 137–142 (2008)CrossRefGoogle Scholar
  6. 18.
    Information Technology – Coding of Audio–Visual Objects – Part 3: Audio, Subpart 4: General Audio Coding (GA) — AAC, TwinVQ, BSAC, ISO/IEC 14496-3:2005(E) (2005)Google Scholar
  7. 19.
    M. Neuendorf et al., A novel scheme for low bitrate Unified Speech and Audio Coding – MPEG RM0, in 126th AES Convention, Munich, May 2009. Preprint #7713Google Scholar
  8. 20.
    M. Neuendorf et al., The ISO/MPEG Unified Speech and Audio Coding standard – consistent high quality for all content types and at all bit rates, in 132nd AES Convention, Budapest, April 2012. Preprint #8654, also published in J. Audio Eng. Soc. 61(12), 956–977 (2013)Google Scholar
  9. 21.
    S. Quackenbush, MPEG unified speech and audio coding. IEEE MultiMedia 20(2), 72–78 (2013)CrossRefGoogle Scholar
  10. 22.
    S.-U. Ryu, K. Rose, J.-H. Chang, Effective high frequency regeneration based on sinusoidal modeling for MPEG-4 HE-AAC, in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, October 2005, pp. 211–214Google Scholar
  11. 23.
    M. Wolters, K. Kjörling, D. Homm, H. Purnhagen, A closer look into MPEG-4 high efficiency AAC, in 115th AES Convention, New York, October 2003. Preprint #5871Google Scholar

Low Delay SBR in AAC-ELD

  1. 24.
    T. Friedrich, G. Schuller, Spectral Band Replication tool for very low delay audio coding applications, in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, October 2007, pp. 199–202Google Scholar
  2. 25.
    Information Technology – Coding of Audio-Visual Objects – Part 3: Audio, Amendment 9: Enhanced Low Delay AAC, ISO/IEC 14496-3:2005/FDAM 9:2007(E), Shenzen, October 2007Google Scholar
  3. 26.
    M. Schnell et al., Enhanced MPEG-4 low delay AAC – low bit rate high quality communication, in 122nd AES Convention, Vienna, May 2007. Preprint #6998Google Scholar
  4. 27.
    M. Schnell et al., Low delay filter banks for enhanced low delay audio coding, in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, October 2007, pp. 235–238Google Scholar
  5. 28.
    M. Schnell et al., MPEG-4 enhanced low delay AAC – a new standard for high quality communication, in 125nd AES Convention, San Francisco, CA, October 2008. Preprint #7503Google Scholar
  6. 29.
    M. Werner, G. Schuller, An enhanced SBR tool for low-delay applications, in 127th AES Convention, New York, NY, October 2009. Preprint #7876Google Scholar
  7. 30.
    M. Werner, G. Schuller, An SBR tool for very low delay applications with flexible crossover frequency. Proceedings of the IEEE ICASSP’2010, Dallas, TX, March 2010, pp. 353–356Google Scholar

Fast (Low Delay) HQ-SBR and LP-SBR QMF Banks

  1. 31.
    V. Britanak, Spectral Band Replication (SBR) compression technology: survey of the unified efficient low-cost implementations of complex exponential- and cosine-modulated QMF banks. Signal Process. 115(10), 49–65 (2015)CrossRefGoogle Scholar
  2. 32.
    L.-G. Chen, S.-W. Huang, T.-H. Tsai, Fast filterbanks for the low power MPEG high efficiency advanced audio coding decoder, in 118th AES Convention, Barcelona, May 2005. Preprint #6336Google Scholar
  3. 33.
    R.K. Chivukula, Y. Reznik, Low complexity Spectral Band Replication (SBR) filterbanks, US Patent Application #20100262427, Qualcomm Inc., San Diego, CA, October 2010Google Scholar
  4. 34.
    R.K. Chivukula, Y.A. Reznik, V. Devarajan, M. Jayendra-Lakshman, Fast algorithms for low-delay SBR filterbanks in MPEG-4 AAC-ELD. IEEE Trans. Audio Speech Lang. Process. 20(3), 1022–1031 (2012)CrossRefGoogle Scholar
  5. 35.
    Coding Technologies, Embedded aacPlus decoder implementations – product sheet, June 2013Google Scholar
  6. 36.
    Coding Technologies, Embedded HE-AAC: implementation guideline, June 2013Google Scholar
  7. 37.
    FAAD2 – open source free-ware Advanced Audio (AAC) decoder including SBR decoding, Free software at website http://www.audiocoding.com/ by M. Bakker, Nero AG, January 2011
  8. 38.
    S.-W. Huang, T.-H. Tsai, L.-G. Chen, Fast decomposition of filterbanks for the state-of-the-art audio coding. IEEE Signal Process. Lett. 12(10), 693–696 (2005)CrossRefGoogle Scholar
  9. 39.
    J. Huang, G. Du, D. Zhang, Y. Song, L. Geng, M. Gao, VLSI design of resource shared complex-QMF bank for HE-AAC decoder, in Proceedings of the 8th IEEE International Conference on ASIC (ASICON’2009), Changsha, October 2009, pp. 796–799Google Scholar
  10. 40.
    H.-W. Hsu, C.-M. Liu, W.-C. Lee, Fast complex quadrature mirror filterbanks for MPEG-4 HE-AAC, in 121st AES Convention, San Francisco, CA, October 2006. Preprint #6871Google Scholar
  11. 41.
    Intel(R) Integrated Performance Primitives Reference Manual, Vol. 1: Signal Processing, QMF Functions (Quadrature mirror filter banks used by MPEG-4 SBR decoder), Intel Developer Zone, available at web site: http://software.intel.com/

Supporting Literature

  1. 42.
    V. Britanak, On the discrete cosine transform computation. Signal Process. 40(2–3), 183–194 (1994)CrossRefGoogle Scholar
  2. 43.
    V. Britanak, New universal rotation-based fast computational structures for an efficient implementation of the DCT-IV/DST-IV and analysis/synthesis MDCT/MDST filter banks. Signal Process. 89(11), 2213–2232 (2009)CrossRefGoogle Scholar
  3. 44.
    V. Britanak, P. Yip, K.R. Rao, Discrete Cosine and Sine Transforms: General Properties, Fast Algorithms and Integer Approximations (Academic Press Inc., Elsevier Science, Amsterdam, 2007)Google Scholar
  4. 45.
    V. Britanak, H.J. Lincklaen Arriëns, Fast computational structures for an efficient implementation of the complete TDAC analysis/synthesis MDCT/MDST filter banks. Signal Process. 89(7), 1379–1394 (2009)CrossRefGoogle Scholar
  5. 46.
    S.C. Chan, K.L. Ho, Direct methods for computing discrete sinusoidal transforms. IEE Proc. Part F: Radar Signal Process. 137(6), 433–442 (1990)MathSciNetGoogle Scholar
  6. 47.
    R. Gluth, Regular FFT-related transform kernels for DCT/DST-based polyphase filter banks, in Proceedings of the IEEE ICASSP’91, Toronto, May 1991, pp. 2205–2208Google Scholar
  7. 48.
    C.W. Kok, Fast algorithm for computing discrete cosine transform. IEEE Trans. Signal Process. 45(3), 757–760 (1997)CrossRefGoogle Scholar
  8. 49.
    K. Konstantinides, Fast subband filtering in MPEG audio coding. IEEE Signal Process. Lett. 1(2), 26–28 (1994)CrossRefGoogle Scholar
  9. 50.
    C.-M. Liu, W.-C. Lee, A unified fast algorithm for cosine modulated filter banks in current audio coding standards. J. Audio Eng. Soc. 47(12), 1061–1075 (1999)Google Scholar
  10. 51.
    H.S. Malvar, Signal Processing with Lapped Transforms, chap. 2 (Artech House, Norwood, MA, 1992), pp. 71–75Google Scholar
  11. 52.
    N. Rama Murthy, M.N.S. Swamy, On the algorithms for the computation of even discrete cosine transform-2 (EDCT-2) of real sequences. IEEE Trans. Circuits Syst. 37(5), 625–627 (1990)CrossRefGoogle Scholar
  12. 53.
    K.R. Rao, D.N. Kim, J.J. Hwang, Fast Fourier Transform: Algorithms and Applications (Springer Science + Business Media B.V., New York, 2010)CrossRefGoogle Scholar
  13. 54.
    G.D.T. Schuller, T. Karp, Modulated filter banks with arbitrary system delay: efficient implementations and time-varying case. IEEE Trans. Signal Process. 48(3), 737–748 (2000)CrossRefGoogle Scholar
  14. 55.
    G.D.T. Schuller, M.J.T. Smith, New framework for modulated perfect reconstruction filter banks. IEEE Trans. Signal Process. 44(8), 1941–1954 (1996)CrossRefGoogle Scholar
  15. 56.
    P.P. Vaidyanathan, Cosine modulated filter banks, in Multirate Systems and Filter Banks, chap. 8 (Prentice–Hall, Englewood Cliffs, NJ, 1993), pp. 353–393Google Scholar
  16. 57.
    M. Vetterli, H.J. Nussbaumer, Simple FFT and DCT algorithms with reduced number of operations. Signal Process. 6(4), 267–278 (1984)MathSciNetCrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  • Vladimir Britanak
    • 1
  • K. R. Rao
    • 2
  1. 1.Institute of InformaticsSlovak Academy of SciencesBratislavaSlovakia
  2. 2.The University of Texas at ArlingtonArlingtonUSA

Personalised recommendations