Integer Approximate Cosine/Sine-Modulated Filter Banks

Britanak, Vladimir; Rao, K. R.

doi:10.1007/978-3-319-61080-1_9

Vladimir Britanak³ &
K. R. Rao⁴

648 Accesses

The original version of this chapter was revised. A correction to this chapter is available at https://doi.org/10.1007/978-3-319-61080-1_10

Abstract

Digital audio signals to be processed are integer-valued. But the perfect reconstruction cosine/sine-modulated filter banks and cosine-modulated QMF banks are real-valued transforms which map integer signal into real-valued spectral coefficients. Although their fast algorithms reduce the computational complexity, due to floating-point finite-length representation and corresponding rounding-off errors they cannot be used for lossless audio coding. Actually, almost all modern perceptual audio coding schemes developed so far operate in floating-point arithmetic and therefore, are lossy in nature. However, some audio coding applications require completely lossless preservation of the audio signal. An enabling technology for transform-based lossless audio coding is the integer transform. Integer transform is a transform which maps integers to integers by a reversible (invertible) way so that it preserves all mathematical properties of the original real-valued transform such as perfect reconstruction, energy compaction property and fast algorithm. Indeed, the integer modified discrete cosine transform (IntMDCT) or integer modulated lapped transform (IntMLT) enabled to design and implement this innovative coding technology for scalable lossy to lossless audio coding. The local and global methods to integer approximation of perfect reconstruction cosine/sine-modulated filter banks and cosine-modulated QMF banks are discussed in detail. They are based on computational methods of linear algebra, matrix theory and matrix computations, and in particular, on the matrix decompositions. In fact, the scalar and block matrix decompositions are powerful mathematical tools to construct the reversible (invertible) integer transforms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Change history

08 January 2019
On p. v, in the third line from top, “in” was inserted after comma to read “. . ., and vice versa, in many sub-band/transform-based schemes . . .”

References

J.A.S. Angus, Finite fields transforms for lossless audio signal processing, in 105th AES Convention, San Francisco, CA, September (1998). Preprint #483
Google Scholar
M. Hans, R.W. Schafer, Lossless compression of digital audio. IEEE Signal Process. Mag. 18(4), 21–32 (2001)
Article Google Scholar
ISO/IEC 14496–3:2005/Amd.2:2006, Coding of audio–visual objects – Part 3: audio, amendment 2: audio lossless coding (ALS), New Audio Profiles and BSAC Extensions. International Standards Organization, Geneva, Switzerland (2006)
Google Scholar
J. Koller, T. Sporer, K. Brandenburg, Improving lossless audio coding, in Proceedings of the 17th International AES Conference: High–quality Audio Coding, Florence, Italy, August (1999), pp. 299–306
Google Scholar
T. Liebchen, Y.A. Reznik, MPEG-4 ALS: an emerging standard for lossless audio coding, in Proceedings of Data Compression Conference (DCC’2006), Snowbird, UT, March (2006), pp. 439–448
Google Scholar
T. Moriya, N. Iwakami, A. Jin, T. Mori, A design of lossy and lossless scalable audio coding, in Proceedings of the IEEE ICASSP’2000, Istanbul, June (2000), pp. 889–892
Google Scholar
T. Moriya, A. Jin, T. Mori, K. Ikeda, T. Kaneko, Lossless scalable audio coder and quality enhancement, in Proceedings of the IEEE ICASSP’2002, vol. 2, Orlando, FL, May (2002), pp. 1829–1832
Google Scholar
M. Purat, T. Liebchen, P. Noll, Lossless transform coding of audio signals, in 102nd AES Convention, Munich, Germany, March (1997). Preprint #4414
Google Scholar

Integer DCT/DST (IntDCT/IntDST)

V. Britanak, P. Yip, K.R. Rao, Discrete Cosine and Sine Transforms: General Properties, Fast Algorithms and Integer Approximations, Chap. 5 (Academic, Elsevier Science, Amsterdam, 2007), pp. 141–304
Chapter Google Scholar
A.R. Calderbank, I. Daubechies, W. Sweldens, B.L. Yeo, Wavelet transforms that map integers to integers. Appl. Comput. Harmon. Anal. 5, 332–369 (1998)
Article MathSciNet MATH Google Scholar
R.J. Cintra, F.M. Bayer, C.J. Tablada, Low-complexity 8-point DCT approximations based on integer functions. Signal Process. 99(6), 201–214 (2014)
Article Google Scholar
V.K. Goyal, Transform coding with integer-to-integer transforms. IEEE Trans. Inf. Theory 46(2), 465–473 (2000)
Article MathSciNet MATH Google Scholar
P. Hao, Q. Shi, Matrix factorizations for reversible integer mapping. IEEE Trans. Signal Process. 49(10), 2314–2324 (2001)
Article MathSciNet MATH Google Scholar
X. Li, B. Tao, M.T. Orchard, On implementing transforms from integers to integers, in Proceedings of the IEEE International Conference on Image Processing (ICIP’98), Chicago, IL, October (1998), pp. 881–885
Google Scholar
G. Plonka, A global method for invertible integer DCT and integer wavelet algorithms. Appl. Comput. Harmon. Anal. 16(2), 90–110 (2004)
Article MathSciNet MATH Google Scholar
G. Plonka, M. Tasche, Invertible integer DCT algorithms. Appl. Comput. Harmon. Anal. 15(1), 70–88 (2003)
Article MathSciNet MATH Google Scholar
M. Primbs, Worst–case error analysis of lifting-based fast DCT-algorithms. IEEE Trans. Signal Process. 53(8), 3211–3218 (2005). Part 2
Article MathSciNet MATH Google Scholar
Y. She, P. Hao, On the necessity and sufficiency of PLUS factorizations. Linear Algebra Appl. 400, 193–202 (2005)
Article MathSciNet MATH Google Scholar
S. Srinivasan, Modulo transforms – An alternative to lifting. IEEE Trans. Signal Process. 54(5), 1864–1874 (2006)
Article MATH Google Scholar
C.J. Tablada, F.M. Bayer, R.J. Cintra, A class of DCT approximations based on the Feigh–Winograd algorithm. Signal Process. 113(8), 38–51 (2015)
Article Google Scholar
J. Wang, J. Sun, S. Yu, 1-D and 2-D transforms from integers to integers, in Proceedings of the IEEE ICASSP’2003, vol. 2, Hong Kong, April (2003), pp. 549–552
Google Scholar
L. Yang, P. Hao, Infinity-norm rotation transforms. IEEE Trans. Signal Process. 57(7), 2594–2603 (2009)
Article MathSciNet MATH Google Scholar

Integer FFT (IntFFT)

S.C. Chan, P.M. Yiu, An efficient multiplierless approximation of the fast Fourier transform using sum-of-powers-of–two (SOPOT) coefficients. IEEE Signal Process. Lett. 9(10), 322–325 (2002)
Article Google Scholar
S. Oraintara, Y.-J. Chen, T. Nguyen, Integer fast Fourier transform (IntFFT), in Proceedings of the IEEE ICASSP’2001, Salt Lake City, UT, May (2001)
Google Scholar
S. Oraintara, Y.-J. Chen, T. Nguyen, Integer fast Fourier transform. IEEE Trans. Signal Process. 50(3), 607–618 (2002)
Article MathSciNet MATH Google Scholar
K.R. Rao, D.N. Kim, J.J. Hwang, Integer fast fourier transform, in Fast Fourier Transform: Algorithms and Applications (Springer Science + Business Media B. V., Heidelberg, 2010), pp. 111–126
MATH Google Scholar
Y. Yokotani, S. Oraintara, R. Geiger, G. Schuller, K.R. Rao, A comparison of integer fast Fourier transforms for lossless coding, in Proceedings of the International Symposium on Communications and Information Technologies (ISCIT 2004), Sapporo, October (2004), pp. 1069–1073
Google Scholar

Reversible Integer Lapped Orthogonal Transforms and Integer MLT

S.C. Chan, P.M. Yiu, Multiplier–less discrete sinusoidal and lapped transforms using sum-of-powers-of-two (SOPOT) coefficients, in Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS’2001), vol. 2, Sydney, May (2001), pp. 13–16
Google Scholar
K. Komatsu, K. Sezaki, Design of lossless block transforms and filter banks for image coding. IEICE Trans. Fundam. E82–A(8), 1656–1664 (1999)
Google Scholar
K. Komatsu, K. Sezaki, Design of lossless LOT and its performance evaluation, in Proceedings of the IEEE ICASSP’2000, vol. 4, Istanbul, June (2000), pp. 2119–2122
Google Scholar
J. Li, A progressive to lossless embedded audio coder (PLEAC) with reversible modulated lapped transform, in Proceedings of the IEEE International Conference on Multimedia and Expo (ICME’2003), vol. 3, Baltimore, MD, July (2003), pp. 221–224
Google Scholar
H.S. Malvar, Lossless and near-lossless audio compression using integer-reversible modulated lapped transforms, in Proceedings of the IEEE Data Compression Conference (DCC’2007), Snowbird, UT, March (2007)
Google Scholar
T.D. Tran, The LiftLT: fast-lapped transforms via lifting steps. IEEE Signal Process Lett. 7(6), 145–148 (2000)
Article Google Scholar

Integer MDCT (IntMDCT) and Approximation Error

S.C. Chan, W. Yiu, K.L. Ho, Multiplierless perfect reconstruction modulated filter banks with sum-of-powers-of-two coefficients. IEEE Signal Process. Lett. 8(6), 163–166 (2001)
Article Google Scholar
R. Geiger, G. Schuller, Integer low delay and MDCT filter banks, in Proceedings of the 36th Asilomar Conference on Signals, Systems and Computers, vol. 1, Pacific Grove, CA, November (2002), pp. 811–815
Google Scholar
R. Geiger, Y. Yokotani, G. Schuller, Improved integer transforms for lossless audio coding, in Proceedings of the 37th Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, November (2003), pp. 2119–2123
Google Scholar
R. Geiger, T. Sporer, J. Koller, K. Brandenburg, Audio coding based on integer transforms, in 111th AES Convention, New York, NY, September (2001). Preprint #5471
Google Scholar
R. Geiger, J. Herre, J. Koller, K. Brandenburg, IntMDCT – A link between perceptual and lossless audio coding, in Proceedings of the IEEE ICASSP’2002, vol. 2, Orlando, FL, May (2002), pp. 1813–1816
Google Scholar
R. Geiger, J. Herre, G. Schuller, T. Sporer, Fine grain scalable perceptual and lossless audio coding based on IntMDCT, in Proceedings of the IEEE ICASSP’2003, vol. 5, Hong Kong, April (2003), pp. 445–448
Google Scholar
R. Geiger, G. Schuller, T. Sporer, Fine grain scalable perceptual and lossless audio coding based on IntMDCT, in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, October (2003), p. 50
Google Scholar
R. Geiger, Y. Yokotani, G. Schuller, J. Herre, Improved integer transforms using multidimensional lifting, in Proceedings of the IEEE ICASSP’2004, vol. 2, Montreal, May (2004), pp. 1005–1008
Google Scholar
H. Huang, R. Yu, X. Lin, S. Rahardja, Method for realising reversible integer type-IV discrete cosine transform. Electron. Lett. 40(8), 514–515 (2004)
Article Google Scholar
H. Huang, S. Rahardja, R. Yu, X. Lin, A fast algorithm of integer MDCT for lossless audio coding, in Proceedings of the IEEE ICASSP’2004, vol. 4, Montreal, May (2004), pp. 177–180
Google Scholar
H. Huang, S. Rahardja, R. Yu, X. Lin, Integer MDCT with enhanced approximation of the DCT-IV. IEEE Trans. Signal Process. 54(3), 1156–1159 (2006)
Article MATH Google Scholar
J. Li, Reversible FFT and MDCT via matrix lifting, in Proceedings of the IEEE ICASSP’2004, vol. 4, Montreal, May (2004), pp. 173–176
Google Scholar
J. Li, Low noise reversible MDCT (RMDCT) and its application in progressive-to-lossless embedded audio coding. IEEE Trans. Signal Process. 53(5), 1870–1880 (2005)
Article MathSciNet MATH Google Scholar
V.M. Prasad, C.D. Creusere, Analyzing reversible lapped transforms using Reng probing, in Proceedings of the 40th Asilomar Conference on Signals, System and Computers, Pacific Grove, CA, October–November (2006), pp. 873–877
Google Scholar
Y. Yokotani, S. Oraintara, Lossless audio compression using integer modified discrete cosine transform, in Proceedings of the IEEE International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS’2003), Awaji Island, December (2003), pp. 120–126
Google Scholar
Y. Yokotani, R. Geiger, G. Schuller, S. Oraintara, K.R. Rao, Improved lossless audio coding using the noise-shaped IntMDCT, in Proceedings of the IEEE 11th Digital Signal Processing Workshop and Signal Processing Education Workshop, Taos Ski Valley, NM, August (2004), pp. 356–360
Google Scholar
Y. Yokotani, S. Oraintara, R. Geiger, G. Schuller, K.R. Rao, Approximation error analysis for transform-based lossless audio coding, in Proceedings of the IEEE Global Telecommunications Conference (GLOBECOM’2004), vol. 2, Dallas, TX, November–December (2004), pp. 595–599
Google Scholar
Y. Yokotani, R. Geiger, G.D.T. Schuller, S. Oraintara, K.R. Rao, Lossless audio coding using the IntMDCT and rounding error shaping. IEEE Trans. Audio Speech Lang. Process. 14(6), 2201–2211 (2006)
Article Google Scholar
Y. Zhang, G. Gao, A scalable and lossless audio coding system based on integer transform, in Proceedings of the IEEE International Symposium on Communications and Information Technologies (ISCIT’2006), Bangkok, September–October (2006), pp. 414–417
Google Scholar
Y. Zhang, R. Hu, Scalable audio coding based on integer transform, in Proceedings of the IEEE 1st International Conference on Communications and Networking in China (ChinaCom’2006), Beijing, October (2006), pp. 1–5
Google Scholar

IntMDCT in MP3 Audio Coding

T. Krishnan, S. Oraintara, Fast and lossless implementation of the forward and inverse MDCT computation in MPEG audio coding, in Proceedings of of the IEEE International Symposium on Circuits and Systems (ISCAS’2002), vol. 2, Phoenix, Scottsdale, AR, May (2002), pp. 181–184
Google Scholar
T. Krishnan, S. Oraintara, The integer MDCT and its application in the MPEG layer III audio, in Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS’2003), vol. 4, Bangkok, May (2003), pp. 301–304
Google Scholar
H.-J. Quan, T. Zhang, Y.-N. Che, L. Zhao, The application of integer MDCT in MP3 audio, in Proceedings of the IEEE International Conference on Computer Science and Network Technology, Dalian, December (2011), pp. 1153–1157
Google Scholar
L. Wang, J. Wu, L. Senhadji, H. Shu, Comparison of three IntMDCT algorithms in audio compression. J. Southwest Univ. (Nat. Sci. Ed.). 42(2), 259–264 (2012) [in Chinesse]
Google Scholar

MPEG-4 HD-AAC/SLS Scalable Lossless Audio Coding Standard

R. Geiger, G. Schuller, J. Herre, R. Sperschneider, T. Sporer, Scalable perceptual and lossless audio coding based on MPEG-4 AAC, in 115th AES Convention, New York, NY, October (2003). Preprint #5868
Google Scholar
R. Geiger, R. Yu, J. Herre, S. Rahardja, S.-W. Kim, X. Lin, M. Schmidt, ISO/IEC MPEG-4 High-definition scalable advanced audio coding. J. Audio Eng. Soc. 55(1/2), 27–43 (2007)
Google Scholar
ISO/IEC JTC1/SC29/WG11 Moving Picture Experts Group, Call for proposals on MPEG-4 lossless audio coding. No. N5040, Awaji Island (2002)
Google Scholar
ISO/IEC JTC1/SC29/WG11 Moving Picture Experts Group, Final call for proposals on MPEG-4 lossless audio coding, No. N5208, Shanghai, China, October (2002)
Google Scholar
ISO/IEC JTC1/SC29/WG11 Moving Picture Experts Group, Workplan for audio scalable lossless audio coding (SLS). No. N5720, Trondheim, Norway, July (2003)
Google Scholar
ISO/IEC 14496–3:2005/Amd.3:2006, Coding of audio–visual objects – Part 3: audio, amendment 3: scalable lossless coding (SLS). International Standards Organization, Geneva, Switzerland (2006)
Google Scholar
T. Li, S. Rahardja, R. Yu, S. Koh, Study on rounding errors of IntMDCT in perceptual audio coding, in Proceedings of the 7th IEEE International Symposium on Multimedia (ISM’2005), Irvine, CA, December (2005)
Google Scholar
T. Li, S. Rahardja, R. Yu, S.N. Koh, On integer MDCT for perceptual audio coding. IEEE Trans. Audio Speech Lang. Process. 15(8), 2236–2248 (2007)
Article Google Scholar
R. Yu, X. Lin, S. Rahardja, C.C. Ko, A fine granular scalable perceptually lossy and lossless audio codec, in Proceedings of the IEEE International Conference on Multimedia & Expo (ICME’2003), vol. I, Baltimore, MD, July (2003), pp. 65–68
Google Scholar
R. Yu, X. Lin, S. Rahardja, C.C. Ko, A scalable lossy to lossless audio coder for MPEG-4 lossless audio coding, in Proceedings of the IEEE ICASSP’2004, vol. III, Montreal, Canada, May (2004), pp. 1004–1007
Google Scholar
R. Yu, R. Geiger, S. Rahardja, J. Herre, X. Lin, H. Huang, MPEG-4 scalable to lossless audio coding, in 117th AES Convention, San Francisco, CA, October (2004). Preprint #6183
Google Scholar
R. Yu, X. Lin, S. Rahardja, C. C. Ko, H. Huang, Improving coding efficiency for MPEG-4 audio scalable lossless coding, in Proceedings of the IEEE ICASSP’2005, vol. III, Philadephia, PA, May (2005), pp. 169–172
Google Scholar
R. Yu, X. Lin, S. Rahardja, H. Huang, MPEG-4 scalable to lossless audio coding – Emerging international standard for digital audio compression, in Proceedings of the IEEE 7th Workshop on Multimedia Signal Processing, Shanghai, China, October–November (2005), pp. 1–4
Google Scholar
R. Yu, S. Rahardja, X. Lin, C.C. Ko, A fine granular scalable to lossless audio coding. IEEE Trans. Audio Speech Lang. Process. 14(4), 1352–1363 (2006)
Article Google Scholar
R. Yu, T. Li, S. Rahardja, Perceptually enhanced bit–plane coding for scalable audio, in Proceedings of the IEEE International Conference on Multimedia & Expo (ICME’2006), Toronto, July (2006), pp. 1153–1156
Google Scholar

Supporting Literature

T.M. Apostol, Historical introduction, in Introduction to Analytic Number Theory (Springer, New York, 1976), pp. 1–12
Google Scholar
G. Bi, Y. Zeng, Transforms and Fast Algorithms for Signal Analysis and Representations, Chap. 6 (Birkhäuser, Boston, 2004), pp. 210–211
Chapter MATH Google Scholar
V. Britanak, New universal rotation-based fast computational structures for an efficient implementation of the DCT-IV/DST-IV and analysis/synthesis MDCT/MDST filter banks. Signal Process. 89(11), 2213–2232 (2009)
Article MATH Google Scholar
V. Britanak, H.J. Lincklaen Arriëns, Fast computational structures for an efficient implementation of the complete TDAC analysis/synthesis MDCT/MDST filter banks. Signal Process. 89(7), 1379–1394 (2009)
Article MATH Google Scholar
V. Britanak, K.R. Rao, A new fast algorithm for the unified forward and inverse MDCT/MDST computation. Signal Process. 82(3), 433–459 (2002)
Article MATH Google Scholar
M. Bosi, R.E. Goldberg, Audio coding standards, in Introduction to Digital Audio Coding and Standards, Part II (Springer Science+Business Media, New York, 2003)
Book Google Scholar
P. Duhamel, B. Piron, J.M. Etcheto, On computing the inverse DFT. IEEE Trans. Acoust. Speech Signal Process. 36(2), 285–286 (1988)
Article MATH Google Scholar
F.R. Gantmacher, The Theory of Matrices, 2nd edn. (Nauka, Moscow, 1966) [in Russian], English translation: Vols. 1 and 2, (Chelsea, New York, 1959)
MATH Google Scholar
G.H. Golub, C.F. Van Loan, Matrix Computations, 3rd edn. (The Johns Hopkins University Press, Baltimore, MD, 1996)
MATH Google Scholar
H.S. Malvar, Signal Processing with Lapped Transforms, Chap. 2 (Artech House, Norwood, MA, 1992), pp. 71–75
Google Scholar
H.J. Nussbaumer, Elements of number theory and polynomial algebra, in Fast Fourier Transform and Convolution Algorithms, Chap. 2 (Springer, Berlin, 1981), pp. 4–31
Google Scholar
G. Plonka, M. Tasche, Fast and numerically stable algorithms for discrete cosine transforms. Linear Algebra Appl. 394(1), 309–345 (2005)
Article MathSciNet MATH Google Scholar
J.P. Princen, A.W. Johnson, A.B. Bradley, Sub–band/transform coding using filter bank designs based on time domain aliasing cancellation, in Proceedings of the IEEE ICASSP’87, Dallas, TX, April (1987), pp. 2161–2164
Google Scholar
G.W. Stewart, Gaussian elimination, in Matrix Algorithms, Volume I: Basic Decompositions, Chap. 3 (SIAM Society for Industrial and Applied Mathematics, Philadelpha, PA, 1998), pp. 149–250
Google Scholar
J. Wu, H. Shu, L. Senhadji, L. Luo, Mixed-radix algorithm for the computation of forward and inverse MDCTs. IEEE Trans. Circuits Syst. Regul. Pap. 56(4), 784–794 (2009)
Article MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Informatics, Slovak Academy of Sciences, Bratislava, Slovakia
Vladimir Britanak
The University of Texas at Arlington, Arlington, TX, USA
K. R. Rao

Authors

Vladimir Britanak
View author publications
You can also search for this author in PubMed Google Scholar
K. R. Rao
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Britanak, V., Rao, K.R. (2018). Integer Approximate Cosine/Sine-Modulated Filter Banks. In: Cosine-/Sine-Modulated Filter Banks. Springer, Cham. https://doi.org/10.1007/978-3-319-61080-1_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-61080-1_9
Published: 04 August 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-61078-8
Online ISBN: 978-3-319-61080-1
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics