Abstract
Perfect reconstruction cosine/sine-modulated filter banks, which belong to the class of modulated filter banks, have been studied extensively because of their attractive features: a simple structure, analysis and synthesis filters of equal length, and low computational complexity. Consequently, they have attracted great interest in audio coding applications and are employed in international speech and audio coding standards as well as in proprietary audio compression algorithms. The oddly and evenly stacked modified discrete cosine transform (MDCT) and the corresponding modified discrete sine transform (MDST), the modulated lapped transform (MLT), the extended lapped transforms (ELTs), and their biorthogonal versions are real-valued cosine/sine-modulated filter banks satisfying the perfect reconstruction property. The modulated complex lapped transform (MCLT) is a complex-valued filter bank whose real part is the MLT (equivalently, the oddly stacked MDCT) and whose imaginary part is the oddly stacked MDST. In this chapter, definitions, general properties, and matrix representations of the MDCT/MDST, MLT, ELT, and MCLT filter banks are presented. For an analysis/synthesis filter bank to achieve perfect reconstruction, the necessary and sufficient conditions imposed on the analysis and synthesis windowing functions play an important role. Therefore, the following topics are also derived and/or discussed in detail: the windowing procedure and the perfect reconstruction (biorthogonal) conditions for identical (nonidentical) analysis and synthesis windowing functions; the design of a windowing function, including definitions of windowing functions commonly used in audio coding applications; adaptive switching of transform block sizes and windowing functions; and general perfect reconstruction conditions for the ELT filter bank with a multiple overlapping factor, for both the orthogonal and biorthogonal cases.
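The perfect reconstruction property mentioned above can be illustrated numerically. The following is a minimal sketch, not the chapter's own formulation: it assumes the widely used oddly stacked MDCT definition with phase shift (N + 1)/2, a sine window used identically for analysis and synthesis, and a 2/N scaling of the inverse transform. The function names (`sine_window`, `mdct`, `imdct`, `analysis_synthesis`) are illustrative, and the normalization used in the chapter may differ.

```python
import math

def sine_window(N):
    """Sine window of length 2N (assumed here); w[n]^2 + w[n+N]^2 = 1."""
    return [math.sin(math.pi * (n + 0.5) / (2 * N)) for n in range(2 * N)]

def mdct(block, w):
    """Oddly stacked MDCT: 2N windowed samples -> N coefficients."""
    N = len(block) // 2
    return [
        sum(w[n] * block[n]
            * math.cos(math.pi / N * (n + 0.5 + N / 2) * (k + 0.5))
            for n in range(2 * N))
        for k in range(N)
    ]

def imdct(coeffs, w):
    """Inverse MDCT with synthesis windowing: N coefficients -> 2N
    time-aliased samples (assumed 2/N scaling)."""
    N = len(coeffs)
    return [
        w[n] * (2.0 / N)
        * sum(coeffs[k]
              * math.cos(math.pi / N * (n + 0.5 + N / 2) * (k + 0.5))
              for k in range(N))
        for n in range(2 * N)
    ]

def analysis_synthesis(x, N):
    """Transform 50%-overlapped blocks and overlap-add the outputs, so
    that the time domain aliasing of adjacent blocks cancels."""
    w = sine_window(N)
    y = [0.0] * len(x)
    for start in range(0, len(x) - 2 * N + 1, N):
        seg = imdct(mdct(x[start:start + 2 * N], w), w)
        for n in range(2 * N):
            y[start + n] += seg[n]
    return y

N = 8
x = [math.sin(0.3 * n) + 0.5 * math.cos(1.1 * n) for n in range(6 * N)]
y = analysis_synthesis(x, N)

# Only samples covered by two overlapping blocks are reconstructed;
# the first and last N samples of the signal still contain aliasing.
max_err = max(abs(x[n] - y[n]) for n in range(N, 5 * N))

# Deviation of the window from the power-complementarity condition.
w = sine_window(N)
pb_err = max(abs(w[n] ** 2 + w[n + N] ** 2 - 1.0) for n in range(N))
```

In this sketch each block maps 2N samples to only N coefficients, so a single block is not invertible on its own; reconstruction relies on the symmetric, power-complementary window and on the overlap-add of neighbouring blocks. The direct O(N²) sums are used for clarity only and are not a fast algorithm.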
Change history
08 January 2019
On p. v, in the third line from top, “in” was inserted after comma to read “. . ., and vice versa, in many sub-band/transform-based schemes . . .”
Copyright information
© 2018 Springer International Publishing AG
About this chapter
Cite this chapter
Britanak, V., Rao, K.R. (2018). MDCT/MDST, MLT, ELT, and MCLT Filter Banks: Definitions, General Properties, and Matrix Representations. In: Cosine-/Sine-Modulated Filter Banks. Springer, Cham. https://doi.org/10.1007/978-3-319-61080-1_3
DOI: https://doi.org/10.1007/978-3-319-61080-1_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-61078-8
Online ISBN: 978-3-319-61080-1
eBook Packages: Engineering, Engineering (R0)