Low-complexity 8-point DCT approximation based on angle similarity for image and video coding

Oliveira, Raíza S.; Cintra, Renato J.; Bayer, Fábio M.; da Silveira, Thiago L. T.; Madanayake, Arjuna; Leite, André

doi:10.1007/s11045-018-0601-5

Low-complexity 8-point DCT approximation based on angle similarity for image and video coding

Published: 20 July 2018

Volume 30, pages 1363–1394, (2019)
Cite this article

Multidimensional Systems and Signal Processing Aims and scope Submit manuscript

Raíza S. Oliveira^1,2,
Renato J. Cintra^2,3,
Fábio M. Bayer⁴,
Thiago L. T. da Silveira⁵,
Arjuna Madanayake⁶ &
…
André Leite²

648 Accesses
23 Citations
3 Altmetric
Explore all metrics

Abstract

The principal component analysis (PCA) is widely used for data decorrelation and dimensionality reduction. However, the use of PCA may be impractical in real-time applications, or in situations were energy and computing constraints are severe. In this context, the discrete cosine transform (DCT) becomes a low-cost alternative to data decorrelation. This paper presents a method to derive computationally efficient approximations to the DCT. The proposed method aims at the minimization of the angle between the rows of the exact DCT matrix and the rows of the approximated transformation matrix. The resulting transformations matrices are orthogonal and have extremely low arithmetic complexity. Considering popular performance measures, one of the proposed transformation matrices outperforms the best competitors in both matrix error and coding capabilities. Practical applications in image and video coding demonstrate the relevance of the proposed transformation. In fact, we show that the proposed approximate DCT can outperform the exact DCT for image encoding under certain compression ratios. The proposed transform and its direct competitors are also physically realized as digital prototype circuits using FPGA technology.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Perceptual image quality assessment: a survey

Article 26 April 2020

Guangtao Zhai & Xiongkuo Min

Brief review of image denoising techniques

Article Open access 08 July 2019

Linwei Fan, Fan Zhang, … Caiming Zhang

A robust digital image watermarking technique in LWT-DCT domain using particle swarm optimization and statistical distortion correction

Article 13 April 2024

Saharul Alom Barlaskar, Anish Monsley Kirupakaran, … Taimoor Khan

References

Ahmed, N., Natarajan, T., & Rao, K. R. (1974). Discrete cosine transform. IEEE Transactions on Computers, C-23(1), 90–93.
Álvarez-Cortés, S., Amrani, N., Hernández-Cabronero, M., & Serra-Sagristà, J. (2017). Progressive lossy-to-lossless coding of hyperspectral images through regression wavelet analysis. International Journal of Remote Sensing, 39, 1–21.
Google Scholar
Arai, Y., Agui, T., & Nakajima, M. (1988). A fast DCT-SQ scheme for images. Transactions of the IEICE, E-71(11), 1095–1097.
Bae, J., & Yoo, H. (2017). Analysis of color transforms for lossless frame memory compression. International Journal of Applied Engineering Research, 12(24), 15664–15667.
Google Scholar
Bayer, F. M., & Cintra, R. J. (2010). Image compression via a fast DCT approximation. IEEE Latin America Transactions, 8(6), 708–713.
Article Google Scholar
Bayer, F. M., & Cintra, R. J. (2012). DCT-like transform for image compression requires 14 additions only. Electronics Letters, 48(15), 919–921.
Article Google Scholar
Bayer, F. M., Cintra, R. J., Edirisuriya, A., & Madanayake, A. (2012). A digital hardware fast algorithm and FPGA-based prototype for a novel 16-point approximate DCT for image compression applications. Measurement Science and Technology, 23(8), 114010.
Article Google Scholar
Bjøntegaard, G. (2001). Calculation of average PSNR differences between RD-curves. In 13th VCEG Meeting, Austin, TX, USA, Apr 2001, document VCEG-M33.
Blahut, R. E. (2010). Fast algorithms for signal processing. Cambridge: Cambridge University Press.
Book MATH Google Scholar
Bossen, F. (2013). Common test conditions and software reference configurations, San Jose, CA, USA, Feb 2013, document JCT-VC L1100.
Bouguezel, S., Ahmad, M. O., & Swamy, M. N. S. (2008). Low-complexity \(8\times 8\) transform for image compression. Electronics Letters, 44(21), 1249–1250.
Bouguezel, S., Ahmad, M. O., & Swamy, M. N. S. (2009). A fast \(8\times 8\) transform for image compression. In 2009 international conference on microelectronics (ICM) (pp. 74–77), Dec 2009.
Bouguezel, S., Ahmad, M. O., & Swamy, M. N. S. (2011). A low-complexity parametric transform for image compression. In Proceedings of the 2011 IEEE international symposium on circuits and systems, May 2011.
Bouguezel, S., Ahmad, M. O., & Swamy, M. N. S. (2013). Binary discrete cosine and Hartley transforms. IEEE Transactions on Circuits and Systems I: Regular Papers, 60(4), 989–1002.
Article MathSciNet Google Scholar
Britanak, V., Yip, P., & Rao, K. R. (2007). Discrete cosine and sine transforms. New York: Academic Press.
Google Scholar
Cham, W. K. (1989). Development of integer cosine transforms by the principle of dyadic symmetry. IEE Proceedings I Communications, Speech and Vision, 136(4), 276–282.
Article Google Scholar
Chan, R. K. W., & Lee, M.-C. (2006). Multiplierless fast DCT algorithms with minimal approximation errors. In International conference on pattern recognition (Vol. 3, pp. 921–925). Los Alamitos, CA, USA: IEEE Computer Society.
Chen, W. H., Smith, C., & Fralick, S. (1977). A fast computational algorithm for the discrete cosine transform. IEEE Transactions on Communications, 25(9), 1004–1009.
Article MATH Google Scholar
Chen, H., & Zeng, B. (2012). New transforms tightly bounded by DCT and KLT. IEEE Signal Processing Letters, 19(6), 344–347.
Article Google Scholar
Choi, K., Lee, S., & Jang, E. S. (2010). Zero coefficient-aware IDCT algorithm for fast video decoding. IEEE Transactions on Consumer Electronics, 56(3), 1822–1829.
Article Google Scholar
Cintra, R. J. (2011). An integer approximation method for discrete sinusoidal transforms. Journal of Circuits, Systems, and Signal Processing, 30(6), 1481–1501.
Article MathSciNet MATH Google Scholar
Cintra, R. J., & Bayer, F. M. (2011). A DCT approximation for image compression. IEEE Signal Processing Letters, 18(10), 579–582.
Article Google Scholar
Cintra, R. J., Bayer, F. M., & Tablada, C. J. (2014). Low-complexity 8-point DCT approximations based on integer functions. Signal Processing, 99, 201–214.
Article Google Scholar
Clarke, R. J. (1981). Relation between the Karhunen-Loève and cosine transforms. IEEE Proceedings F Communications, Radar and Signal Processing, 128(6), 359–360.
Article Google Scholar
Cormen, T., Leiserson, C., Rivest, R., & Stein, C. (2001). Introduction to algorithms, chapter 16. Cambridge: MIT Press.
MATH Google Scholar
Coutinho, V. A., Cintra, R. J., Bayer, F. M., Kulasekera, S., & Madanayake, A. (2015). A multiplierless pruned DCT-like transformation for image and video compression that requires ten additions only. Journal of Real-Time Image Processing, 12, 1–9.
Google Scholar
Dimitrov, V., Jullien, G., & Miller, W. (1998). A new DCT algorithm based on encoding algebraic integers. In Proceedings of the 1998 IEEE international conference on acoustics, speech and signal processing, 1998 (Vol. 3, pp. 1377–1380 ), May 1998.
Dunteman, G. H. (1989). Principal components analysis (Vol. 69). Beverly Hills: Sage.
Book Google Scholar
Feig, E., & Winograd, S. (1992). Fast algorithms for the discrete cosine transform. IEEE Transactions on Signal Processing, 40(9), 2174–2193.
Article MATH Google Scholar
Fong, C.-K., & Cham, W.-K. (2012). LLM integer cosine transform and its fast algorithm. IEEE Transactions on Circuits and Systems for Video Technology, 22(6), 844–854.
Article Google Scholar
Gonzalez, R. C., & Woods, R. E. (2006). Digital image processing (3rd ed.). Upper Saddle River, NJ: Prentice-Hall Inc.
Google Scholar
Gorban, A. N., Kgl, B., Wunsch, D. C., & Zinovyev, A. (2007). Principal manifolds for data visualization and dimension reduction (1st ed.). Springer Publishing Company, Incorporated.
Goyal, V. K. (2001). Theoretical foundations of transform coding. IEEE Signal Processing Magazine, 18(5), 9–21.
Article Google Scholar
Han, J., Xu, Y., & Mukherjee, D. (2013). A butterfly structured design of the hybrid transform coding scheme. In Picture coding symposium (PCS), 2013 (pp. 17–20). IEEE.
Hanhart, P., & Ebrahimi, T. (2014). Calculation of average coding efficiency based on subjective quality scores. Journal of Visual Communication and Image Representation, 25(3), 555–564. qoE in 2D/3D Video Systems.
Article Google Scholar
Haweel, T. I. (2001). A new square wave transform based on the DCT. Signal Processing, 82, 2309–2319.
Article MATH Google Scholar
Heideman, M. T., & Burrus, C. S. (1988). Multiplicative complexity, convolution, and the DFT, ser. Signal processing and digital filtering. Berlin: Springer.
Higham, N. J. (1986). Computing the polar decomposition—With applications. SIAM Journal on Scientific and Statistical Computing, 7(4), 1160–1174.
Article MathSciNet MATH Google Scholar
Higham, N. J. (1987). Computing real square roots of a real matrix. Linear Algebra and Its Applications, 88–89, 405–430.
Article MathSciNet MATH Google Scholar
Higham, N. J. (2008). Functions of matrices: Theory and computation, ser. SIAM e-books. Society for Industrial and Applied Mathematics (SIAM, 3600 Market Street, Floor 6, Philadelphia, PA 19104).
Higham, N. J., & Schreiber, R. S. (1988). Fast polar decomposition of an arbitrary matrix, Ithaca, NY, USA, Tech. Rep., October 1988.
Hou, H. S. (1987). A fast recursive algorithm for computing the discrete cosine transform. IEEE Transactions on Acoustic, Signal, and Speech Processing, 6(10), 1455–1461.
Google Scholar
International Telecommunication Union. (1990). ITU-T recommendation H.261 version 1: Video codec for audiovisual services at \(p \times 64\) kbits, ITU-T, Tech. Rep.
International Telecommunication Union. (1995). ITU-T recommendation H.263 version 1: Video coding for low bit rate communication, ITU-T, Tech. Rep.
Jammalamadaka, S., & Sengupta, A. (2001). Topics in circular statistics, ser. Series on multivariate analysis. Singapore: World Scientific.
Joint Collaborative Team on Video Coding (JCT-VC), “HEVC reference software documentation”. (2013). Fraunhofer Heinrich Hertz Institute. [Online]. Available: https://hevc.hhi.fraunhofer.de/. Accessed 19 Sept 2016.
Jolliffe, I. (2002). Principal component analysis. New York: Wiley Online Library.
MATH Google Scholar
Jridi, M., Alfalou, A., & Meher, P. K. (2015). A generalized algorithm and reconfigurable architecture for efficient and scalable orthogonal approximation of DCT. IEEE Transactions on Circuits and Systems I: Regular Papers, 62(2), 449–457.
Article Google Scholar
Katto, J., & Yasuda, Y. (1991). Performance evaluation of subband coding and optimization of its filter coefficients. Journal of Visual Communication and Image Representation, 2(4), 303–313.
Article Google Scholar
Le Gall, D. J. (1992). The MPEG video compression algorithm. Signal Processing: Image Communication, 4(2), 129–140.
Google Scholar
Lee, B. G. (1984). A new algorithm for computing the discrete cosine transform. IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP-32, 1243–1245.
Lengwehasatit, K., & Ortega, A. (2004). Scalable variable complexity approximate forward DCT. IEEE Transactions on Circuits and Systems for Video Technology, 14(11), 1236–1248.
Article Google Scholar
Liang, J., & Tran, T. D. (2001). Fast multiplierless approximation of the DCT with the lifting scheme. IEEE Transactions on Signal Processing, 49, 3032–3044.
Article Google Scholar
Loeffler, C., Ligtenberg, A., & Moschytz, G. (1989). Practical fast 1D DCT algorithms with 11 multiplications. In Proceedings of the international conference on acoustics, speech, and signal processing (pp. 988–991), May 1989.
Luthra, A., Sullivan, G. J., & Wiegand, T. (2003). Introduction to the special issue on the H.264/AVC video coding standard. IEEE Transactions on Circuits and Systems for Video Technology, 13(7), 557–559.
Article Google Scholar
Mahan, R. P. (1991) Circular statistical methods: Applications in spatial and temporal performance analysis, ser. Special report. U.S. Army Research Institute for the Behavioral and Social Sciences.
Mardia, K., & Jupp, P. (2009). Directional statistics, ser. Wiley series in probability and statistics. New York: Wiley.
Masera, M., Martina, M., & Masera, G. (2017a). Odd type DCT/DST for video coding: Relationships and low-complexity implementations. In 2017 IEEE International Workshop on Signal Processing Systems (SiPS), pp. 1–6. IEEE.
Masera, M., Martina, M., & Masera, G. (2017b). Adaptive approximated dct architectures for HEVC. IEEE Transactions on Circuits and Systems for Video Technology, 27(12), 2714–2725.
Google Scholar
Meher, P. K., Park, S. Y., Mohanty, B. K., Lim, K. S., & Yeo, C. (2014). Efficient integer DCT architectures for HEVC. IEEE Transactions on Circuits and Systems for Video Technology, 24(1), 168–178.
Article Google Scholar
Ohm, J.-R., Sullivan, G. J., Schwarz, H., Tan, T. K., & Wiegand, T. (2012). Comparison of the coding efficiency of video coding standards—Including high efficiency video coding (HEVC). IEEE Transactions on Circuits and Systems for Video Technology, 22(12), 1669–1684.
Article Google Scholar
Pao, I.-M., & Sun, M.-T. (1998). Approximation of calculations for forward discrete cosine transform. IEEE Transactions on Circuits and Systems for Video Technology, 8(3), 264–268.
Article Google Scholar
Park, J.-S., Nam, W.-J., Han, S.-M., & Lee, S.-S. (2012). 2-D large inverse transform (\(16\times 16\), \(32\times 32\)) for HEVC (high efficiency video coding). JSTS: Journal of Semiconductor Technology and Science, 12(2), 203–211.
Article Google Scholar
Potluri, U. S., Madanayake, A., Cintra, R. J., Bayer, F. M., Kulasekera, S., & Edirisuriya, A. (2014). Improved 8-point approximate DCT for image and video compression requiring only 14 additions. IEEE Transactions on Circuits and Systems I: Regular Papers, 61(6), 1727–1740.
Article Google Scholar
Pourazad, M. T., Doutre, C., Azimi, M., & Nasiopoulos, P. (2012). HEVC: The new gold standard for video compression: How does HEVC compare with H.264/AVC? IEEE Consumer Electronics Magazine, 1(3), 36–46.
Article Google Scholar
Puri, A. (2004). Video coding using the H.264/MPEG-4 AVC compression standard. Signal Processing: Image Communication, 19, 793–849.
Google Scholar
Rao, K. R., & Yip, P. (1990). Discrete cosine transform: Algorithms, advantages, applications. San Diego, CA: Academic Press.
Book MATH Google Scholar
Salomon, D., Motta, G., & Bryant, D. (2007). Data compression: The complete reference, ser. Molecular biology intelligence unit. Berlin: Springer.
Seber, G. A. F. (2008). A matrix handbook for statisticians, ser. Wiley series in probability and mathematical statistics. New York: Wiley.
Senapati, R. K., Pati, U. C., & Mahapatra, K. K. (2010). A low complexity orthogonal \(8\times 8\) transform matrix for fast image compression. In Proceeding of the annual IEEE India conference (INDICON), Kolkata, India (pp. 1–4).
Snigdha, F. S., Sengupta, D., Hu, J., & Sapatnekar, S. S. (2016). Optimal design of JPEG hardware under the approximate computing paradigm. In Proceedings of the 53rd annual design automation conference (p. 106). ACM.
Strang, G. (1988). Linear algebra and its applications. Belmont: Brooks Cole.
MATH Google Scholar
Sullivan, G. J., Ohm, J.-R., Han, W.-J., & Wiegand, T. (2012). Overview of the high efficiency video coding (HEVC) standard. IEEE Transactions on Circuits Systems for Video Technology, 22(12), 1649–1668.
Article Google Scholar
Suzuki, T., & Ikehara, M. (2010). Integer DCT based on direct-lifting of DCT-IDCT for lossless-to-lossy image coding. IEEE Transactions on Image Processing, 19(11), 2958–2965.
Article MathSciNet MATH Google Scholar
Tablada, C. J., Bayer, F. M., & Cintra, R. J. (2015). A class of DCT approximations based on the Feig–Winograd algorithm. Signal Processing, 113, 38–51.
Article Google Scholar
Thomakos, D. (2016). Smoothing non-stationary time series using the discrete cosine transform. Journal of Systems Science and Complexity, 29(2), 382–404.
Article MathSciNet MATH Google Scholar
USC-SIPI Image Database. (2017). University of Southern California. [Online]. Available: http://sipi.usc.edu/database/. Accessed 15 Sept 2016.
Wallace, G. K. (1992). The JPEG still picture compression standard. IEEE Transactions on Consumer Electronics, 38(1), xviii–xxxiv.
Wang, Z. (2011). Combined DCT and companding for PAPR reduction in OFDM signals. Journal of Signal and Information Processing, 2(2), 100–104.
Article Google Scholar
Wang, Z., & Bovik, A. C. (2009). Mean squared error: Love it or leave it? A new look at signal fidelity measures. IEEE Signal Processing Magazine, 26(1), 98–117.
Article Google Scholar
Wang, Z., Bovik, A. C., Sheikh, H. R., & Simoncelli, E. P. (2004). Image quality assessment: From error visibility to structural similarity. IEEE Transactions on Image Processing, 13(4), 600–612.
Article Google Scholar
Watkins, D. S. (2004). Fundamentals of matrix computations, ser. Pure and applied mathematics: A Wiley series of texts, monographs and tracts. New York: Wiley.
Xu, X., Li, J., Huang, X., Dalla Mura, M., & Plaza, A. (2016). Multiple morphological component analysis based decomposition for remote sensing image classification. IEEE Transactions on Geoscience and Remote Sensing, 54(5), 3083–3102.
Article Google Scholar
Yip, P., & Rao, K. (1988). The decimation-in-frequency algorithms for a family of discrete sine and cosine transforms. Circuits, Systems and Signal Processing, 7(1), 3–19.
Article MathSciNet MATH Google Scholar
Yuan, W., Hao, P., & Xu, C. (2006). Matrix factorization for fast DCT algorithms. In IEEE international conference on acoustic, speech, signal processing (ICASSP) (Vol. 3, pp. 948–951), May 2006.
Zeng, J., Cheung, G., Chao, Y.-H., Blanes, I., Serra-Sagristá, J., & Ortega, A. (2017). Hyperspectral image coding using graph wavelets. In Proceedings of the IEEE international conference on image processing (ICIP).

Download references

Acknowledgements

The authors acknowldege the partial support from Brazilian funding agencies CNPq and FACEPE.

Author information

Authors and Affiliations

Programa de Pós-Graduação em Engenharia Elétrica, Universidade Federal de Pernambuco (UFPE), Recife, Brazil
Raíza S. Oliveira
Signal Processing Group, Departamento de Estatística, Universidade Federal de Pernambuco (UFPE), Recife, Brazil
Raíza S. Oliveira, Renato J. Cintra & André Leite
ECE, University of Calgary, Calgary, AB, Canada
Renato J. Cintra
Departamento de Estatística, Universidade Federal de Santa Maria, Santa Maria, Brazil
Fábio M. Bayer
Programa de Pós-Graduação em Computação, Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
Thiago L. T. da Silveira
Department of Electrical and Computer Engineering, University of Akron, Akron, OH, USA
Arjuna Madanayake

Authors

Raíza S. Oliveira
View author publications
You can also search for this author in PubMed Google Scholar
Renato J. Cintra
View author publications
You can also search for this author in PubMed Google Scholar
Fábio M. Bayer
View author publications
You can also search for this author in PubMed Google Scholar
Thiago L. T. da Silveira
View author publications
You can also search for this author in PubMed Google Scholar
Arjuna Madanayake
View author publications
You can also search for this author in PubMed Google Scholar
André Leite
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Renato J. Cintra.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Oliveira, R.S., Cintra, R.J., Bayer, F.M. et al. Low-complexity 8-point DCT approximation based on angle similarity for image and video coding. Multidim Syst Sign Process 30, 1363–1394 (2019). https://doi.org/10.1007/s11045-018-0601-5

Download citation

Received: 23 March 2017
Revised: 15 June 2018
Accepted: 19 June 2018
Published: 20 July 2018
Issue Date: 01 July 2019
DOI: https://doi.org/10.1007/s11045-018-0601-5

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Low-complexity 8-point DCT approximation based on angle similarity for image and video coding

Abstract

Access this article

Similar content being viewed by others

Perceptual image quality assessment: a survey

Brief review of image denoising techniques

A robust digital image watermarking technique in LWT-DCT domain using particle swarm optimization and statistical distortion correction

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Low-complexity 8-point DCT approximation based on angle similarity for image and video coding

Abstract

Access this article

Similar content being viewed by others

Perceptual image quality assessment: a survey

Brief review of image denoising techniques

A robust digital image watermarking technique in LWT-DCT domain using particle swarm optimization and statistical distortion correction

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation