Skip to main content
Log in

Low-complexity 8-point DCT approximation based on angle similarity for image and video coding

  • Published:
Multidimensional Systems and Signal Processing Aims and scope Submit manuscript

Abstract

The principal component analysis (PCA) is widely used for data decorrelation and dimensionality reduction. However, the use of PCA may be impractical in real-time applications, or in situations were energy and computing constraints are severe. In this context, the discrete cosine transform (DCT) becomes a low-cost alternative to data decorrelation. This paper presents a method to derive computationally efficient approximations to the DCT. The proposed method aims at the minimization of the angle between the rows of the exact DCT matrix and the rows of the approximated transformation matrix. The resulting transformations matrices are orthogonal and have extremely low arithmetic complexity. Considering popular performance measures, one of the proposed transformation matrices outperforms the best competitors in both matrix error and coding capabilities. Practical applications in image and video coding demonstrate the relevance of the proposed transformation. In fact, we show that the proposed approximate DCT can outperform the exact DCT for image encoding under certain compression ratios. The proposed transform and its direct competitors are also physically realized as digital prototype circuits using FPGA technology.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  • Ahmed, N., Natarajan, T., & Rao, K. R. (1974). Discrete cosine transform. IEEE Transactions on Computers, C-23(1), 90–93.

  • Álvarez-Cortés, S., Amrani, N., Hernández-Cabronero, M., & Serra-Sagristà, J. (2017). Progressive lossy-to-lossless coding of hyperspectral images through regression wavelet analysis. International Journal of Remote Sensing, 39, 1–21.

    Google Scholar 

  • Arai, Y., Agui, T., & Nakajima, M. (1988). A fast DCT-SQ scheme for images. Transactions of the IEICE, E-71(11), 1095–1097.

  • Bae, J., & Yoo, H. (2017). Analysis of color transforms for lossless frame memory compression. International Journal of Applied Engineering Research, 12(24), 15664–15667.

    Google Scholar 

  • Bayer, F. M., & Cintra, R. J. (2010). Image compression via a fast DCT approximation. IEEE Latin America Transactions, 8(6), 708–713.

    Article  Google Scholar 

  • Bayer, F. M., & Cintra, R. J. (2012). DCT-like transform for image compression requires 14 additions only. Electronics Letters, 48(15), 919–921.

    Article  Google Scholar 

  • Bayer, F. M., Cintra, R. J., Edirisuriya, A., & Madanayake, A. (2012). A digital hardware fast algorithm and FPGA-based prototype for a novel 16-point approximate DCT for image compression applications. Measurement Science and Technology, 23(8), 114010.

    Article  Google Scholar 

  • Bjøntegaard, G. (2001). Calculation of average PSNR differences between RD-curves. In 13th VCEG Meeting, Austin, TX, USA, Apr 2001, document VCEG-M33.

  • Blahut, R. E. (2010). Fast algorithms for signal processing. Cambridge: Cambridge University Press.

    Book  MATH  Google Scholar 

  • Bossen, F. (2013). Common test conditions and software reference configurations, San Jose, CA, USA, Feb 2013, document JCT-VC L1100.

  • Bouguezel, S., Ahmad, M. O., & Swamy, M. N. S. (2008). Low-complexity \(8\times 8\) transform for image compression. Electronics Letters, 44(21), 1249–1250.

  • Bouguezel, S., Ahmad, M. O., & Swamy, M. N. S. (2009). A fast \(8\times 8\) transform for image compression. In 2009 international conference on microelectronics (ICM) (pp. 74–77), Dec 2009.

  • Bouguezel, S., Ahmad, M. O., & Swamy, M. N. S. (2011). A low-complexity parametric transform for image compression. In Proceedings of the 2011 IEEE international symposium on circuits and systems, May 2011.

  • Bouguezel, S., Ahmad, M. O., & Swamy, M. N. S. (2013). Binary discrete cosine and Hartley transforms. IEEE Transactions on Circuits and Systems I: Regular Papers, 60(4), 989–1002.

    Article  MathSciNet  Google Scholar 

  • Britanak, V., Yip, P., & Rao, K. R. (2007). Discrete cosine and sine transforms. New York: Academic Press.

    Google Scholar 

  • Cham, W. K. (1989). Development of integer cosine transforms by the principle of dyadic symmetry. IEE Proceedings I Communications, Speech and Vision, 136(4), 276–282.

    Article  Google Scholar 

  • Chan, R. K. W., & Lee, M.-C. (2006). Multiplierless fast DCT algorithms with minimal approximation errors. In International conference on pattern recognition (Vol. 3, pp. 921–925). Los Alamitos, CA, USA: IEEE Computer Society.

  • Chen, W. H., Smith, C., & Fralick, S. (1977). A fast computational algorithm for the discrete cosine transform. IEEE Transactions on Communications, 25(9), 1004–1009.

    Article  MATH  Google Scholar 

  • Chen, H., & Zeng, B. (2012). New transforms tightly bounded by DCT and KLT. IEEE Signal Processing Letters, 19(6), 344–347.

    Article  Google Scholar 

  • Choi, K., Lee, S., & Jang, E. S. (2010). Zero coefficient-aware IDCT algorithm for fast video decoding. IEEE Transactions on Consumer Electronics, 56(3), 1822–1829.

    Article  Google Scholar 

  • Cintra, R. J. (2011). An integer approximation method for discrete sinusoidal transforms. Journal of Circuits, Systems, and Signal Processing, 30(6), 1481–1501.

    Article  MathSciNet  MATH  Google Scholar 

  • Cintra, R. J., & Bayer, F. M. (2011). A DCT approximation for image compression. IEEE Signal Processing Letters, 18(10), 579–582.

    Article  Google Scholar 

  • Cintra, R. J., Bayer, F. M., & Tablada, C. J. (2014). Low-complexity 8-point DCT approximations based on integer functions. Signal Processing, 99, 201–214.

    Article  Google Scholar 

  • Clarke, R. J. (1981). Relation between the Karhunen-Loève and cosine transforms. IEEE Proceedings F Communications, Radar and Signal Processing, 128(6), 359–360.

    Article  Google Scholar 

  • Cormen, T., Leiserson, C., Rivest, R., & Stein, C. (2001). Introduction to algorithms, chapter 16. Cambridge: MIT Press.

    MATH  Google Scholar 

  • Coutinho, V. A., Cintra, R. J., Bayer, F. M., Kulasekera, S., & Madanayake, A. (2015). A multiplierless pruned DCT-like transformation for image and video compression that requires ten additions only. Journal of Real-Time Image Processing, 12, 1–9.

    Google Scholar 

  • Dimitrov, V., Jullien, G., & Miller, W. (1998). A new DCT algorithm based on encoding algebraic integers. In Proceedings of the 1998 IEEE international conference on acoustics, speech and signal processing, 1998 (Vol. 3, pp. 1377–1380 ), May 1998.

  • Dunteman, G. H. (1989). Principal components analysis (Vol. 69). Beverly Hills: Sage.

    Book  Google Scholar 

  • Feig, E., & Winograd, S. (1992). Fast algorithms for the discrete cosine transform. IEEE Transactions on Signal Processing, 40(9), 2174–2193.

    Article  MATH  Google Scholar 

  • Fong, C.-K., & Cham, W.-K. (2012). LLM integer cosine transform and its fast algorithm. IEEE Transactions on Circuits and Systems for Video Technology, 22(6), 844–854.

    Article  Google Scholar 

  • Gonzalez, R. C., & Woods, R. E. (2006). Digital image processing (3rd ed.). Upper Saddle River, NJ: Prentice-Hall Inc.

    Google Scholar 

  • Gorban, A. N., Kgl, B., Wunsch, D. C., & Zinovyev, A. (2007). Principal manifolds for data visualization and dimension reduction (1st ed.). Springer Publishing Company, Incorporated.

  • Goyal, V. K. (2001). Theoretical foundations of transform coding. IEEE Signal Processing Magazine, 18(5), 9–21.

    Article  Google Scholar 

  • Han, J., Xu, Y., & Mukherjee, D. (2013). A butterfly structured design of the hybrid transform coding scheme. In Picture coding symposium (PCS), 2013 (pp. 17–20). IEEE.

  • Hanhart, P., & Ebrahimi, T. (2014). Calculation of average coding efficiency based on subjective quality scores. Journal of Visual Communication and Image Representation, 25(3), 555–564. qoE in 2D/3D Video Systems.

    Article  Google Scholar 

  • Haweel, T. I. (2001). A new square wave transform based on the DCT. Signal Processing, 82, 2309–2319.

    Article  MATH  Google Scholar 

  • Heideman, M. T., & Burrus, C. S. (1988). Multiplicative complexity, convolution, and the DFT, ser. Signal processing and digital filtering. Berlin: Springer.

  • Higham, N. J. (1986). Computing the polar decomposition—With applications. SIAM Journal on Scientific and Statistical Computing, 7(4), 1160–1174.

    Article  MathSciNet  MATH  Google Scholar 

  • Higham, N. J. (1987). Computing real square roots of a real matrix. Linear Algebra and Its Applications, 88–89, 405–430.

    Article  MathSciNet  MATH  Google Scholar 

  • Higham, N. J. (2008). Functions of matrices: Theory and computation, ser. SIAM e-books. Society for Industrial and Applied Mathematics (SIAM, 3600 Market Street, Floor 6, Philadelphia, PA 19104).

  • Higham, N. J., & Schreiber, R. S. (1988). Fast polar decomposition of an arbitrary matrix, Ithaca, NY, USA, Tech. Rep., October 1988.

  • Hou, H. S. (1987). A fast recursive algorithm for computing the discrete cosine transform. IEEE Transactions on Acoustic, Signal, and Speech Processing, 6(10), 1455–1461.

    Google Scholar 

  • International Telecommunication Union. (1990). ITU-T recommendation H.261 version 1: Video codec for audiovisual services at \(p \times 64\) kbits, ITU-T, Tech. Rep.

  • International Telecommunication Union. (1995). ITU-T recommendation H.263 version 1: Video coding for low bit rate communication, ITU-T, Tech. Rep.

  • Jammalamadaka, S., & Sengupta, A. (2001). Topics in circular statistics, ser. Series on multivariate analysis. Singapore: World Scientific.

  • Joint Collaborative Team on Video Coding (JCT-VC), “HEVC reference software documentation”. (2013). Fraunhofer Heinrich Hertz Institute. [Online]. Available: https://hevc.hhi.fraunhofer.de/. Accessed 19 Sept 2016.

  • Jolliffe, I. (2002). Principal component analysis. New York: Wiley Online Library.

    MATH  Google Scholar 

  • Jridi, M., Alfalou, A., & Meher, P. K. (2015). A generalized algorithm and reconfigurable architecture for efficient and scalable orthogonal approximation of DCT. IEEE Transactions on Circuits and Systems I: Regular Papers, 62(2), 449–457.

    Article  Google Scholar 

  • Katto, J., & Yasuda, Y. (1991). Performance evaluation of subband coding and optimization of its filter coefficients. Journal of Visual Communication and Image Representation, 2(4), 303–313.

    Article  Google Scholar 

  • Le Gall, D. J. (1992). The MPEG video compression algorithm. Signal Processing: Image Communication, 4(2), 129–140.

    Google Scholar 

  • Lee, B. G. (1984). A new algorithm for computing the discrete cosine transform. IEEE Transactions on Acoustics, Speech and Signal Processing, ASSP-32, 1243–1245.

  • Lengwehasatit, K., & Ortega, A. (2004). Scalable variable complexity approximate forward DCT. IEEE Transactions on Circuits and Systems for Video Technology, 14(11), 1236–1248.

    Article  Google Scholar 

  • Liang, J., & Tran, T. D. (2001). Fast multiplierless approximation of the DCT with the lifting scheme. IEEE Transactions on Signal Processing, 49, 3032–3044.

    Article  Google Scholar 

  • Loeffler, C., Ligtenberg, A., & Moschytz, G. (1989). Practical fast 1D DCT algorithms with 11 multiplications. In Proceedings of the international conference on acoustics, speech, and signal processing (pp. 988–991), May 1989.

  • Luthra, A., Sullivan, G. J., & Wiegand, T. (2003). Introduction to the special issue on the H.264/AVC video coding standard. IEEE Transactions on Circuits and Systems for Video Technology, 13(7), 557–559.

    Article  Google Scholar 

  • Mahan, R. P. (1991) Circular statistical methods: Applications in spatial and temporal performance analysis, ser. Special report. U.S. Army Research Institute for the Behavioral and Social Sciences.

  • Mardia, K., & Jupp, P. (2009). Directional statistics, ser. Wiley series in probability and statistics. New York: Wiley.

  • Masera, M., Martina, M., & Masera, G. (2017a). Odd type DCT/DST for video coding: Relationships and low-complexity implementations. In 2017 IEEE International Workshop on Signal Processing Systems (SiPS), pp. 1–6. IEEE.

  • Masera, M., Martina, M., & Masera, G. (2017b). Adaptive approximated dct architectures for HEVC. IEEE Transactions on Circuits and Systems for Video Technology, 27(12), 2714–2725.

    Google Scholar 

  • Meher, P. K., Park, S. Y., Mohanty, B. K., Lim, K. S., & Yeo, C. (2014). Efficient integer DCT architectures for HEVC. IEEE Transactions on Circuits and Systems for Video Technology, 24(1), 168–178.

    Article  Google Scholar 

  • Ohm, J.-R., Sullivan, G. J., Schwarz, H., Tan, T. K., & Wiegand, T. (2012). Comparison of the coding efficiency of video coding standards—Including high efficiency video coding (HEVC). IEEE Transactions on Circuits and Systems for Video Technology, 22(12), 1669–1684.

    Article  Google Scholar 

  • Pao, I.-M., & Sun, M.-T. (1998). Approximation of calculations for forward discrete cosine transform. IEEE Transactions on Circuits and Systems for Video Technology, 8(3), 264–268.

    Article  Google Scholar 

  • Park, J.-S., Nam, W.-J., Han, S.-M., & Lee, S.-S. (2012). 2-D large inverse transform (\(16\times 16\), \(32\times 32\)) for HEVC (high efficiency video coding). JSTS: Journal of Semiconductor Technology and Science, 12(2), 203–211.

    Article  Google Scholar 

  • Potluri, U. S., Madanayake, A., Cintra, R. J., Bayer, F. M., Kulasekera, S., & Edirisuriya, A. (2014). Improved 8-point approximate DCT for image and video compression requiring only 14 additions. IEEE Transactions on Circuits and Systems I: Regular Papers, 61(6), 1727–1740.

    Article  Google Scholar 

  • Pourazad, M. T., Doutre, C., Azimi, M., & Nasiopoulos, P. (2012). HEVC: The new gold standard for video compression: How does HEVC compare with H.264/AVC? IEEE Consumer Electronics Magazine, 1(3), 36–46.

    Article  Google Scholar 

  • Puri, A. (2004). Video coding using the H.264/MPEG-4 AVC compression standard. Signal Processing: Image Communication, 19, 793–849.

    Google Scholar 

  • Rao, K. R., & Yip, P. (1990). Discrete cosine transform: Algorithms, advantages, applications. San Diego, CA: Academic Press.

    Book  MATH  Google Scholar 

  • Salomon, D., Motta, G., & Bryant, D. (2007). Data compression: The complete reference, ser. Molecular biology intelligence unit. Berlin: Springer.

  • Seber, G. A. F. (2008). A matrix handbook for statisticians, ser. Wiley series in probability and mathematical statistics. New York: Wiley.

  • Senapati, R. K., Pati, U. C., & Mahapatra, K. K. (2010). A low complexity orthogonal \(8\times 8\) transform matrix for fast image compression. In Proceeding of the annual IEEE India conference (INDICON), Kolkata, India (pp. 1–4).

  • Snigdha, F. S., Sengupta, D., Hu, J., & Sapatnekar, S. S. (2016). Optimal design of JPEG hardware under the approximate computing paradigm. In Proceedings of the 53rd annual design automation conference (p. 106). ACM.

  • Strang, G. (1988). Linear algebra and its applications. Belmont: Brooks Cole.

    MATH  Google Scholar 

  • Sullivan, G. J., Ohm, J.-R., Han, W.-J., & Wiegand, T. (2012). Overview of the high efficiency video coding (HEVC) standard. IEEE Transactions on Circuits Systems for Video Technology, 22(12), 1649–1668.

    Article  Google Scholar 

  • Suzuki, T., & Ikehara, M. (2010). Integer DCT based on direct-lifting of DCT-IDCT for lossless-to-lossy image coding. IEEE Transactions on Image Processing, 19(11), 2958–2965.

    Article  MathSciNet  MATH  Google Scholar 

  • Tablada, C. J., Bayer, F. M., & Cintra, R. J. (2015). A class of DCT approximations based on the Feig–Winograd algorithm. Signal Processing, 113, 38–51.

    Article  Google Scholar 

  • Thomakos, D. (2016). Smoothing non-stationary time series using the discrete cosine transform. Journal of Systems Science and Complexity, 29(2), 382–404.

    Article  MathSciNet  MATH  Google Scholar 

  • USC-SIPI Image Database. (2017). University of Southern California. [Online]. Available: http://sipi.usc.edu/database/. Accessed 15 Sept 2016.

  • Wallace, G. K. (1992). The JPEG still picture compression standard. IEEE Transactions on Consumer Electronics, 38(1), xviii–xxxiv.

  • Wang, Z. (2011). Combined DCT and companding for PAPR reduction in OFDM signals. Journal of Signal and Information Processing, 2(2), 100–104.

    Article  Google Scholar 

  • Wang, Z., & Bovik, A. C. (2009). Mean squared error: Love it or leave it? A new look at signal fidelity measures. IEEE Signal Processing Magazine, 26(1), 98–117.

    Article  Google Scholar 

  • Wang, Z., Bovik, A. C., Sheikh, H. R., & Simoncelli, E. P. (2004). Image quality assessment: From error visibility to structural similarity. IEEE Transactions on Image Processing, 13(4), 600–612.

    Article  Google Scholar 

  • Watkins, D. S. (2004). Fundamentals of matrix computations, ser. Pure and applied mathematics: A Wiley series of texts, monographs and tracts. New York: Wiley.

  • Xu, X., Li, J., Huang, X., Dalla Mura, M., & Plaza, A. (2016). Multiple morphological component analysis based decomposition for remote sensing image classification. IEEE Transactions on Geoscience and Remote Sensing, 54(5), 3083–3102.

    Article  Google Scholar 

  • Yip, P., & Rao, K. (1988). The decimation-in-frequency algorithms for a family of discrete sine and cosine transforms. Circuits, Systems and Signal Processing, 7(1), 3–19.

    Article  MathSciNet  MATH  Google Scholar 

  • Yuan, W., Hao, P., & Xu, C. (2006). Matrix factorization for fast DCT algorithms. In IEEE international conference on acoustic, speech, signal processing (ICASSP) (Vol. 3, pp. 948–951), May 2006.

  • Zeng, J., Cheung, G., Chao, Y.-H., Blanes, I., Serra-Sagristá, J., & Ortega, A. (2017). Hyperspectral image coding using graph wavelets. In Proceedings of the IEEE international conference on image processing (ICIP).

Download references

Acknowledgements

The authors acknowldege the partial support from Brazilian funding agencies CNPq and FACEPE.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Renato J. Cintra.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Oliveira, R.S., Cintra, R.J., Bayer, F.M. et al. Low-complexity 8-point DCT approximation based on angle similarity for image and video coding. Multidim Syst Sign Process 30, 1363–1394 (2019). https://doi.org/10.1007/s11045-018-0601-5

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11045-018-0601-5

Keywords

Navigation