Skip to main content

A New Spatio-temporal JND Model based on 3D Pyramid Decomposition

  • Conference paper
Advances in Multimedia Information Processing - PCM 2010 (PCM 2010)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6298))

Included in the following conference series:

Abstract

In this paper, a new Pyramidal Just-Noticeable-Distortion (PJND) model is proposed for video. This model incorporates the most relevant HVS properties such as: the spatio-temporal contrast sensitivity function, the influence of eye movements, the contrast masking effect and the saliency masking effect. The video sequence is first analyzed into multi-scales representation using the Laplacian pyramid decomposition (extended for 3D case). For each level, a sub JND threshold is then derived providing a global pyramidal JND map. To this end, we consider, for the first time, the influence of visual attention when designing JND model. For free viewing condition, human visual cortex is driven by a bottom-up mechanism so that it is attended only by selective salient regions. By this way, salient regions tend to mask non-salient regions. JND threshold is hence modulated by two masking mechanisms: contrast masking and “saliency masking”. Recent JND models do not take into account this phenomenon and therefore do not completely exploit human visual system (HVS) limitation. Intensive experiments are carried out to demonstrate the proposed model’s performance. Evaluation is performed in terms of distortion tolerance as well as perceptual transparency by means of PSNR and subjective tests. Compared to other state-of-art JNDs, our proposed PJND better exploits HVS properties by the fact that it can tolerate much more distortion while maintaining a good level of perceptual quality.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hubel, D.H., Wiesel, T.N.: Receptive fields of single neurons in the cat’s striate cortex. Journal of Physiology 148, 574–591 (1959)

    Google Scholar 

  2. Watson, A.B.: DCT quantization matrices visually optimized for individual images. In: Proc. SPIE Int. Conf. Human Version, Visual Processing, and Digital Display - IV, pp. 202–216 (1993)

    Google Scholar 

  3. Ahumada, A.J., Peterson, H.A.: Luminance-Model-Based DCT Quantization for Color Image Compression. In: Proc. of SPIE, vol. 1666, pp. 365–374 (1992)

    Google Scholar 

  4. Chou, C.H., Li, Y.C.: A perceptually Tuned Subband Image Coder Based on the Measure of Just-Noticeable-Distortion Profile. IEEE Transaction on Circuits and Systems for Video Technology 5(6), 467–476 (1995)

    Article  Google Scholar 

  5. Zhang, X., Lin, W., Xue, P.: Improved Estimation for Just-noticeable Visual Distortion. Signal Processing 85(4), 795–808 (2005)

    Article  MATH  Google Scholar 

  6. Jia, Y., Lin, W., Kassim, A.A.: Estimating Just-Noticeable Distortion for Video. IEEE Trans. on Circuits and Systems for Video Tech. 16(7), 820–829 (2006)

    Article  Google Scholar 

  7. Wei, Z., Ngan, K.N.: Spatio-Temporal Just Noticeable Distortion Profile for Grey Scale Image/Video in DCT Domain. IEEE Trans. on Circuits and Systems for Video Tech. 19(3), 337–346 (2009)

    Article  Google Scholar 

  8. Burt, P.J., Adelson, E.H.: The Laplacian Pyramid as a Compact Image Code. IEEE Transactions on Communications, 532–540 (April 1983)

    Google Scholar 

  9. Robson, J.G.: Spatial and Temporal Contrast Sensitivity Functions of the Visual System. Journal of Optical Society of America 56, 1411–1412 (1966)

    Article  Google Scholar 

  10. Kelly, D.H.: Motion and vision. II. Stabilized spatio-temporal threshold surface. Journal of Opt. Soc. Amer. 69, 1340–1349 (1979)

    Article  Google Scholar 

  11. Daly, S.J.: Engineering Observations from Spatiovelocity and Spatiotemporal Visual Models. In: IS&T/SPIE Conference on Human Vision and Electronic Imaging III, San Jose, California, vol. 3299, pp. 180–191 (January 1998)

    Google Scholar 

  12. Lucas, B.D., Kanade, T.: An iterative image registration technique with an application to stereo vision. In: Proceedings of Imaging Understanding Workshop, pp. 121–130 (1981)

    Google Scholar 

  13. Nguyen, P.B., Beghdadi, A., Luong, M.: Perceptual watermarking using pyramidal JND maps. In: Proc. of 10th IEEE International Symposium on Multimedia, Berkeley, CA, USA, pp. 418–423 (2008)

    Google Scholar 

  14. Nguyen, P.B., Beghdadi, A., Luong, M.: Robust Watermarking in DoG Scale Space Using a Multi-scale JND Model. In: Muneesawang, P., Wu, F., Kumazawa, I., Roeksabutr, A., Liao, M., Tang, X. (eds.) PCM 2009. LNCS, vol. 5879, pp. 561–573. Springer, Heidelberg (2009)

    Chapter  Google Scholar 

  15. Lubin, J.: A Visual Discrimination Model for Imaging System Design and Evaluation. In: Peli, E. (ed.) Vision Models for Target Detection and Recognition, pp. 245–283 (1995)

    Google Scholar 

  16. Nguyen, P.B., Beghdadi, A., Luong, M.: Perceptual watermarking using a multi-scale JNC Model. In: Nguyen, N.T., Le, M.T., Świątek, J. (eds.) ACIIDS. LNCS, vol. 5991, pp. 471–480. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  17. Legge, G.E., Foley, J.M.: Contrast Masking in Human Vision. Journal of the Optical Society of America 70, 1458–1471 (1980)

    Article  Google Scholar 

  18. Solomon, J.A., Watson, A.B., Ahumada, A.: Visibility of DCT basis functions: Effects of contrast masking. In: Proceedings of Data Compression Conference, pp. 361–370 (1994)

    Google Scholar 

  19. Harel, J., Koch, C., Perona, P.: Graph-based visual saliency. In: Advances in Neural Information Processing Systems, vol. 19, pp. 545–552. MIT Press, Cambridge (2007)

    Google Scholar 

  20. Marszałek, M., Laptev, I., Schmid, C.: Actions in Context. In: IEEE Conference on Computer Vision & Pattern Recognition (2009)

    Google Scholar 

  21. ITU-R BT.500-11 Recommendation: Methodology for the Subjective Assessment of the Quality of Television Pictures (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Nguyen, PB., Beghdadi, A., Luong, M. (2010). A New Spatio-temporal JND Model based on 3D Pyramid Decomposition. In: Qiu, G., Lam, K.M., Kiya, H., Xue, XY., Kuo, CC.J., Lew, M.S. (eds) Advances in Multimedia Information Processing - PCM 2010. PCM 2010. Lecture Notes in Computer Science, vol 6298. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15696-0_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-15696-0_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-15695-3

  • Online ISBN: 978-3-642-15696-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics