Abstract
In this paper, a new pyramidal just-noticeable-distortion (PJND) model is proposed for video. The model incorporates the most relevant properties of the human visual system (HVS): the spatio-temporal contrast sensitivity function, the influence of eye movements, the contrast masking effect, and the saliency masking effect. The video sequence is first decomposed into a multi-scale representation using the Laplacian pyramid decomposition, extended to the 3D case. For each level, a sub-JND threshold is then derived, yielding a global pyramidal JND map. To this end, we consider, for the first time, the influence of visual attention in the design of a JND model. Under free-viewing conditions, the human visual cortex is driven by a bottom-up mechanism, so that it attends only to selected salient regions. In this way, salient regions tend to mask non-salient regions. The JND threshold is hence modulated by two masking mechanisms: contrast masking and “saliency masking”. Recent JND models do not take this phenomenon into account and therefore do not fully exploit the limitations of the HVS. Extensive experiments are carried out to demonstrate the performance of the proposed model. Evaluation is performed in terms of distortion tolerance and perceptual transparency, by means of PSNR and subjective tests. Compared to other state-of-the-art JND models, the proposed PJND better exploits HVS properties: it tolerates much more distortion while maintaining a good level of perceptual quality.
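The distortion-tolerance criterion mentioned above can be illustrated with a minimal sketch. It assumes the noise-injection protocol commonly used to compare JND models, in which each pixel is perturbed by plus or minus its JND threshold and the PSNR of the result is measured; the abstract does not state that the paper uses exactly this procedure. The function names, the `scale` parameter, and the constant toy JND maps are hypothetical, and a single grayscale frame stands in for a video sequence.

```python
import math
import random


def inject_jnd_noise(frame, jnd, scale=1.0, seed=0):
    """Perturb each pixel by +/- scale * JND and clip to [0, 255].

    Assumed protocol (not taken from the paper): random-sign noise
    whose amplitude is shaped by the JND map.
    """
    rng = random.Random(seed)
    noisy = []
    for row, jnd_row in zip(frame, jnd):
        noisy.append([
            min(255.0, max(0.0, p + scale * rng.choice((-1.0, 1.0)) * t))
            for p, t in zip(row, jnd_row)
        ])
    return noisy


def psnr(reference, distorted, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized frames."""
    squared_error = 0.0
    count = 0
    for ref_row, dist_row in zip(reference, distorted):
        for r, d in zip(ref_row, dist_row):
            squared_error += (r - d) ** 2
            count += 1
    mse = squared_error / count
    return math.inf if mse == 0 else 10.0 * math.log10(peak ** 2 / mse)


# Toy data: a flat mid-grey frame and two hypothetical constant JND maps.
frame = [[128.0] * 8 for _ in range(8)]
jnd_small = [[2.0] * 8 for _ in range(8)]
jnd_large = [[6.0] * 8 for _ in range(8)]

psnr_small = psnr(frame, inject_jnd_noise(frame, jnd_small))
psnr_large = psnr(frame, inject_jnd_noise(frame, jnd_large))
print(f"PSNR with small thresholds: {psnr_small:.2f} dB")
print(f"PSNR with large thresholds: {psnr_large:.2f} dB")
```

Under this protocol, a model with larger thresholds yields a lower PSNR. The lower PSNR only counts in the model's favour if subjective tests confirm that the injected noise remains imperceptible, which is why the abstract pairs PSNR with subjective evaluation.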
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Nguyen, PB., Beghdadi, A., Luong, M. (2010). A New Spatio-temporal JND Model based on 3D Pyramid Decomposition. In: Qiu, G., Lam, K.M., Kiya, H., Xue, XY., Kuo, CC.J., Lew, M.S. (eds) Advances in Multimedia Information Processing - PCM 2010. PCM 2010. Lecture Notes in Computer Science, vol 6298. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15696-0_19
DOI: https://doi.org/10.1007/978-3-642-15696-0_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15695-3
Online ISBN: 978-3-642-15696-0
eBook Packages: Computer Science, Computer Science (R0)