Abstract
This paper presents a framework to define an objective measure of the similarity (or dissimilarity) between two images for image processing. The problem is twofold: 1) define a set of features that capture the information contained in the image relevant for the given task and 2) define a similarity measure in this feature space.
In this paper, we propose a feature space as well as a statistical measure on this space. Our feature space is based on a global descriptor of the image in a multiscale transformed domain. After decomposition into a Laplacian pyramid, the coefficients are arranged in intrascale/ interscale/interchannel patches which reflect the dependencies between neighboring coefficients in presence of specific structures or textures. At each scale, the probability density function (pdf) of these patches is used as a descriptor of the relevant information. Because of the sparsity of the multiscale transform, the most significant patches, called Sparse Multiscale Patches (SMP), characterize efficiently these pdfs. We propose a statistical measure (the Kullback-Leibler divergence) based on the comparison of these probability density functions. Interestingly, this measure is estimated via the nonparametric, k-th nearest neighbor framework without explicitly building the pdfs.
This framework is applied to a query-by-example image retrieval task. Experiments on two publicly available databases showed the potential of our SMP approach. In particular, it performed comparably to a SIFT-based retrieval method and two versions of a fuzzy segmentation-based method (the UFM and CLUE methods), and it exhibited some robustness to different geometric and radiometric deformations of the images.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Deselaers, T., Keysers, D., Ney, H.: Features for image retrieval: An experimental comparison. Information Retrieval 11, 77–107 (2008)
Loupias, E., Sebe, N., Bres, S., Jolion, J.M.: Wavelet-based salient points for image retrieval. In: ICIP, vol. 2, pp. 518–521 (2000)
Swain, M., Ballard, D.: Color indexing. IJCV 7, 11–32 (1991)
Mikolajczyk, K., Schmid, C.: A performance evaluation of local descriptors. IEEE Trans. Pattern Anal. Mach. Intell. 27, 1615–1630 (2005)
Huang, J., Mumford, D.: Statistics of natural images and models. In: CVPR, Fort Collins, CO, USA, vol. 1, pp. 541–547 (1999)
Black, M., Anandan, P.: A framework for the robust estimation of optical flow. In: ICCV, Berlin, Germany, pp. 231–236 (1993)
Black, M.J., Anandan, P.: The robust estimation of multiple motions: parametric and piecewise-smooth flow fields. CVIU 63, 75–104 (1996)
Comaniciu, D., Ramesh, V., Meer, P.: Real-time tracking of non-rigid objects using mean shift. In: CVPR, vol. 2, pp. 142–149 (2000)
Viola, P., Wells, I., Wainwright, M.: Alignment by maximization of mutual information. IJCV 24, 137–154 (1997)
Bansal, R., Staib, L.H., Chen, Z., Rangarajan, A., Knisely, J., Nath, R., Duncan, J.S.: Entropy-based, multiple-portal-to-3dct registration for prostate radiotherapy using iteratively estimated segmentation. In: Taylor, C., Colchester, A. (eds.) MICCAI 1999. LNCS, vol. 1679, pp. 567–578. Springer, Heidelberg (1999)
Marques, O., Mayron, L.M., Borba, G.B., Gamba, H.R.: On the potential of incorporating knowledge of human visual attention into cbir systems. In: ICME, pp. 773–776 (2006)
Puzicha, J., Rubner, Y., Tomasi, C., Buhmann, J.M.: Empirical evaluation of dissimilarity measures for color and texture. In: ICCV, pp. 1165–1172 (1999)
Buades, A., Coll, B., Morel, J.M.: A review of image denoising algorithms, with a new one. Multiscale Modeling and Simulation 4, 490–530 (2005)
Awate, S.P., Whitaker, R.T.: Unsupervised, information-theoretic, adaptive image filtering for image restoration. IEEE Trans. on PAMI 28, 364–376 (2006)
Angelino, C.V., Debreuve, E., Barlaud, M.: Image restoration using a knn-variant of the mean-shift. In: ICIP, San Diego, USA (2008)
Portilla, J., Strela, V., Wainwright, M., Simoncelli, E.P.: Image denoising using a scale mixture of Gaussians in the wavelet domain. TIP 12, 1338–1351 (2003)
Do, M., Vetterli, M.: Wavelet based texture retrieval using generalized Gaussian density and Kullback-Leibler distance. TIP 11, 146–158 (2002)
Wang, Z., Wu, G., Sheikh, H.R., Simoncelli, E.P., Yang, E.H., Bovik, A.C.: Quality-aware images. TIP 15, 1680–1689 (2006)
Donoho, D.L., Johnstone, I.M.: Ideal spatial adaptation by wavelet shrinkage. Biometrika 81, 425–455 (1994)
Romberg, J.K., Choi, H., Baraniuk, R.G.: Bayesian tree-structured image modeling using wavelet-domain hidden markov models. TIP 10, 1056–1068 (2001)
Pierpaoli, E., Anthoine, S., Huffenberger, K., Daubechies, I.: Reconstructing sunyaev-zeldovich clusters in future cmb experiments. Mon. Not. Roy. Astron. Soc. 359, 261–271 (2005)
Burt, P.J., Adelson, E.H.: The Laplacian pyramid as a compact image code. IEEE Trans. Communications 31, 532–540 (1983)
Piro, P., Anthoine, S., Debreuve, E., Barlaud, M.: Image retrieval via kullback-leibler divergence of patches of multiscale coefficients in the knn framework. In: CBMI, London, UK (2008)
Nielsen, F., Boissonnat, J.D., Nock, R.: On bregman voronoi diagrams. In: SODA, pp. 746–755 (2007)
Nielsen, F., Nock, R.: On the smallest enclosing information disk. Inf. Process. Lett. 105, 93–97 (2008)
Boltz, S., Debreuve, E., Barlaud, M.: High-dimensional kullback-leibler distance for region-of-interest tracking: Application to combining a soft geometric constraint with radiometry. In: CVPR, Minneapolis, USA (2007)
Ahmad, I., Lin, P.E.: A nonparametric estimation of the entropy for absolutely continuous distributions. IEEE Trans. Inform. Theory 22, 372–375 (1976)
Terrell, G.R., Scott, D.W.: Variable kernel density estimation. The Annals of Statistics 20, 1236–1265 (1992)
Loftsgaarden, D., Quesenberry, C.: A nonparametric estimate of a multivariate density function. AMS 36, 1049–1051 (1965)
Goria, M., Leonenko, N., Mergel, V., Novi Inverardi, P.: A new class of random vector entropy estimators and its applications in testing statistical hypotheses. J. Nonparametr. Stat. 17, 277–298 (2005)
Lowe, D.: Distinctive image features from scale-invariant keypoints. In: IJCV, vol. 20, pp. 91–110 (2003)
Chen, Y., Wang, J.Z.: A region-based fuzzy feature matching approach to content-based image retrieval. TIP 24, 1252–1267 (2003)
Nistér, D., Stewénius, H.: Scalable recognition with a vocabulary tree. In: CVPR, vol. 2, pp. 2161–2168 (2006)
Chen, Y., Wang, J.Z., Krovetz, R.: Clue: Cluster-based retrieval of images by unsupervised learning. TIP 14, 1187–1201 (2005)
Lowe, D.: Sift keypoint detector, http://www.cs.ubc.ca/~lowe/keypoints/
Arya, S., Mount, D.M., Netanyahu, N.S., Silverman, R., Wu, A.Y.: An optimal algorithm for approximate nearest neighbor searching fixed dimensions. J. ACM 45, 891–923 (1998)
Garcia, V., Debreuve, E., Barlaud, M.: Fast k nearest neighbor search using GPU. In: CVPR Workshop on Computer Vision on GPU (2008)
ITU-T, JTC1, I.: Scalable video coding - joint draft (April 6, 2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Piro, P., Anthoine, S., Debreuve, E., Barlaud, M. (2009). Sparse Multiscale Patches for Image Processing. In: Nielsen, F. (eds) Emerging Trends in Visual Computing. ETVC 2008. Lecture Notes in Computer Science, vol 5416. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00826-9_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-00826-9_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00825-2
Online ISBN: 978-3-642-00826-9
eBook Packages: Computer ScienceComputer Science (R0)