Abstract
Currently, global-features-based image copy detection is vulnerable to geometric transformations like cropping, shift, and rotations. To resolve this problem, some algorithms based on local descriptors have been proposed. However, the local descriptors, which were originally designed for object recognition, are not suitable for copy detection because they cause the problems of false positives and ambiguities. Instead of relying on the local gradient statistic as many existing descriptors do, we propose a new invariant local descriptor based on local polar-mapping and discrete Fourier transform. Then based on this descriptor, we propose a new framework of copy detection, in which virtual prior attacks and attack weight are employed for training and selecting only a few robust features. This consequently improves the storage and detection efficiency. In addition, it is worth noting that the feature matching takes the locations and orientations of interest points into consideration, which increases the number of matched regions and improves the recall. Experimental results show that the new descriptor is more robust and distinctive, and the proposed copy detection scheme using this descriptor can substantially enhance the accuracy and recall of copy detection and lower the false positives and ambiguities.
Similar content being viewed by others
References
Sivic J, Zisserman A. Video Google: A text retrieval approach to object matching in videos. In Proc. IEEE International Conference on Computer Vision, Nice, France, Oct. 14–17, 2003, pp.1470–1477.
Indyk P, Motwani R. Approximate nearest neighbors: Towards removing the curse of dimensionality. In Proc. ACM Symp. Theory of Computing, Dallas, USA, May 23–26, 1998, pp.604–613.
Yuan J, Tian Q, Ranganath S. Fast and robust search method for short video clips from large video collection. In Proc. International Conference on Pattern Recognition, Cambridge, UK, Aug. 23–26, 2004, pp.866–869.
Tang H X, Wei H. A coarse-to-fine method for shape recognition. Journal of Computer Science and Technology, 2007, 22(2): 329–333.
Chang E Y, James Ze W, Chen L, Gio W. RIME: A replicated image detector for the World Wide Web. In Proc. SPIE Symposium of Voice, Video, and Data Communications, Boston, USA, Nov. 2–5, 1998, pp.58–67.
Kim C. Content-based image copy detection. Signal Processing: Image Communication, 2003, 18(3): 169–184.
Hsiao J H, Chen C S, Chien L F, Chen M S. A new approach to image copy detection based on extended feature sets. IEEE Transactions on Image Processing, 2007, 16(8): 2069–2079.
Amsaleg L, Gros P. Content-based retrieval using local descriptors: Problems and issues from a database perspective. Pattern Analysis and Applications, 2001, 4(2/3): 108–124.
Berrani S A, Amsaleg L, Gros P. Robust content-based image searches for copyright protection. In Proc. ACM International Workshop on Multimedia Databases (MMDB 2003), New Orleans, USA, Nov. 7, 2003, pp.70–77.
Ke Y, Sukthankar R, Huston L. Efficient near-duplicate detection and sub-image retrieval. In Proc. the 12th ACM International Conference on Multimedia (MM 2004), New York, USA, Oct. 10–16, 2004, pp.869–876.
Ke Y, Sukthankar R, Huston L. PCA-SIFT: A more distinctive representation for local image descriptors. In Proc. IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2004), Washington DC, USA, Jun. 27-Jul. 2, 2004, pp.506–513.
Foo J J, Sinha R. Pruning SIFT for scalable near-duplicate image matching. In Proc. the 18th Australasian Database Conference (ADC 2007), Ballarat, Australia, Jan. 29-Feb. 2, 2007, pp.63–71.
Lowe D G. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004, 60(2): 91–110.
Joly A, Frelicot C, Buisson O. Content-based video copy detection in large databases: A local fingerprints statistical similarity search approach. In Proc. IEEE International Conference on Image Processing, Genova, Italy, Sept. 11–14, 2005, pp.505–508.
Low-To J, Bussion O, Gouet-Brunet V, Boujemaa N. Robust voting algorithm based on labels of behavior for video copy detection. In Proc. ACM International Conference on Multimedia, Santa Barbara, USA, Oct. 23–27, 2006, pp.201–210.
Lu C S, Hsu C Y. Geometric distortion-resilient image hashing scheme and its applications on copy detection and authentication. Multimedia Systems, 2005, 11(2): 159–173.
Alexis J. New local descriptors based on dissociated dipoles. In Proc. ACM International Conference on Image and Video Retrieval, Amsterdam, The Netherlands, Jul. 9–11, 2007, pp.573–580.
Maani E, Tsaftaris S A, Katsaggelos A K. Local feature extraction for video copy detection in a database. In Proc. IEEE International Conference on Image Processing, San Diego, USA, Oct. 12–15, 2008, pp.1716–1719.
Mikolajczyk K, Schmid C. A performance evaluation of local descriptors. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2005, 27(10): 1615–1630.
Mortensen E N, Deng H, Shapiro L. A SIFT descriptor with global context. In Proc. the IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2005), Piscataway, USA, Jun. 20–26, 2005, pp.184–190.
Sheng Y, Arsenault H H. Experiments on the pattern recognition using invariant Fourier-Mellin descriptors. Journal of the Optical Society of America A: Optics, Image Science, and Vision, 1986, 3(6): 771–776.
Mavandadi S, Aarabi P, Plataniotis K N. Fourier-based rotation invariant image features. In Proc. IEEE International Conference on Image Processing, Cairo, Egypt, Nov. 7–10, 2009, pp.2041–2044.
Choksuriwong A, Laurent H, Emile B. Comparison of invariant descriptors for object recognition. In Proc. IEEE International Conference on Image Processing, Genova, Italy, Sept. 11–14, 2005, pp.377–380.
Ekinci M, Aykut M. Human gait recognition based on kernel PCA using projections. Journal of Computer Science and Technology, 2007, 22(6): 867–876.
Tang C W, Hang H M. A feature-based robust digital image watermarking scheme. IEEE Transactions on Signal Processing, 2003, 51(4): 950–959.
Schwartz E L. Computational anatomy and functional architecture of striate cortex: A spatial mapping approach to perceptual coding. Vision Research, 1980, 20(8): 645–669.
Fox P D, Cheng J, Lu J. Theory and experiment of Fourier-Bessel field calculation and tuning of a pulsed wave annular array. Journal of the Acoustical Society of America, 2003, 113(5): 2412–2423.
Smeraldi F, Bigun J. Retinal vision applied to facial features detection and face authentication. Pattern Recognition Letters, 2002, 23(4): 463–475.
Zana Y, Cesar R M. Face recognition based on polar frequency features. ACM Transactions on Applied Perception, 2006, 3(1): 62–82.
Mikolajczyk K, Schmid C. Scale and affine invariant interest point detectors. International Journal of Computer Vision, 2004, 60(1): 63–86.
Mikolajczyk K. Binaries for affine covariant region descriptors. http://www.robots.ox.ac.uk/~vgg/research/affine/, 2007.
Petitcolas F A P. Watermarking schemes evaluation. IEEE Signal Processing Magazine, 2000, 17(5): 58–64.
Author information
Authors and Affiliations
Corresponding author
Additional information
Supported by the National Natural Science Foundation of China under Grant Nos. 60873226, 60803112, the National High Technology Research and Development 863 Program of China under Grant No. 2009AA01Z411.
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Ling, HF., Wang, LY., Yan, LY. et al. PM-DFT: A New Local Invariant Descriptor Towards Image Copy Detection. J. Comput. Sci. Technol. 26, 558–567 (2011). https://doi.org/10.1007/s11390-011-1155-2
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-011-1155-2