Mining near duplicate image groups

Li, Jing; Qian, Xueming; Li, Qing; Zhao, Yisi; Wang, Liejun; Tang, Yuan Yan

doi:10.1007/s11042-014-2008-0

Mining near duplicate image groups

Published: 02 May 2014

Volume 74, pages 655–669, (2015)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Jing Li¹,
Xueming Qian¹,
Qing Li¹,
Yisi Zhao¹,
Liejun Wang² &
…
Yuan Yan Tang³

450 Accesses
17 Citations
Explore all metrics

Abstract

Most recently the social media sharing websites such as Flickr, Facebook, and Picasa have allowed users to share their personal photos with friends. Moreover, people like to follow, forward their favorite images, which is one of the main source of near duplicate images. And also, the worldwide place of interests such as Roma, Statue of Liberty and London Tower Bridge etc., attract world-wide visitors. For these places, travelers take photos, write travelogues and share them with their social friends. The photos taken from the same place with or without viewpoint variations are near duplicate images. How to detect them is an ad-hoc problem in the area of image analysis and multimedia processing. The existing near duplicate image processing approaches mainly focused on finding the near duplicate images for a given input image, where a query image is needed. However, how to find the near duplicate image groups (NDIG) automatically from the web-scale social images is very challenging. So, in this paper, instead of searching near duplicates image for certain input image, we proposed an automatic NDIG mining approach by utilizing adaptive global feature clustering and local feature refinement. The proposed NDIG mining approach is achieved by utilizing a hierarchical model. It is a two-layer hierarchical structure by first utilizing adaptive global feature clustering based candidate NDIG detection and then using local feature refinement based NDIG verification. The global clustering is mainly for reducing computational cost for processing the large scale image set. The local refinement is for improving NDIG detection performances. Experiments on four datasets show the effectiveness of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Learning with Noisy Correspondence

Article 13 April 2024

Image Generation: A Review

Article 11 March 2022

Open Set Recognition in Real World

Article 07 March 2024

References

Battiato S, Farinella GM, Guarnera GC (2010) Bags of phrases with codebooks alignment for near duplicate image detection, MiFOR’10, October 29, 2010, Firenze, Italy
Chum J, Isard O, Sivic M, Zisserman JA (2007) Object retrieval with large vocabularies and fast spatial matching. Comput Vis Pattern Recognit, CVPR ‘07
Gao Y, Wang M, Tian Q (2011) Less is more: efficient 3-D object retrieval with query view selection. IEEE Trans Multimed 13(5)
Gao Y, Wang M, Shen J (2013) Visual-textual joint relevance learning for tag-based social image search. IEEE Trans Image Process 22(1)
Han Y, Xu Z, Ma Z, Huang Z (2013) Image classification with manifold learning forout-of-sample data. Sig Process 93(8), August
Han Y, Yang Y, Ma Z, Shen H, Sebe N, Zhou X (2014) Image attribute adaptation. IEEE Trans Multimed
Hu Y, Cheng X, Chia LT, Xie X (2009) Coherent phrase model for efficient image near-duplicate retrieval. IEEE Trans Multimed 11(8), DECEMBER, 2009
Kennedy L, Naaman M (2008) Generating diverse and representative image search results for landmarks. WWW
Lee J, Tong W, Jin R, Jain AK Image retrieval in forensics: application to tattoo image database. IEEE Multimed
Li X, Snoek CGM, Worring M, Smeulders AWM (2011) Social negative bootstrapping for visual categorization. ICMR’11, April 17–20, Trento, Italy
Li J, Qian X, Tang Y, Yang L (2013) GPS estimation for users’ photos. MMM
Li J, Qian X, Tang Y, Yang L, Mei T (2014) GPS estimation for places of interest from social users’ uploaded photos. IEEE Trans Multimed
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60:91–110
Article Google Scholar
Philbin J, Sivic J, Zisserman A (2010) Geometric latent Dirichlet allocation on amatching graph for large-scale image datasets. Int J Comput Vis
Qian X, Guo D, Hou X, Li Z, Wang H, Liu G (2012) HWVP: Hierarchical wavelet packet descriptors and their applications in scene categorization and semantic concept retrieval. Multimed Tools Appl pp 1–24
Sayad IE, Martinet J, Urruty T, Benabbas Y, Djeraba C (2011) A semantically significant visual representation for social image retrieval vol. 978-1-61284-350-6/11/$26.00 ©2011 IEEE
Song J, Yi Y, Huang Z, Shen H, Hong R (2011) Multiple feature hashing for real-time large scale near-duplicate video retrieval. MM’11, November 28–December 1, 2011, Scottsdale, Arizona, USA
Stricker MA, Orengo M (1995) Similarity of color images. Proc IS&T/SPIE\’s Symp Electron Imaging Sci Technol pp 381–392
Wang X, Zhang L, Ma W Duplicate-search-based image annotation using web-scaledata. doi:10.1109/JPROC.2012.2193109
Wang B, Li Z, Li M, Ma W (2006) Large-scale duplicate detection for web image search. ICME
Wang M, Yang K, Hua X (2010) Towards a relevant and diverse search of social images. IEEE Trans Multimed 12(8)
Wang Y, Hou Z, Leman K, Pham NT, Chua TW (2011) Combination of local and global features for near-duplicate detection. In: Lee K-T et al. (eds), MMM 2011, Part I, LNCS 6523, pp 328–338
Wu P, Hoi SCH, Zhao P, He Y (2011) Mining social images with distance metric learning for automated image tagging. WSDM’11, February 9–12, 2011, Hong Kong, China
Xu D, Cham TJ, Yan S, Chang S Near duplicate image identification with spatially aligned pyramid matching
Xu D, Cham TJ, Yan S, Duan L, Chang S (2010) Near duplicate identification with spatially aligned pyramid matching. IEEE Trans Circ Syst Video Technol
Zhou W, Lu Y, Li H, Song Y, Tian Q (2010) Spatial coding for large scale partial-duplicate web image search. MM’10, October 25–29, 2010, Firenze, Italy

Download references

Author information

Authors and Affiliations

SMILES LAB at School of Electronics and Information Engineering, Xi’an Jiaotong University, Xi’an, 710049, China
Jing Li, Xueming Qian, Qing Li & Yisi Zhao
Xinjiang University, Urumqi, China
Liejun Wang
Macau University, Macau, China
Yuan Yan Tang

Authors

Jing Li
View author publications
You can also search for this author in PubMed Google Scholar
Xueming Qian
View author publications
You can also search for this author in PubMed Google Scholar
Qing Li
View author publications
You can also search for this author in PubMed Google Scholar
Yisi Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Liejun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yuan Yan Tang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Xueming Qian or Liejun Wang.

Additional information

This work is supported in part by National Natural Science Foundation of China (NSFC) Project No.61173109 and No.60903121, No.61261036, and Microsoft Research Asia.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Li, J., Qian, X., Li, Q. et al. Mining near duplicate image groups. Multimed Tools Appl 74, 655–669 (2015). https://doi.org/10.1007/s11042-014-2008-0

Download citation

Received: 29 October 2013
Revised: 16 March 2014
Accepted: 04 April 2014
Published: 02 May 2014
Issue Date: January 2015
DOI: https://doi.org/10.1007/s11042-014-2008-0

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Mining near duplicate image groups

Abstract

Access this article

Similar content being viewed by others

Learning with Noisy Correspondence

Image Generation: A Review

Open Set Recognition in Real World

References

Author information

Authors and Affiliations

Corresponding authors

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Mining near duplicate image groups

Abstract

Access this article

Similar content being viewed by others

Learning with Noisy Correspondence

Image Generation: A Review

Open Set Recognition in Real World

References

Author information

Authors and Affiliations

Corresponding authors

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation