Abstract
Since the emergence of cloud datacenters provides an enormous amount of resources easily accessible to people, it is challenging to provide an efficient search framework in such a distributed environment. However, traditional search techniques only allow users to search images over exact-match keywords through a centralized index. These methods are insufficient to meet requirements of content based image retrieval (CBIR) and more powerful search frameworks are needed. In this paper, we present LFFIR, a multi-feature image retrieval framework for content similar search in the distributed situation. The key idea is to effectively incorporate image retrieval based on multi-feature into the peer-to-peer (P2P) paradigm. LFFIR fuses the multiple features in order to capture the overall image characteristics. And then it constructs the distributed indexes for the fusion feature through exploiting the property of locality sensitive hashing (LSH). We implement a prototype system to evaluate the system performance with two image datasets. Comprehensive performance evaluations demonstration that our approach brings major performance and accuracy gains compared to the advanced distributed image retrieval framework.
Similar content being viewed by others
References
Androutsos P, Androutsos D, Venetsanopoulos AN (2006) A distributed fault-tolerant MPEG-7 retrieval scheme based on small world theory. IEEE Trans Multimed 8(2):278–288
Batko M, Falchi F, Lucchese C et al (2010) Building a web-scale image similarity search system[J]. Multimed Tools Appl 47(3):599–629
Bawa M, Manku G, Raghavan P (2003) SETS: search enhanced by topic segmentation. Proceedings of the 26th Annual International ACM SIGIR Conference (SIGIR’03), Toronto, Canada, 306–313
Chen J, Hu C, Su C (2008) Scalable retrieval and mining with optimal peer-to-peer configuration. IEEE Trans Multimed 10(2):209–220
Crespo A, Garcia-Molina H (2002) Routing indices for peer-to-peer systems. Proceedings of the 22nd IEEE International Conference on Distributed Computing Systems (ICDCS’02), Vienna, Austria, 23–32
Datar M, Immorlica N, Indyk P, Mirrokni VS (2004) Locality-sensitive hashing scheme based on p-stable distributions. Proceedings of the 20th Annual Symposium on Computational Geometry (SoCG’04), New York, USA, 253–262
Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2), Article 5
Dikaiakos MD, Katsaros D, Mehra P, Pallis G, Vakali A (2009) Cloud computing: distributed internet computing for IT and scientific research. IEEE Internet Comput 13(5):10–13
Eisenhardt M, Muller W, Henrich A, Blank D, Allali SE (2006) Clustering-based source selection for efficient image retrieval in peer-to-peer networks. Proceedings of the 8th IEEE International Symposium on Multimedia (ISM ’06), Washington DC, USA, 823–830
Falchi F, Gennaro C, Zezula P (2005) A content-addressable network for similarity search in metric spaces. Proceedings of the 6th International Workshop on Databases, Information Systems and Peer-to-Peer Computing (DBISP2P’05), Toronto, Canada, 79–92
Forestiero A, Leonardi E, Mastroianni C, Meo M (2010) Self-chord: a bio-inspired P2P framework for self-organizing distributed systems. IEEE/ACM Trans Netw 18(5):1651–1664
Gaeta R, Sereno M (2011) Generalized probabilistic flooding in unstructured peer-to-peer networks. IEEE Trans Parallel Distrib Syst 22(12):2055–2062
Gnutella (2000) Gnutella website. http://www.Gnutella.com
Guo C, Lu G, Li D et al (2009) BCube: a high performance, server-centric network architecture for modular data centers[J]. ACM SIGCOMM Comput Commun Rev 39(4):63–74
Haghani P, Michel S, Aberer K (2009) Distributed similarity search in high dimensions using locality sensitive hashing. Proceedings of the 12th International Conference on Extending Database Technology (EDBT’09), Saint Petersburg, Russia, 744–755
Indyk P, Motwani R (1998) Approximate nearest neighbors: towards removing the curse of dimensionality. Proceedings of the 13th ACM Symposium on Theory of computing (STOC’98), Dallas, Texas, 604–613
Jagadish HV, Ooi BC, Vu QH (2005) BATON: a balanced tree structure for peer-to-peer networks. Proceedings of the 31st international conference on Very large data bases (VLDB’05), Trondheim, Norway, 661–672
Kalnis P, Ng WS, Ooi BC, Tan K (2004) Answering similarity queries in peer-to-peer networks. Inf Syst 31(1):57–72
King I, Ng CH, Sia KC (2004) Distributed content-based visual information retrieval system on peer-to-peer networks. ACM Trans Inf Syst 22(3):477–501
Li F, Fergus R, Perona P (2007) Learning generative visual models from few training examples: an incremental Bayesian approach tested on 101 object categories. Comput Vis Image Underst 106(1):59–70
Liao J, Wang J, Wu B, Wu W (2012) Toward a multi-plane framework of NGSON: a required guideline to achieve pervasive services and efficient resource utilization. IEEE Commun Mag 50(1):90–97
Liao J, Yang D, Li T, Wang J, Qi Q, Zhu X (2014) A scalable approach for content based image retrieval in cloud datacenter. Inf Syst Front 16(1):129–141
Liu G, Zhang L, Hon Y, Li Z, Yang J (2010) Image retrieval based on multi-texton histogram. Pattern Recogn 43(7):2380–2389
Lv Q, Cao P, Cohen E, Li K, Shenker S (2002) Search and replication in unstructured peer-to-peer networks. Proceedings of the 16th ACM Annual International Conference on Supercomputing (ICS’02), New York, USA, 84–95
Novak D, Zezula P (2006) M-Chord: a scalable distributed similarity search structure. Proceedings of the First International Conference on Scalable Information System (INFOSCALE’ 06), Hong Kong, China, Article 19
Peng C, Kim M, Zhang Z, Lei H (2012) VDN: virtual machine image distribution network for cloud data centers. IEEE International Conference on Computer Communications (INFOCOM’12), Orlando, Florida, 181–189
Peng C, Ksim M, Zhang Z, Lei H (2012) VDN: virtual machine image distribution network for cloud data centers. IEEE International Conference on Computer Communications (INFOCOM’12), Orlando, Florida, 181–189
Ratnasamy S, Francis P, Handley M, Karp R, Shenker S (2001) Scalable content-addressable networks. The 2001 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications (SIGCOMM’01), San Diego, USA, 161–172
Sahin OD, Gulbeden A, Emekci F, Agrawal D, Abbadi AE (2005) PRISM: indexing multi-dimensional data in p2p networks using reference vectors. Proceedings of the 13rd Annual ACM International Conference on Multimedia, ACM Multimedia (MM’05), Singapore, 946–955
Snoek C,Worring M, Smeulders AWM (2005) Early versus late fusion in semantic video analysis. Proceedings of the 13rd Annual ACM International Conference on Multimedia, ACM Multimedia (MM’05), Singapore, 399–402
Sripanidkulchai K, Maggs BM, Zhang H (2003) Efficient content location using interest-based locality in peer-to-peer systems. Proceedings of the 22nd Annual Joint Conference of the IEEE Computer and Communications Societies (INFOCOM’03), San Francisco
Stoica I, Morris R, Karger D, Kaashoek MF, Balakrishnan H (2001) Chord: a scalable peer-to-peer lookup service for internet applications. The 2001 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications (SIGCOMM’01), San Diego, USA, 149–160
Tang C, Xu Z, Dwarkadas S (2003) Peer-to-peer information retrieval using self-organizing semantic overlay networks. The 2003 Conference on Applications, Technologies, Architectures, and Protocols for Computer Communications (SIGCOMM’03), Karlsruhe, Germany, 175–186
Tian X, Jiao L, Liu X, Zhang X (2014) Feature integration of EODH and Color-SIFT: application to image retrieval based on codebook. Signal Process Image Commun 29(4):530–545
Urdaneta G, Pierre G, Steen MV (2011) A survey of DHT security techniques[J]. ACM Comput Surv (CSUR) 43(2):8
Vlachou A, Doulkeridis C, Kotidis Y (2012) Metric-based similarity search in unstructured peer-to-peer systems. Trans Large Scale Data Knowl Centered Syst 5:28–48
Wang X, Zhang B, Yang H (2014) Content-based image retrieval by integrating color and texture features. Multimed Tools Appl 68(3):545–569
Yang Z, Zhao BY, Xing Y et al (2010) AmazingStore: available, low-cost online storage service using cloudlets. Proceedings of the 9th International Workshops on Peer-to-Peer Systems (IPTPS’10), San Jose, USA, 1–5
Zhang X, Shou L, Tan K, Chen G (2010) iDISQUE: tuning high-dimensional similarity queries in DHT networks. Proceedings of the 15th International Conference on Database Systems for Advanced Applications (DASFAA’10), Tsukuba, Japan, 19–33
Zhu Y, Hu Y (2007) Efficient semantic search on DHT overlays. J Parallel Distrib Comput 67(5):604–616
Acknowledgments
This work was jointly supported by: (1) the National Basic Research Program of China (No. 2013CB329102); (2) National Natural Science Foundation of China (No. 61471063, 61421061, 61372120,61271019, 61101119, 61121001); (3) the Key(Keygrant) Project of Chinese Ministry of Education.(No. MCM20130310); (4) Beijing Municipal Natural Science Foundation (No. 4152039); (5) Beijing Higher Education Young Elite Teacher Project (No. YETP0473); (6)Spanish Research Council (No: TIN2013-46883); (7)Regional Government of Madrid (No: S2013/ICE-2894) cofunded by FSE & FEDER.
Author information
Authors and Affiliations
Corresponding authors
Rights and permissions
About this article
Cite this article
Liao, J., Yang, D., Li, T. et al. Fusion feature for LSH-based image retrieval in a cloud datacenter. Multimed Tools Appl 75, 15405–15427 (2016). https://doi.org/10.1007/s11042-015-2892-y
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-015-2892-y