Abstract
Hadoop has become a widely used open source framework for large scale data processing. MapReduce is the core component of Hadoop. It is this programming paradigm that allows for massive scalability across hundreds or thousands of servers in a Hadoop cluster. It allows processing of extremely large video files or image files on data nodes. This can be used for implementing Content Based Image Retrieval (CBIR) algorithms on Hadoop to compare and match query images to the previously stored terabytes of an image descriptors databases. This work presents the implementation for one of the well-known CBIR algorithms called Scale Invariant Feature Transformation (SIFT) for image features extraction and matching using Hadoop platform. It gives focus on utilizing the parallelization capabilities of Hadoop MapReduce to enhance the CBIR performance and decrease data input\output operations through leveraging Partitioners and Combiners. Additionally, image processing and computer vision tools such as Hadoop Image Processing (HIPI) and Open Computer Vision (OpenCV) are integration is shown.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Nausheen, K.M., Ram, M.S. Haarfilter: a machine learning tool for image processing in Hadoop. Int. J. Technol. Res. Eng. 3 (2015)
Barapatre, M.H., Nirgun, M.V., Jagtap, M.H., Ginde, M.S.: Image processing using mapreduce with performance analysis. Int. J. Emerg. Technol. Innovative Eng. I(4) (2015)
Yan, Y., Huang, L.: Large scale image processing research cloud. In: Cloud Computing, pp. 88–93 (2014)
Cheng, E.: Efficient feature extraction from a wide area motion imagery by MapReduce in Hadoop. In: SPIE Defense + Security. International Society for Optics and Photonics (2014)
Augustine, D.P.: Leveraging big data analytics and hadoop in developing India’s healthcare services. Int. J. Comput. Appl. 89(16), 44–50 (2014)
Gawde, A.U., Shah, M., Ukaye, I., Nanavati, M.: Object detection in hadoop using HIPI. Int. J. Adv. Res. Eng. Technol. (2013)
Bajcsy, P.: Terabyte-sized image computations on Hadoop cluster platforms. In: IEEE International Conference on Big Data, pp. 729–737. IEEE (2013)
Han, W., Kang, Y., Chen, Y., Zhang, X.: A MapReduce approach for SIFT feature extraction. In: International Conference on Cloud Computing and Big Data, pp. 465–469 (2013)
Moise, D., Shestakov, D., Thor, G., Amsaleg, L.: Indexing and searching 100 M images with Map-Reduce. In: ACM International Conference on Multimedia Retrieval, pp. 17–24 (2013)
Huitl, R., Schroth, G., Hilsenbeck, S., Schweiger, F., Steinbach, E.: TUMindoor: An extensive image and point cloud dataset for visual indoor localization and mapping. In: 19th IEEE International Conference on Image Processing (ICIP) Orlando, FL, pp. 1773–1776. IEEE (2012)
Schroth, G.: Mobile Visual Location Recognition. Ph.D. Thesis. Munich: Technische Universität München, July 2013
Panchal, P.M., Panchal, S.R., Shah, S.K.: A Comparison of SIFT and SURF. Int. J. Innovative Res. Comput. Commun. Eng. 1(2), 323–327 (2013). ISSN: 2320–9798
Calonder, M., Lepetit, V., Strecha, C., Fua, P.: BRIEF: binary robust independent elementary features. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 778–792. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15561-1_56
White, T.: Hadoop: The Definitive Guide, 2nd edn. O’Reilly Media, Sebastopol (2011)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Gaber, H., Marey, M., Amin, S.E., Tolba, M.F. (2017). Content Based Image Retrieval with Hadoop. In: Hassanien, A., Shaalan, K., Gaber, T., Azar, A., Tolba, M. (eds) Proceedings of the International Conference on Advanced Intelligent Systems and Informatics 2016. AISI 2016. Advances in Intelligent Systems and Computing, vol 533. Springer, Cham. https://doi.org/10.1007/978-3-319-48308-5_25
Download citation
DOI: https://doi.org/10.1007/978-3-319-48308-5_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-48307-8
Online ISBN: 978-3-319-48308-5
eBook Packages: EngineeringEngineering (R0)