Mobile Networks and Applications, Volume 24, Issue 1, pp 282–294

Effective Big Data Retrieval Using Deep Learning Modified Neural Networks

  • T. Prasanth
  • M. Gunasekaran


In big data, information retrieval (IR) is the discovery of documents in a collection that are relevant to a user query. Data retrieval systems are typically used to scan the data. Existing approaches that rely on conventional IR are slow on large document collections. More recent IR approaches are faster than the traditional methods, but the quality of the retrieved documents is lower. To overcome these difficulties, an effective big data retrieval method using Deep Learning Modified Neural Networks (DLMNN) is proposed. In the training process, general pre-processing and feature extraction are performed first: pre-processing removes unwanted data and converts unstructured data into structured data, and feature extraction computes the frequency and entropy of the input data. Second, the closed frequent itemsets are found, and the weight of the data is computed using an entropy measure and a frequent-item measure. Third, the documents are clustered using the k-means algorithm and then classified using the DLMNN: k-means partitions the document collection into several clusters, and the DLMNN classifies the documents into positive and negative classes. The DLMNN weight parameters are optimized using the cuckoo search (CS) optimization algorithm. The last stage of the training process generates a training database. In the retrieval process, the user query is pre-processed, its frequent itemsets are found, and the matching data are retrieved. Finally, a similarity assessment is performed: if the information is found, it is visualized; otherwise, the process returns to the initial step. Experimental results are compared with the earlier MRT and IRI-RAS techniques in terms of precision, recall, F-measure, and computation time. The proposed document IR method performs better than these existing methods.
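The frequency and entropy calculations and the final similarity assessment described above can be illustrated with a small sketch. The abstract does not give the paper's exact formulas, so everything here is an assumption made for illustration: relative term frequency as the frequency measure, Shannon entropy as the entropy measure, cosine similarity for the similarity assessment, and the names `preprocess`, `term_frequencies`, `shannon_entropy`, `cosine_similarity`, `retrieve`, the stop-word list, and the `threshold` parameter. The clustering, DLMNN classification, and cuckoo search steps are not shown.

```python
import math
import re
from collections import Counter

# Illustrative stop-word list (an assumption, not taken from the paper).
STOP_WORDS = {"a", "an", "the", "of", "in", "and", "is", "to", "for"}


def preprocess(text):
    """Lowercase the text, split it into alphanumeric tokens, drop stop words."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]


def term_frequencies(tokens):
    """Relative frequency of each term within one document."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {term: n / total for term, n in counts.items()}


def shannon_entropy(freqs):
    """Shannon entropy, in bits, of a term-frequency distribution."""
    return -sum(p * math.log2(p) for p in freqs.values() if p > 0)


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-weight dictionaries."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, documents, threshold=0.1):
    """Return (document index, score) pairs at or above the threshold, best first."""
    query_vec = term_frequencies(preprocess(query))
    scored = []
    for index, doc in enumerate(documents):
        doc_vec = term_frequencies(preprocess(doc))
        score = cosine_similarity(query_vec, doc_vec)
        if score >= threshold:
            scored.append((index, score))
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    docs = [
        "Big data retrieval with deep learning",
        "Cooking recipes for pasta",
    ]
    for index, score in retrieve("big data retrieval", docs):
        entropy = shannon_entropy(term_frequencies(preprocess(docs[index])))
        print(index, round(score, 3), round(entropy, 3))
```

In this sketch a query that shares no terms with a document scores zero and is filtered out by the threshold, which loosely mirrors the "found, then visualized" branch of the similarity assessment.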


Big data retrieval; Closed frequent item set; Deep learning modified neural networks (DLMNN); Cuckoo Search algorithm (CSA); K-means clustering algorithm; Mutual refinement technique (MRT); Intelligent rule-based relevant information retrieval algorithm with semantics (IRI-RAS)




Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2019

Authors and Affiliations

  1. Department of Information Technology, Bannari Amman Institute of Technology, Sathyamangalam, India
