Information Retrieval Using Hadoop Big Data Analysis

Conference paper
Part of the Springer Proceedings in Physics book series (SPPHY, volume 166)


This paper concerns big data analysis, the process of examining huge amounts of information in an attempt to uncover hidden patterns. Through big data analytics applications, organizations in both the public and private sectors have made a strategic decision to turn big data into a competitive advantage. The primary task of extracting value from big data gives rise to a process for pulling information from multiple different sources; this process is known as extract, transform, and load (ETL). The approach in this paper extracts information from log files and research papers, which reduces the effort needed for pattern finding and for summarizing documents from several sources. The work helps build a better understanding of basic Hadoop concepts and improves the user experience for research. In this paper, we propose an approach that uses Hadoop to analyze log files and find concise information that is useful and saves time. The proposed approach will be applied to different research papers in a specific domain to obtain summarized content for further improvement and for creating new content.


File System; Inverted Index; Hadoop Distributed File System; Distributed File System; Inverted File
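The abstract describes pulling information out of log files and research papers, and the keywords name an inverted index. As a rough illustration only, the following is a minimal in-memory sketch of how a map step and a reduce step could build such an index. It is not the authors' implementation and does not use Hadoop itself; the function names (`map_phase`, `reduce_phase`, `build_inverted_index`), the tokenization rule, and the sample documents are all assumptions made for this example.

```python
import re
from collections import defaultdict


def map_phase(doc_id, text):
    """Emit one (term, doc_id) pair per distinct lowercase term in the text."""
    # Assumption: a term is any run of ASCII letters or digits.
    terms = set(re.findall(r"[a-z0-9]+", text.lower()))
    return [(term, doc_id) for term in terms]


def reduce_phase(pairs):
    """Group pairs by term and return a dict of term -> sorted document ids."""
    postings = defaultdict(set)
    for term, doc_id in pairs:
        postings[term].add(doc_id)
    return {term: sorted(ids) for term, ids in postings.items()}


def build_inverted_index(documents):
    """Run the map step over every document, then reduce all emitted pairs."""
    pairs = []
    for doc_id, text in documents.items():
        pairs.extend(map_phase(doc_id, text))
    return reduce_phase(pairs)


# Hypothetical sample input: two log lines and one paper title.
documents = {
    "log1": "ERROR disk full on node3",
    "log2": "INFO job finished on node3",
    "paper1": "Hadoop job scheduling",
}

index = build_inverted_index(documents)
print(index["node3"])  # ['log1', 'log2']
print(index["job"])    # ['log2', 'paper1']
```

In a real Hadoop job the grouping done inside `reduce_phase` would be handled by the framework's shuffle stage, and the documents would be read from a distributed file system rather than from a Python dictionary.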



Copyright information

© Springer India 2015

Authors and Affiliations

  1. Department of CSE, Mewar University, Chitordgarh, India
  2. Faculty of Engineering and Technology, Mewar University, Chitordgarh, India
