Horizontal Scaling Enhancement for Optimized Big Data Processing

Roy, Chandrima; Barua, Kashyap; Agarwal, Sandeep; Pandey, Manjusha; Rautaray, Siddharth Swarup

doi:10.1007/978-981-13-1951-8_58

Horizontal Scaling Enhancement for Optimized Big Data Processing

Chandrima Roy¹⁹,
Kashyap Barua¹⁹,
Sandeep Agarwal¹⁹,
Manjusha Pandey¹⁹ &
…
Siddharth Swarup Rautaray¹⁹

Conference paper
First Online: 12 December 2018

672 Accesses
5 Citations

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 755))

Abstract

Big Data, as we all know, is becoming a new technological trend in the industries, in science and even businesses. Indefinite data scalability allows organizations to process huge amounts of data in parallel, assisting dramatically decrease the amount of time it takes to manage several amount of work, optimize hardware resource usage and permit the extreme quantity of data per node to be handled. Optimization is to done to attain the finest strategy relative to a set of selected constraints which include maximizing factors such as efficiency, productivity, reliability, strength, and utilization. When the current system becomes insufficient, instead of upgrading it by adding more components to the existing structure you just add more computers to a cluster. This research discusses a hierarchical architecture of Hadoop Nodes namely Name nodes and Data nodes and mainly focuses on the optimization of Data Node by distributing some of its work load to Name Node.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 229.00; Price excludes VAT (USA)

Softcover Book: USD 299.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Yadav, K., Pandey, M., Rautaray, S.S.: Feedback analysis using big data tools. In: International Conference on ICT in Business Industry & Government (ICTBIG). IEEE (2016)
Google Scholar
Chakraborty, S. et al.: A proposal for high availability of HDFS architecture based on threshold limit and saturation limit of the namenode (2017)
Google Scholar
Jena, B. et al.: Name node performance enlarging by aggregator based HADOOP framework. In: 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC). IEEE (2017)
Google Scholar
Shvachko, K., et al.: The hadoop distributed file system. In: 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST). IEEE (2010)
Google Scholar
Jahani, Eaman, Cafarella, Michael J., Ré, Christopher: Automatic optimization for MapReduce programs. Proc. VLDB Endow. 4(6), 385–396 (2011)
Article Google Scholar
Lee, K.-H. et al.: Parallel data processing with MapReduce: a survey. ACM sIGMoD Record 40(4), 11–20 (2012)
Google Scholar
White, T.: Hadoop: The Definitive Guide. O’Reilly Media, Inc. (2012)
Google Scholar
Kanaujia, P.K.M., Pandey, M., Rautaray, S.S.: Real time financial analysis using big data technologies. In: 2017 International Conference on I-SMAC (IoT in Social, Mobile, Analytics and Cloud)(I-SMAC). IEEE (2017)
Google Scholar
Borthakur, Dhruba: The hadoop distributed file system: architecture and design. Hadoop Proj. Website 11(2007), 21 (2007)
Google Scholar
Jena, B. et al.: A survey work on optimization techniques utilizing map reduce framework. Hadoop Cluster. Int. J. Intell. Syst. Appl. 9(4), 61 (2017)
Google Scholar
Feng, D., Zhu, L., Zhang, L.: Review of hadoop performance optimization. In: 2016 2nd IEEE International Conference on Computer and Communications (ICCC). IEEE (2016)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Engineering, KIIT, Deemed to be University, Bhubaneswar, 751024, Odisha, India
Chandrima Roy, Kashyap Barua, Sandeep Agarwal, Manjusha Pandey & Siddharth Swarup Rautaray

Authors

Chandrima Roy
View author publications
You can also search for this author in PubMed Google Scholar
Kashyap Barua
View author publications
You can also search for this author in PubMed Google Scholar
Sandeep Agarwal
View author publications
You can also search for this author in PubMed Google Scholar
Manjusha Pandey
View author publications
You can also search for this author in PubMed Google Scholar
Siddharth Swarup Rautaray
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chandrima Roy .

Editor information

Editors and Affiliations

Machine Intelligence Research Labs, Auburn, WA, USA
Ajith Abraham
Department of Computer and Systems Sciences, Visva-Bharati University, Santiniketan, West Bengal, India
Paramartha Dutta
Department of Computer Science and Engineering, University of Kalyani, Kalyani, India
Jyotsna Kumar Mandal
Institute of Engineering and Management, Kolkata, West Bengal, India
Abhishek Bhattacharya
Institute of Engineering and Management, Kolkata, West Bengal, India
Soumi Dutta

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Roy, C., Barua, K., Agarwal, S., Pandey, M., Rautaray, S.S. (2019). Horizontal Scaling Enhancement for Optimized Big Data Processing. In: Abraham, A., Dutta, P., Mandal, J., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 755. Springer, Singapore. https://doi.org/10.1007/978-981-13-1951-8_58

Download citation

DOI: https://doi.org/10.1007/978-981-13-1951-8_58
Published: 12 December 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1950-1
Online ISBN: 978-981-13-1951-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics