Abstract
High-Performance Computing (HPC) is one of the upcoming technologies that represent data-intensive and compute-intensive applications. HPC–on–Cloud is an added advantage to enhance the efficiency of massively parallel applications. Hadoop-MapReduce is a programming paradigm designed to process parallel data on cloud. The key to improve performance of Hadoop-MapReduce lies with Efficient Resource allocation and Scheduling. In this paper, we analyze the behaviour of resource-aware adaptive scheduling which aims to improve resource utilization in MapReduce clusters.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ekanayake, J., Qiu, X., Gunarathne, T., Beason, S., Fox, G.: High Performance Parallel Computing with Cloud and Cloud Technologies, vol. 34, pp. 20–38 (2010)
Ekanayake, J., Fox, G.: High Performance parallel computing with clouds and cloud technologies. 1st international conference on cloud computing, 19–21 Oct 2009
Wikipedia. High-performance computing. https://en.wikipedia.org/wiki/Supercomputer
NIST Definition of Cloud Computing v15. csrc.nist.gov/groups/SNS/cloud-computing/cloud-def-v15.doc
http://www.computer.org/web/computingnow/archive/september2012
Ye, X., Lv, A., Zhao, L.: Research of High Performance Computing With Clouds, pp. 289–293 (2010)
Dean, J., Ghemawat: MapReduce: Simplified Data Processing On Large Clusters. Google Inc. (2004)
Wang, G.: Evaluating MapReduce System Performance:A Simulation Approach (2012)
Apache. Hadoop. http://hadoop.apache.org
Hadoop: The Definitive Guide, Second Edition, by Tom White, O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472
Hadoop Distributed File System. http://hadoop.apache.org/hdfs
Hadoop’s Fair Scheduler. https://hadoop.apache.org/docs/r1.2.1/fair_scheduler
Hadoop’s Capacity Scheduler. http://hadoop.apache.org/core/docs/current/capacity_scheduler.html
Polo, J., Castillo, C., Carrera, D., Becerra, Y., Whalley, I., Steinder, M., Torres, J., Ayguadé, E.: Resource-aware adaptive scheduling for MapReduce clusters. In Proceedings of the 12th ACM/IFIP/USENIX international conference on Middleware, in (2011)
Zhang, Q., Zhani, M.F., Yang, Y., Boutaba, R., Wong, B.: PRISM: Fine-grained resource-aware scheduling for mapreduce. (2015)
Wottrich, K., Bressoud, T.: The Performance Characteristics of MapReduceApplications on Scalable Clusters. MCURCSM (2011)
Dong, F., Akl, S.G.: Scheduling algorithms for grid computing: state of art and open problems. Queens University Technicalreport (2006)
Shivakumar, N., Rashmi, S., Basu, A.: Hadoop map reduce multi-job workloads using resource aware scheduler. IJARCS (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Rashmi, S., Basu, A. (2018). An Analysis of Resource-Aware Adaptive Scheduling for HPC Clusters with Hadoop. In: Aggarwal, V., Bhatnagar, V., Mishra, D. (eds) Big Data Analytics. Advances in Intelligent Systems and Computing, vol 654. Springer, Singapore. https://doi.org/10.1007/978-981-10-6620-7_22
Download citation
DOI: https://doi.org/10.1007/978-981-10-6620-7_22
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6619-1
Online ISBN: 978-981-10-6620-7
eBook Packages: EngineeringEngineering (R0)