Abstract
Cloud Database Management System (CDBMS) is one of the services offered by various cloud service providers. Cloud providers must cope with different users, different kinds of data, and different processing and analysis workloads on that data. Traditional database management systems are insufficient to handle such a variety of data, users and requirements. Hence, at the conceptual layer of a CDBMS, traditional SQL, Oracle and many other database languages are insufficient to provide proper services to their users. HIVE and Pig offer languages that are suited to the cloud environment and can handle such large amounts of data. In this paper, we compare the performance of a 3-node cluster with that of a cloud-based cluster provided by Amazon Web Services (AWS). We compare the processing of structured data using different queries provided by the HIVE tool on the 3-node cluster and on the AWS cluster. We conclude that HIVE queries on the AWS cluster give better results than on the 3-node cluster.
Copyright information
© 2016 Springer India
About this paper
Cite this paper
Malhotra, S., Doja, M.N., Alam, B., Alam, M., Anand, A. (2016). Executing HIVE Queries on 3-Node Cluster and AWS Cluster—Comparative Analysis. In: Satapathy, S., Mandal, J., Udgata, S., Bhateja, V. (eds) Information Systems Design and Intelligent Applications. Advances in Intelligent Systems and Computing, vol 433. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2755-7_32
DOI: https://doi.org/10.1007/978-81-322-2755-7_32
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2753-3
Online ISBN: 978-81-322-2755-7
eBook Packages: Engineering, Engineering (R0)