Apache Hive

Srinivasa, K. G.; G. M., Siddesh; H., Srinidhi

doi:10.1007/978-3-319-77800-6_3

K. G. Srinivasa⁶,
Siddesh G. M.⁷ &
Srinidhi H.⁸

Part of the book series: Computer Communications and Networks ((CCN))

2448 Accesses

Abstract

Hadoop and MapReduce programming were discussed in the previous chapter for data analytics with examples. The main drawback of the MapReduce programming is the developer needs to have a basic understanding of Hadoop and MapReduce programming. To overcome this disadvantage, Hive one of the Hadoop ecosystem tools can be used. Apache Hive is a data warehouse solution under the Hadoop ecosystem. It provides an SQL kind of language called as HiveQL for processing the data. In this chapter, Hive and its architectural components are discussed first. Later, the chapter is followed with different kinds of operations that can be executed in Hive and examples on it. The chapter concludes with the network log and call log case studies with Hive.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.00; Price excludes VAT (USA)

Hardcover Book: USD 129.00; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

https://hive.apache.org/.
Du, D. (2015). Apache Hive essentials. Packt Publishing Ltd.
Google Scholar
Capriolo, E., Wampler, D., & Rutherglen, J. (2012). Programming Hive: Data warehouse and query language for Hadoop. O’Reilly Media, Inc.
Google Scholar
Chen, Y., Qin, X., Bian, H., Chen, J., Dong, Z., Du, X., et al. (2014, March). A study of SQL-on-hadoop systems. In Workshop on big data benchmarks, performance optimization, and emerging hardware (pp. 154–166). Cham: Springer.
Google Scholar
Edward, C., Dean, W., & Jason, R. (2012). Programming hive.
Google Scholar
Lin, X., Wang, P., & Wu, B. (2013, November). Log analysis in cloud computing environment with Hadoop and Spark. In 5th IEEE International Conference on Broadband Network & Multimedia Technology (IC-BNMT) (pp. 273–276).
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information Technology, Ch. Brahm Prakash Government Engineering College, Jaffarpur, Delhi, India
K. G. Srinivasa
Department of Computer Science and Engineering, Ramaiah Institute of Technology, Bangalore, Karnataka, India
Siddesh G. M.
Department of Computer Science and Engineering, Ramaiah Institute of Technology, Bangalore, Karnataka, India
Srinidhi H.

Authors

K. G. Srinivasa
View author publications
You can also search for this author in PubMed Google Scholar
Siddesh G. M.
View author publications
You can also search for this author in PubMed Google Scholar
Srinidhi H.
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to K. G. Srinivasa .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Srinivasa, K.G., G. M., S., H., S. (2018). Apache Hive. In: Network Data Analytics. Computer Communications and Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-77800-6_3

Download citation

DOI: https://doi.org/10.1007/978-3-319-77800-6_3
Published: 27 April 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-77799-3
Online ISBN: 978-3-319-77800-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics