Skip to main content

Apache Hive

  • Chapter
  • First Online:
Network Data Analytics

Part of the book series: Computer Communications and Networks ((CCN))

  • 2448 Accesses

Abstract

Hadoop and MapReduce programming were discussed in the previous chapter for data analytics with examples. The main drawback of the MapReduce programming is the developer needs to have a basic understanding of Hadoop and MapReduce programming. To overcome this disadvantage, Hive one of the Hadoop ecosystem tools can be used. Apache Hive is a data warehouse solution under the Hadoop ecosystem. It provides an SQL kind of language called as HiveQL for processing the data. In this chapter, Hive and its architectural components are discussed first. Later, the chapter is followed with different kinds of operations that can be executed in Hive and examples on it. The chapter concludes with the network log and call log case studies with Hive.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 129.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 129.00
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. https://hive.apache.org/.

  2. Du, D. (2015). Apache Hive essentials. Packt Publishing Ltd.

    Google Scholar 

  3. Capriolo, E., Wampler, D., & Rutherglen, J. (2012). Programming Hive: Data warehouse and query language for Hadoop. O’Reilly Media, Inc.

    Google Scholar 

  4. Chen, Y., Qin, X., Bian, H., Chen, J., Dong, Z., Du, X., et al. (2014, March). A study of SQL-on-hadoop systems. In Workshop on big data benchmarks, performance optimization, and emerging hardware (pp. 154–166). Cham: Springer.

    Google Scholar 

  5. Edward, C., Dean, W., & Jason, R. (2012). Programming hive.

    Google Scholar 

  6. Lin, X., Wang, P., & Wu, B. (2013, November). Log analysis in cloud computing environment with Hadoop and Spark. In 5th IEEE International Conference on Broadband Network & Multimedia Technology (IC-BNMT) (pp. 273–276).

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. G. Srinivasa .

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Srinivasa, K.G., G. M., S., H., S. (2018). Apache Hive. In: Network Data Analytics. Computer Communications and Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-77800-6_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-77800-6_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-77799-3

  • Online ISBN: 978-3-319-77800-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics