Skip to main content

Data Processing Using Pig

  • Chapter
  • First Online:
  • 3482 Accesses

Abstract

So far in this book, we have explored how to develop MapReduce programs using Java. Chapter 10 introduced Hive, the SQL engine on top of the HDFS. You learned how the Hive compiler converts high–level SQL commands into MapReduce programs, which avoids having to write low–level Java programs; you can instead focus on high–level business requirements. Hive is suitable for BI developers who want to treat the HDFS as a data warehouse.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   34.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Sameer Wadkar and Madhu Siddalingaiah

About this chapter

Cite this chapter

Wadkar, S., Siddalingaiah, M. (2014). Data Processing Using Pig. In: Pro Apache Hadoop. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4302-4864-4_11

Download citation

Publish with us

Policies and ethics