Abstract
Apache Hive is a data warehouse framework for querying and managing large datasets stored in Hadoop distributed filesystems (HDFS). Hive also provides a SQL-like query language called HiveQL. The HiveQL queries may be run in the Hive CLI shell. By default, Hive stores data in the HDFS, but also supports the Amazon S3 filesystem.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2016 Deepak Vohra
About this chapter
Cite this chapter
Vohra, D. (2016). Apache Hive. In: Practical Hadoop Ecosystem. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-2199-0_3
Download citation
DOI: https://doi.org/10.1007/978-1-4842-2199-0_3
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-2198-3
Online ISBN: 978-1-4842-2199-0
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)