Data Warehousing Using Hadoop

Wadkar, Sameer; Siddalingaiah, Madhu

doi:10.1007/978-1-4302-4864-4_10

Sameer Wadkar¹ &
Madhu Siddalingaiah¹

3542 Accesses

Abstract

The Hadoop platform supports several data warehousing solutions, including Apache Hive, Impala, and Shark. These solutions are conceptually similar to relational databases at much larger scale but differ in their implementation and usage model. Relational databases are often used in transactional systems in which single row inserts, updates, and deletes must be executed atomically. Efficient indexing and referential integrity with primary/foreign keys allow modern relational databases to find records quickly and guarantee that all data satisfies a strict schema. Relational databases try to avoid full table scans whenever possible because I/O bandwidth is limited and tends to be the bottleneck in these systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 44.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

MD, US
Sameer Wadkar & Madhu Siddalingaiah

Authors

Sameer Wadkar
View author publications
You can also search for this author in PubMed Google Scholar
Madhu Siddalingaiah
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Wadkar, S., Siddalingaiah, M. (2014). Data Warehousing Using Hadoop. In: Pro Apache Hadoop. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4302-4864-4_10

Download citation

DOI: https://doi.org/10.1007/978-1-4302-4864-4_10
Published: 08 September 2014
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4302-4863-7
Online ISBN: 978-1-4302-4864-4
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)

Publish with us

Policies and ethics