Relational, NoSQL, and Graph Databases

Koitzsch, Kerry

doi:10.1007/978-1-4842-1910-2_4

Kerry Koitzsch²

4734 Accesses
1 Citations

Abstract

In this chapter, we describe the role of databases in distributed big data analysis. Database types include relational databases, document databases, graph databases, and others, which may be used as data sources or sinks in our analytical pipelines. Most of these database types integrate well with Hadoop ecosystem components, as well as with Apache Spark. Connectivity between different kinds of database and Hadoop/Apache Spark-distributed processing may be provided by “glueware” such as Spring Data or Apache Camel. We describe relational databases, such as MySQL, NoSQL databases such as Cassandra, and graph databases such as Neo4j, and how to integrate them with the Hadoop ecosystem.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 16.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

Sunnyvale, California, USA
Kerry Koitzsch

Authors

Kerry Koitzsch
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Koitzsch, K. (2017). Relational, NoSQL, and Graph Databases. In: Pro Hadoop Data Analytics . Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-1910-2_4

Download citation

DOI: https://doi.org/10.1007/978-1-4842-1910-2_4
Published: 30 December 2016
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-1909-6
Online ISBN: 978-1-4842-1910-2
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)

Publish with us

Policies and ethics