Abstract
Chapter 2 showed us how to use PolyBase to integrate SQL Server with Azure Blob Storage. In this chapter, we will integrate to the original external data source: Hadoop. In the first part of this chapter, we will take a peek at an already-built Hadoop cluster. Then, we will configure PolyBase to work with the two key variants of Hadoop (pending the release of an on-premises version of Cloudera Data Platform). Next, we will review what changes in terms of PolyBase functionality between Azure Blob Storage and Hadoop. After this review, we will create the infrastructure needed to query a Hadoop external data source. Finally, we will query this external data source and confirm that data retrieval and data insertion both work as they do in Azure Blob Storage.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2020 Kevin Feasel
About this chapter
Cite this chapter
Feasel, K. (2020). Connecting to Hadoop. In: PolyBase Revealed. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-5461-5_3
Download citation
DOI: https://doi.org/10.1007/978-1-4842-5461-5_3
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-5460-8
Online ISBN: 978-1-4842-5461-5
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)