Skip to main content

Connecting to Hadoop

  • Chapter
  • First Online:
PolyBase Revealed
  • 719 Accesses

Abstract

Chapter 2 showed us how to use PolyBase to integrate SQL Server with Azure Blob Storage. In this chapter, we will integrate to the original external data source: Hadoop. In the first part of this chapter, we will take a peek at an already-built Hadoop cluster. Then, we will configure PolyBase to work with the two key variants of Hadoop (pending the release of an on-premises version of Cloudera Data Platform). Next, we will review what changes in terms of PolyBase functionality between Azure Blob Storage and Hadoop. After this review, we will create the infrastructure needed to query a Hadoop external data source. Finally, we will query this external data source and confirm that data retrieval and data insertion both work as they do in Azure Blob Storage.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 24.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 32.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Kevin Feasel

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Feasel, K. (2020). Connecting to Hadoop. In: PolyBase Revealed. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-5461-5_3

Download citation

Publish with us

Policies and ethics