Data Movement

  • Sudhir Rawat
  • Abhishek Narain
Chapter

Abstract

Any extract-transform-load (ETL) or extract-load-transform (ELT) project starts with data ingestion (Figure 3-1). You need to be able to connect to various sources, whether on a public network or behind a firewall on a private network, and pull their data into a staging location or a destination in the cloud. In the ELT pattern for Big Data processing, you generally land all your data in a staging blob store or data lake in the cloud and then, as needed, run analytical jobs or transform activities to gain further insights or to perform basic data cleansing.
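
As a rough illustration of the ELT ordering described above, the following minimal sketch loads source records into a staging location unchanged and only later runs a basic cleansing step against the staged copy. It is an assumption-laden toy, not the chapter's method: a local temporary directory stands in for a cloud staging blob or data lake, and the function names (`load_raw`, `cleanse`, `run_demo`), the file name, and the sample records are all hypothetical.

```python
import json
import tempfile
from pathlib import Path


def load_raw(records, staging_dir):
    """Load step: write source records to staging unchanged, one JSON object per line."""
    path = Path(staging_dir) / "raw_records.jsonl"  # hypothetical file name
    with path.open("w", encoding="utf-8") as f:
        for rec in records:
            f.write(json.dumps(rec) + "\n")
    return path


def cleanse(staged_path):
    """Transform step, run later on staged data: trim strings, drop rows with an empty id."""
    cleaned = []
    with Path(staged_path).open(encoding="utf-8") as f:
        for line in f:
            rec = json.loads(line)
            rec = {k: v.strip() if isinstance(v, str) else v for k, v in rec.items()}
            if rec.get("id"):
                cleaned.append(rec)
    return cleaned


def run_demo():
    # Hypothetical source data; a real project would pull this from a source system.
    source = [
        {"id": "1", "city": " Bangalore "},
        {"id": "", "city": "Shanghai"},
        {"id": "2", "city": "Pune"},
    ]
    # A temporary directory stands in for the cloud staging location (assumption).
    with tempfile.TemporaryDirectory() as staging:
        staged = load_raw(source, staging)
        return cleanse(staged)


if __name__ == "__main__":
    print(run_demo())
```

The point of the split is that the raw copy in staging is never modified, so different transform or analytical jobs can be run against it as needs change.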

Copyright information

© Sudhir Rawat and Abhishek Narain 2019

Authors and Affiliations

  • Sudhir Rawat (1)
  • Abhishek Narain (2)
  1. Bangalore, India
  2. Shanghai, China
