© 2017

Processing Big Data with Azure HDInsight

Building Real-World Big Data Systems on Azure HDInsight Using the Hadoop Ecosystem


Table of contents

  1. Front Matter
    Pages i-xix
  2. Vinit Yadav
    Pages 1-11
  3. Vinit Yadav
    Pages 13-43
  4. Vinit Yadav
    Pages 45-70
  5. Vinit Yadav
    Pages 71-110
  6. Vinit Yadav
    Pages 111-122
  7. Vinit Yadav
    Pages 123-142
  8. Vinit Yadav
    Pages 143-172
  9. Vinit Yadav
    Pages 173-202
  10. Back Matter
    Pages 203-207

About this book


Get a jump start on using Azure HDInsight and Hadoop Ecosystem components. As most Hadoop and Big Data projects are written in either Java, Scala, or Python, this book minimizes the effort to learn another language and is written from the perspective of a .NET developer. Hadoop components are covered, including Hive, Pig, HBase, Storm, and Spark on Azure HDInsight, and code samples are written in .NET only.

Processing Big Data with Azure HDInsight covers the fundamentals of big data, how businesses are using it to their advantage, and how Azure HDInsight fits into the big data world. This book introduces Hadoop and big data concepts and then dives into creating different solutions with HDInsight and the Hadoop Ecosystem. It covers concepts with real-world scenarios and code examples, making sure you get hands-on experience. The best way to utilize this book is to practice while reading. After reading this book you will be familiar with Azure HDInsight and how it can be utilized to build big data solutions, including batch processing, stream analytics, interactive processing, and storing and retrieving data in an efficient manner.

What You Will Learn: 
  • Understand the fundamentals of HDInsight and Hadoop
  • Work with HDInsight cluster
  • Query with Apache Hive and Apache Pig
  • Store and retrieve data with Apache HBase
  • Stream data processing using Apache Storm
  • Work with Apache Spark


Microsoft Windows Azure HDInsight Big Data Hadoop Hive Pig HBase Storm Spark

Authors and affiliations

  1. 1.AhmedabadIndia

About the authors

Vinit Yadav is Founder and CEO of Veloxcore. He started working with Azure when it first came out in 2010, and since then he has been continuously involved in designing solutions around the Microsoft Azure platform. At Veloxcore, he continues to build and deliver highly scalable big data solutions. He is also a machine learning and data science enthusiastic, passionate programmer, and has over 12 years of experience in designing and developing enterprise applications using various .NET technologies.

Vinit founded Veloxcore to help organizations leverage big data and machine learning. He and his team at Veloxcore are actively engaged in developing software solutions for their global customers using agile methodologies. On a side note, he likes to travel, read, and watch sci-fi content and loves to draw, paint, and create something new.

Bibliographic information

Industry Sectors
IT & Software
Consumer Packaged Goods
Finance, Business & Banking