Big Data, Hadoop, and HDInsight
Azure HDInsight is a managed Hadoop distribution developed by Microsoft in partnership with Hortonworks. It is built on the Hortonworks Data Platform (HDP) Hadoop distribution, which means that HDInsight is fully Apache Hadoop running on Azure. It deploys and provisions managed Apache Hadoop clusters in the cloud on either Windows or Linux machines, which is a unique capability. It provides the Hadoop Distributed File System (HDFS) for reliable data storage and uses the MapReduce programming model to process, analyze, and report on data stored in distributed file systems. Because it is a managed offering, an enterprise can be up and running within a few hours with a fully configured Hadoop cluster and other Hadoop ecosystem components, such as HBase, Apache Spark, and Apache Storm.
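To make the MapReduce programming model mentioned above more concrete, here is a minimal sketch of a word count in plain Python. It is not the Hadoop or HDInsight API: it simulates the map, shuffle, and reduce phases in a single local process, and the function names (map_phase, shuffle_phase, reduce_phase, word_count) are hypothetical names chosen for illustration. On a real cluster, the framework would distribute these phases across many nodes and read the input from a distributed file system.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle_phase(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by key.

    In Hadoop the framework does this between map and reduce;
    here it is a simple in-memory grouping.
    """
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    sample = ["big data on azure", "hadoop on azure", "big clusters"]
    print(word_count(sample))
```

The point of the split is that each map call looks at only one line and each reduce call looks at only one key, so both phases can run in parallel on separate machines without sharing state.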