Working with HBase
Previous chapters explored how to leverage an HDInsight cluster to store and process big data. You learned how MapReduce jobs process data. Also, you looked at Hive and Pig, and learned how they make it easy to work with data. All the technologies and tools that you saw so far work in batch mode. And they are accepted in online analytical processing (OLAP) scenarios where it is supposed to take time. But you cannot always use batch processing. What if you want a low-latency database that provides near real-time read/write access, and quick random access to your big data in Hadoop? This is where Apache HBase comes into the picture.