Data Migration and Analytics

  • Vivek Mishra
In the previous chapter, we discussed the benefits of and requirements for running batch analytics over Cassandra via Hadoop MapReduce. We can easily utilize Hadoop MapReduce's pluggable architecture to implement MapReduce even with custom implementations.  Let’s talk about a few of the published use cases as follows:
  • NetApp collects and analyzes system diagnostic-related data for improving the quality of client site deployed systems.

  • The leading health insurance provider collects and processes millions of claims per day. Data ingestion is approximately 1 TB per day.

  • Nokia collects and analyzes data related to various mobile phones. The expected data volume to deal with is about 600 TB with both structured and unstructured forms of data.

  • Etsy, an online marketplace for handmade items, needs to analyze billions of logs for behavior targeting and building search-based recommendations.

Storage of large amounts of data is a significant issue, and with Cassandra we can achieve faster...


File System Data Migration NoSQL Database MapReduce Program Hadoop MapReduce 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Vivek Mishra 2014

Authors and Affiliations

  • Vivek Mishra
    • 1
  1. 1.KalyanpurIndia

Personalised recommendations