Skip to main content

The Hadoop Ecosystem

Disrupting from Below

  • Chapter
  • First Online:

Abstract

In this chapter, we cover basic principles of Hadoop and its ecosystem; the economics of Hadoop; an introduction to NoSQL datastores; and a review of analytics in Hadoop.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   29.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   37.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    http://sortbenchmark.org/YahooHadoop.pdf

  2. 2.

    http://blog.cloudera.com/blog/2009/05/5-common-questions-about-hadoop/

  3. 3.

    http://wikibon.com/hadoop-nosql-software-and-services-market-forecast-2013-2017/

  4. 4.

    http://www.enterprisetech.com/2014/10/29/hadoop-finds-place-enterprise/

  5. 5.

    http://www.enterprisetech.com/2013/11/08/cluster-sizes-reveal-hadoop-maturity-curve/

  6. 6.

    http://www.gartner.com/newsroom/id/3051717

  7. 7.

    http://www.networkworld.com/article/3024812/big-data-business-intelligence/the-top-5-hadoop-distributions-according-to-forrester.html

  8. 8.

    http://wiki.apache.org/hadoop/Defining%20Hadoop

  9. 9.

    Apache Hadoop distributed 6 releases in 2011; 13 in 2012; 15 in 2013; 8 in 2014; and 5 in 2015.

  10. 10.

    http://hortonworks.com/blog/stinger-next-enterprise-sql-hadoop-scale-apache-hive/

  11. 11.

    http://www.statslice.com/hadoop-business-case-a-cost-effective-queryable-data-archivestorage-platform

  12. 12.

    http://blogs.teradata.com/data-points/how-illy-is-cost-per-terabyte/

  13. 13.

    http://db-engines.com/en/ranking

  14. 14.

    http://siliconangle.com/blog/2015/03/11/wikibon-view-open-source-nosql-database-vendors-face-a-long-hard-slog/

  15. 15.

    http://www.jaspersoft.com/press/jaspersoft-announces-new-hadoop-based-big-data-analytics-solution

  16. 16.

    http://www.idevnews.com/stories/4429/Pentaho-Ships-BI-Analytics-Tools-for-Hadoop-Cloud

  17. 17.

    http://www.infoworld.com/article/2616959/big-data/7-top-tools-for-taming-big-data.html

  18. 18.

    https://github.com/RevolutionAnalytics/RHadoop/wiki

  19. 19.

    https://github.com/RevolutionAnalytics/RHadoop/graphs/contributors

  20. 20.

    https://github.com/saptarshiguha/RHIPE/

  21. 21.

    http://www.quora.com/How-much-time-did-it-take-to-develop-Hive-at-Facebook

  22. 22.

    https://hadoop.apache.org/

  23. 23.

    http://hive.apache.org/people.html

  24. 24.

    http://www.slideshare.net/hortonworks/hive-on-spark-is-blazing-fast-or-is-it-final

  25. 25.

    http://blog.cloudera.com/blog/2016/04/cloudera-enterprise-5-7-is-released/

  26. 26.

    https://www.openhub.net/p/Hive

  27. 27.

    https://developer.yahoo.com/blogs/hadoop/pig-road-efficient-high-level-language-hadoop-413.html

  28. 28.

    https://developer.yahoo.com/blogs/hadoop/pig-incubation-apache-software-foundation-393.html

  29. 29.

    http://www.ibm.com/developerworks/library/j-mahout/

  30. 30.

    https://papers.nips.cc/paper/3150-map-reduce-for-machine-learning-on-multicore.pdf

  31. 31.

    http://www.ibm.com/developerworks/library/j-mahout/

  32. 32.

    http://dl.acm.org/citation.cfm?id=1807184

  33. 33.

    http://www.vldb.org/pvldb/vol8/p1804-ching.pdf

  34. 34.

    https://www.facebook.com/notes/facebook-engineering/scaling-apache-giraph-to-a-trillion-edges/10151617006153920

  35. 35.

    https://www.openhub.net/p/Giraph

  36. 36.

    http://techcrunch.com/2010/04/13/datameer-raises-2-5-million-for-apache-hadoop-based-analytics-platform/

  37. 37.

    http://techcrunch.com/2015/08/18/datameer-bags-40m-round-led-by-singapore-investment-firm/

  38. 38.

    http://www.infoq.com/news/2013/01/Phoenix-HBase-SQL

  39. 39.

    https://developer.salesforce.com/blogs/developer-relations/2014/05/apache-phoenix-small-step-big-data.html

  40. 40.

    http://www.zdnet.com/clouderas-impala-brings-hadoop-to-sql-and-bi-7000006413/

  41. 41.

    http://blog.cloudera.com/blog/2013/05/cloudera-impala-1-0-its-here-its-real-its-already-the-standard-for-sql-on-hadoop/

  42. 42.

    http://www.vldb.org/pvldb/vol7/p1295-floratou.pdf

  43. 43.

    http://blog.cloudera.com/blog/2014/09/new-benchmarks-for-sql-on-hadoop-impala-1-4-widens-the-performance-gap/

  44. 44.

    http://research.google.com/pubs/pub36632.html

  45. 45.

    https://www.openhub.net/p/incubator-drill

  46. 46.

    https://www.facebook.com/notes/facebook-engineering/presto-interacting-with-petabytes-of-data-at-facebook/10151786197628920

  47. 47.

    https://code.facebook.com/posts/370832626374903/even-faster-data-at-the-speed-of-presto-orc/

  48. 48.

    http://www.teradata.com/News-Releases/2015/Teradata-Launches-First-Enterprise-Support-for-Presto/?LangType=1033&LangSelect=true

  49. 49.

    http://money.cnn.com/news/newsfeeds/articles/prnewswire/CL12937.htm

  50. 50.

    https://www.openhub.net/p/facebookpresto

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Thomas W. Dinsmore

About this chapter

Cite this chapter

Dinsmore, T.W. (2016). The Hadoop Ecosystem. In: Disruptive Analytics. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-1311-7_4

Download citation

Publish with us

Policies and ethics