Skip to main content

NoSQL Overview and Performance Testing of HBase Over Multiple Nodes with MySQL

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 813))

Abstract

The escalating amount of web-based applications in the fields of social networks, media, biology, physics, and the Internet of things are continuously generating large volume of data or Bigdata in terabytes, petabytes, and zetabytes over a short period of time. Consequently, an immense amount of read and write requests is generated without much latency. It is an immediate concern to store and analyze such huge amount of mixed ASCII and non-ASCII data efficiently, economically, and in no time. The conventional database systems like MySQL are incapable to handle such large volume of data in real time. At this point, there is a claim that column-based NoSQL databases like Accumulo, Cassandra, HBase, or document-based Apache CouchDB, Couchbase, MongoDB are capable of handling such huge data volume efficiently. In this work, we focussed on column-based Apache HBase, a NoSQL distributed database management system developed in the Bigdata domain on distributed file system architecture provided by Hadoop (HDFS). Let us begin the discussion on NoSQL HBase and the association between HBase and Hadoop. Then some of the important features of HBase are explained. After that, we discussed the advantages and limitations of HBase in distributed data processing over the other NoSQL database management systems. Finally, we performed some experiments to compare the time performance of HBase with traditional database MySQL as data size increases.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Bhupathiraju, V., Ravuri, R.P.: The dawn of big data-Hbase. In: 2014 Conference on IT in Business, Industry and Government (CSIBIG), pp. 1–4. IEEE, March 2014

    Google Scholar 

  2. Huang, S., et al.: Non-structure data storage technology: a discussion. In: 2012 IEEE/ACIS 11th International Conference on Computer and Information Science (ICIS), pp. 482–487. IEEE, May 2012

    Google Scholar 

  3. Vora, M.N.: Hadoop-HBase for large-scale data. In: 2011 International Conference on Computer Science and Network Technology (ICCSNT), vol. 1, pp. 601–605. IEEE, Dec 2011

    Google Scholar 

  4. www.techcrunch.com

  5. www.emc.com

  6. Yang, F., Cao, J., et al.: An evolutionary algorithm for column family schema optimization in HBase. In: 2015 IEEE First International Conference on Big Data Computing Service and Applications (BigDataService). IEEE (2015)

    Google Scholar 

  7. https://changeaas.com/2014/08/20/nosql-and-its-use-in-ceilometer/

  8. www.mail-archives.apache.org

  9. Naheman, W., et al.: Review of NoSQL databases and performance testing on HBase. In: Proceedings 2013 International Conference on Mechatronic Sciences, Electric Engineering and Computer (MEC). IEEE (2013)

    Google Scholar 

  10. Chang, F., Dean, J., et.al.: Bigtable: a distributed storage system for structured data. In: Seventh Symposium on Operating System Design and Implementation (OSDI), Seattle, WA, Usenix Association, Nov 2006

    Google Scholar 

  11. DeCandia, G., et al.: Dynamo: Amazon’s highly available key-value store. In: The Proceedings of the 21st ACM Symposium on Operating Systems Principles, Stevenson, WA, Oct 2007

    Google Scholar 

  12. www.cassandra.apache.org

  13. Judd, D.: Scale out with HyperTable. Linux Mag. (2008)

    Google Scholar 

  14. Anderson, J.C., et al.: CouchDB: The Definitive Guide, 1st edn. O’Reilly Media (2009). ISBN: 0596158165

    Google Scholar 

  15. www.project-voldemort.com

  16. http://memcachedb.org/memcachedb-guide-1.0.pdf

  17. Kristina, C., Michael, D.: MongoDB: The Definitive Guide, 1st edn. O’Reilly Media (2010). ISBN: 9781449381561

    Google Scholar 

  18. Saloustros, G., Magoutis, K.: Rethinking HBase: design and implementation of an elastic key-value store over log-structured local volumes. In: 2015 14th International Symposium on Parallel and Distributed Computing (ISPDC). IEEE (2015)

    Google Scholar 

  19. Chen, X., et al.: Spatio-temporal queries in HBase. In: 2015 IEEE International Conference on Big Data (Big Data). IEEE (2015)

    Google Scholar 

  20. Bao, X., et al.: HConfig: resource adaptive fast bulk loading in HBase. In: 2014 International Conference on Collaborative Computing: Networking, Applications and Work sharing (CollaborateCom). IEEE (2014)

    Google Scholar 

  21. Taylor, R.C.: An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics. BMC Bioinform. 11, Suppl. 12 (2010): S1

    Google Scholar 

  22. Srinivas, S., et al.: Security maturity in NoSQL databases-are they secure enough to haul the modern IT applications. In: 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI). IEEE (2015)

    Google Scholar 

  23. Xiao, Z., et al.: Remote sensing image database based on NOSQL database. In: 2011 19th International Conference on Geoinformatics. IEEE (2011)

    Google Scholar 

  24. Zdravevski, E., et al.: Row key designs of NoSQL database tables and their impact on write performance. In: 2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP). IEEE (2016)

    Google Scholar 

  25. Naheman, W., et al.: Review of NoSQL databases and performance testing on HBase. In: Proceedings 2013 International Conference on Mechatronic Sciences, Electric Engineering and Computer (MEC), pp. 2304–2309. IEEE (2013)

    Google Scholar 

  26. www.slideshare.net/kmstechnology

  27. www.hadoop.apache.org

  28. www.mapr.com/blog/in-depth-look-hbase-architecture

  29. Wei, D., et al.: Organizing and storing method for large-scale unstructured data set with complex content. In: 2014 Fifth International Conference on Computing for Geospatial Research and Application (COM. Geo). IEEE (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Swagata Paul .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Das, N., Paul, S., Sarkar, B.B., Chakrabarti, S. (2019). NoSQL Overview and Performance Testing of HBase Over Multiple Nodes with MySQL. In: Abraham, A., Dutta, P., Mandal, J., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 813. Springer, Singapore. https://doi.org/10.1007/978-981-13-1498-8_24

Download citation

Publish with us

Policies and ethics