Skip to main content

Loading Data into HBase

  • Conference paper
  • First Online:
Computer Engineering and Networking

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 277))

Abstract

HBase is a top Apache open-source project that separated from Hadoop. As it has most of the features of Google’s BigTable system and is implemented in Java, it is very popular in days of massive data. HBase’s advantages are reflected in the massive data read and query. Loading huge amounts of data into HBase is the first step to use HBase. HBase itself has several methods to load data, and different methods have different application scenarios. This article made an exhaustive study and a performance testing of them. Also, this article achieved the custom loading data, and experiments show that it has good efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dean, J., & Ghemawat, S. (2008). MapReduce: Simplified data processing on large clusters. Communications of the ACM, 51(1), 107–113.

    Article  Google Scholar 

  2. Chang, F., Dean, J., Ghemawat, S., Hsieh, W. C., Wallach, D. A., Burrows, M., et al. (2008). Bigtable: A distributed storage system for structured data. ACM Transactions on Computer Systems (TOCS), 26(2), 4.

    Article  Google Scholar 

  3. White, T. Hadoop: The definitive guide [M]. O’Reilly Media, Inc.,1005 Gravenstein Highway North, Sebastopol, CA95472, 2012.

    Google Scholar 

  4. George, L. HBase: The definitive guide [M]. O’Reilly Media, Inc.,1005 Gravenstein Highway North, Sebastopol, CA95472, 2011.

    Google Scholar 

  5. Huang, J., Ouyang, X., Jose, J., Wasi-ur-Rahman Md., Wang, H., Luo, M., et al. (2012). High-performance design of HBase with RDMA over infiniBand. In Proceedings of the 2012 I.E. 26th International Parallel and Distributed Processing Symposium, IPDPS 2012 (pp. 774–778). Washington, DC: IEEE Computer Society.

    Google Scholar 

  6. Li, C. (2010). Transforming relational database into HBase: A case study. In Proceedings 2010 I.E. International Conference on Software Engineering and Service Sciences, ICSESS 2010 (pp. 683–687). Piscataway, NJ: IEEE Computer Society.

    Google Scholar 

  7. Vora, M. N. (2011). Hadoop-HBase for large-scale data. In Proceedings of 2011 International Conference on Computer Science and Network Technology, ICCSNT 2011 (pp. 601–605). Piscataway, NJ: IEEE Computer Society.

    Google Scholar 

  8. Carstoiu, D., Cernian, A., & Olteanu, A. (2010). Hadoop hbase-0.20. 2 performance evaluation. In NISS2010 – 4th International Conference on New Trends in Information Science and Service Science (pp. 84–87). Piscataway, NJ: IEEE Computer Society.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Juan Yang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Yang, J., Feng, X. (2014). Loading Data into HBase. In: Wong, W.E., Zhu, T. (eds) Computer Engineering and Networking. Lecture Notes in Electrical Engineering, vol 277. Springer, Cham. https://doi.org/10.1007/978-3-319-01766-2_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-01766-2_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-01765-5

  • Online ISBN: 978-3-319-01766-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics