Abstract
HBase is a top Apache open-source project that separated from Hadoop. As it has most of the features of Google’s BigTable system and is implemented in Java, it is very popular in days of massive data. HBase’s advantages are reflected in the massive data read and query. Loading huge amounts of data into HBase is the first step to use HBase. HBase itself has several methods to load data, and different methods have different application scenarios. This article made an exhaustive study and a performance testing of them. Also, this article achieved the custom loading data, and experiments show that it has good efficiency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Dean, J., & Ghemawat, S. (2008). MapReduce: Simplified data processing on large clusters. Communications of the ACM, 51(1), 107–113.
Chang, F., Dean, J., Ghemawat, S., Hsieh, W. C., Wallach, D. A., Burrows, M., et al. (2008). Bigtable: A distributed storage system for structured data. ACM Transactions on Computer Systems (TOCS), 26(2), 4.
White, T. Hadoop: The definitive guide [M]. O’Reilly Media, Inc.,1005 Gravenstein Highway North, Sebastopol, CA95472, 2012.
George, L. HBase: The definitive guide [M]. O’Reilly Media, Inc.,1005 Gravenstein Highway North, Sebastopol, CA95472, 2011.
Huang, J., Ouyang, X., Jose, J., Wasi-ur-Rahman Md., Wang, H., Luo, M., et al. (2012). High-performance design of HBase with RDMA over infiniBand. In Proceedings of the 2012 I.E. 26th International Parallel and Distributed Processing Symposium, IPDPS 2012 (pp. 774–778). Washington, DC: IEEE Computer Society.
Li, C. (2010). Transforming relational database into HBase: A case study. In Proceedings 2010 I.E. International Conference on Software Engineering and Service Sciences, ICSESS 2010 (pp. 683–687). Piscataway, NJ: IEEE Computer Society.
Vora, M. N. (2011). Hadoop-HBase for large-scale data. In Proceedings of 2011 International Conference on Computer Science and Network Technology, ICCSNT 2011 (pp. 601–605). Piscataway, NJ: IEEE Computer Society.
Carstoiu, D., Cernian, A., & Olteanu, A. (2010). Hadoop hbase-0.20. 2 performance evaluation. In NISS2010 – 4th International Conference on New Trends in Information Science and Service Science (pp. 84–87). Piscataway, NJ: IEEE Computer Society.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Yang, J., Feng, X. (2014). Loading Data into HBase. In: Wong, W.E., Zhu, T. (eds) Computer Engineering and Networking. Lecture Notes in Electrical Engineering, vol 277. Springer, Cham. https://doi.org/10.1007/978-3-319-01766-2_31
Download citation
DOI: https://doi.org/10.1007/978-3-319-01766-2_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01765-5
Online ISBN: 978-3-319-01766-2
eBook Packages: EngineeringEngineering (R0)