Abstract
The recent rapid increase in the amount of data to be processed has led to the increased use of dispersed parallel processing of large-scale data analysis using open-source Hadoop’s MapReduce framework. The large-data processing method proposed by Google and Hadoop which implemented this are representative dispersed parallel processing methods, and the data are dispersedly saved on the HDFS(Hadoop Distributed File System). Such HDFS uses its own indexing technique when it comes to searching specific values from the saved files. Techniques that use conventional index, however, leads to problems like reduced search performance by not considering update and saving index in the disc. Therefore, the paper proposes effective DB indexing technique on Hadoop-based database.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Ryu, H.-S., Choi, H.-S., Son, J.-H., Chung, Y.-D.: An Implementation of a BST index on a relational data warehouse system based on hadoop cloud. In: Proceeding of Korea Information Science Society, pp. 10–12 (2012)
Lee, H.-J., Kim, T.-H.: A MapReduce-based kNN join query processing algorithm for analyzing large-scale data. J. Korea Inf. Sci. Soc. 42, 504–511 (2015)
Kim, D.-M., Choi, J.-W., Woo, C.-W.: A design and development of big data indexing and search system using lucene. J. Internet Comput. Serv. 15, 107–115 (2014)
Park, J.-H., Bok, K.-S., Yoo, J.-S.: Big data parallel processing technology trend. Commun. Korean Inst. Inf. Sci. Eng. 32, 18–26 (2014)
Park, H.-J., Gwon, Y.-H., An, Y.-M.: Big data and big data refinement technology. Korean Soc. Comput. Inf. Rev. 21, 1–8 (2013)
Oh, H.-J., Yun, B.-H., Choi, N.-H., Yoo, C.-J., Kim, Y.: Visualization for preferred locations and moving patterns according to user groups based on contents analysis in social big data. J. Korean Inst. Inf. Technol. 12, 195–203 (2014)
Kim, C.-S.: Hadoop based spatial bigdata index creation and processing. In: Proceeding of Korea Information Science Society, pp. 87–89 (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shim, JS., Jang, YH., Ju, YW., Park, SC. (2018). Design of Effective Indexing Technique in Hadoop-Based Database. In: Park, J., Loia, V., Yi, G., Sung, Y. (eds) Advances in Computer Science and Ubiquitous Computing. CUTE CSA 2017 2017. Lecture Notes in Electrical Engineering, vol 474. Springer, Singapore. https://doi.org/10.1007/978-981-10-7605-3_15
Download citation
DOI: https://doi.org/10.1007/978-981-10-7605-3_15
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-7604-6
Online ISBN: 978-981-10-7605-3
eBook Packages: EngineeringEngineering (R0)