Abstract
Big data becomes a hot topic. MapReduce is a popular programming paradigm for big data analysis with many benefits. Even though it has widely applications in industry, MapReduce still has limitations in some applications. For these limitations, some extensions have been proposed. In these brief communications, we discuss the benefits and limitations of MapReduce programming paradigm and also its extensions to make MapReduce go beyond the limitations.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Rajaraman, A., Ullman, J.D.: Mining of Massive Datasets. Posts & Telecom Press, Beijing (2012)
Phan, T.-C., d’Orazio, L., Rigaux, P.: Toward intersection filter-based optimization for joins in MapReduce. In: Cloud-I, p. 2 (2013)
Tao, Y., Lin, W., Xiao, X.: Minimal MapReduce algorithms. In: SIGMOD Conference, pp. 529–540 (2013)
Zhang, Y., Chen, S.: i2MapReduce: incremental iterative MapReduce. In: Cloud-I, p. 3 (2013)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30, 107–117 (1998)
Avriel, M.: Nonlinear Programming: Analysis and Methods. Courier Dover Publications, Mineola (2003)
Baluja, S., Seth, R., Sivakumar, D., Jing, Y., Yagnik, J., Kumar, S., Ravichandran, D., Aly, M.: Video suggestion and discovery for youtube: taking random walks through the view graph. In: Proceedings of the WWW 2008, pp. 895–904 (2008)
Liben-Nowell, D., Kleinberg, J.M.: The link-prediction problem for social networks. JASIST 58(7), 1019–1031 (2007)
Dean, J., Ghemawat, S.: Mapreduce: simplified data processing on large clusters. In: Proceedings of the OSDI 2004 (2004)
Bloom, B.H.: Space/time trade-offs in hash coding with allowable errors. Commun. ACM 13(7), 422–426 (1970)
Borkar, V.R., Carey, M.J., Grover, R., Onose, N., Vernica, R.: Hyracks: a flexible and extensible foundation for data-intensive computing. In: ICDE, pp. 1151–1162 (2011)
Jiang, D., Chen, G., Ooi, B.C., Tan, K.-L., Wu, S.: epiC: an extensible and scalable system for processing big data. PVLDB 7(7), 541–552 (2014)
Acknowledgments
This paper was partially supported by National Sci-Tech Support Plan 2015BAH10F01 and NSFC grant U1509216, 61472099, 61133002 and the Scientific Research Foundation for the Returned Overseas Chinese Scholars of Heilongjiang Provience LC2016026.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer Science+Business Media Singapore
About this paper
Cite this paper
Song, Y., Wang, H., Li, J., Gao, H. (2016). MapReduce for Big Data Analysis: Benefits, Limitations and Extensions. In: Che, W., et al. Social Computing. ICYCSEE 2016. Communications in Computer and Information Science, vol 623. Springer, Singapore. https://doi.org/10.1007/978-981-10-2053-7_40
Download citation
DOI: https://doi.org/10.1007/978-981-10-2053-7_40
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-2052-0
Online ISBN: 978-981-10-2053-7
eBook Packages: Computer ScienceComputer Science (R0)