The Emergence of Modified Hadoop Online-Based MapReduce Technology in Cloud Environments

Allayear, Shaikh Muhammad; Salahuddin, Md.; Hossain, Delwar; Park, Sung Soon

doi:10.1007/978-3-319-20233-4_8

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 8991)

Included in the following conference series:

Workshop on Big Data Benchmarks

Abstract

The exponential growth of data first presented challenges to cutting-edge businesses such as Google, Yahoo, Amazon, Microsoft, Facebook, and Twitter. Data volumes to be processed by cloud applications are growing much faster than computing power, and this growth demands new strategies for processing and analyzing information. Hadoop MapReduce has become a powerful computation model that addresses those problems. MapReduce is a programming model that enables easy development of scalable parallel applications to process vast amounts of data on large clusters. Through a simple interface with two functions, map and reduce, this model facilitates parallel implementation of many real-world tasks such as data processing for search engines and machine learning. Earlier versions of Hadoop MapReduce had several performance problems, such as the connection between map and reduce tasks, data overload, and slow processing. In this paper, we propose a modified MapReduce architecture, MapReduce Agent (MRA), that resolves those performance problems. MRA can reduce completion time, improve system utilization, and give better performance. MRA employs multi-connection, which resolves error recovery with a Q-chained load balancing system. In this paper, we also discuss various applications and implementations of the MapReduce programming model in cloud environments.
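The two-function interface mentioned in the abstract can be illustrated with a minimal single-process sketch. This is not the authors' MRA and does not use the Hadoop API; the names `map_words`, `reduce_counts`, and `run_mapreduce` are hypothetical and chosen only for illustration, and word counting is assumed as the example task.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, Iterator, List, Tuple


def map_words(line: str) -> Iterator[Tuple[str, int]]:
    """Map step (hypothetical): emit a (word, 1) pair for every word in a line."""
    for word in line.lower().split():
        yield word, 1


def reduce_counts(word: str, counts: Iterable[int]) -> Tuple[str, int]:
    """Reduce step (hypothetical): sum all counts emitted for one word."""
    return word, sum(counts)


def run_mapreduce(
    records: Iterable[str],
    mapper: Callable[[str], Iterator[Tuple[str, int]]],
    reducer: Callable[[str, Iterable[int]], Tuple[str, int]],
) -> Dict[str, int]:
    """Toy in-memory driver: map every record, group values by key, then reduce.

    A real cluster would run mappers and reducers on many machines and move
    the grouped data over the network; here everything runs in one process.
    """
    groups: Dict[str, List[int]] = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)  # the "shuffle": collect values per key
    return dict(reducer(key, values) for key, values in sorted(groups.items()))


result = run_mapreduce(
    ["big data on big clusters", "data on clouds"],
    map_words,
    reduce_counts,
)
print(result)
```

Because the driver only sees a mapper and a reducer, the same skeleton applies to other tasks by swapping those two functions, which is the property the abstract attributes to the programming model.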

This research (Grant No. 2013-140-10047118) was supported by the 2013 Industrial Technology Innovation Project funded by the Ministry of Science, ICT and Future Planning.

The source code for HOP can be downloaded from http://code.google.com/p/hop.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, East West University, Dhaka, Bangladesh
Shaikh Muhammad Allayear, Md. Salahuddin & Delwar Hossain
Anyang University, Anyang, South Korea
Sung Soon Park

Corresponding author

Correspondence to Shaikh Muhammad Allayear.

Editor information

Editors and Affiliations

University of Toronto, Toronto, Ontario, Canada
Tilmann Rabl
SAP SE, Köln, Germany
Kai Sachs
Server Technologies, Oracle Corporation, Redwood Shores, California, USA
Meikel Poess
University of California at San Diego, La Jolla, CA, USA
Chaitanya Baru
Middleware Systems Research Group, Toronto, Ontario, Canada
Hans-Arno Jacobson

About this paper

Cite this paper

Allayear, S.M., Salahuddin, M., Hossain, D., Park, S.S. (2015). The Emergence of Modified Hadoop Online-Based MapReduce Technology in Cloud Environments. In: Rabl, T., Sachs, K., Poess, M., Baru, C., Jacobson, HA. (eds) Big Data Benchmarking. WBDB 2014. Lecture Notes in Computer Science, vol 8991. Springer, Cham. https://doi.org/10.1007/978-3-319-20233-4_8

DOI: https://doi.org/10.1007/978-3-319-20233-4_8
Published: 14 June 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-20232-7
Online ISBN: 978-3-319-20233-4
eBook Packages: Computer Science, Computer Science (R0)