Abstract
In 2004, Google introduced the MapReduce framework as a simple and powerful programming model that enables the easy development of scalable parallel applications to process vast amounts of data on large clusters of commodity machines (Dean and Ghemawat, OSDI, 2004, [20]). In particular, the implementation described in the original paper is designed primarily to achieve high performance on large clusters of commodity PCs. One of the main advantages of this approach is that it isolates the application from the details of running a distributed program, such as data distribution, scheduling, and fault tolerance. In this model, the computation takes a set of key-value pairs as input and produces a set of key-value pairs as output.
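The key-value model described above can be illustrated with the canonical word-count example from the original paper. The sketch below is a minimal, single-process simulation: `map_fn`, `reduce_fn`, and `map_reduce` are illustrative names chosen here, and the in-memory grouping stands in for the distributed shuffle that a real MapReduce framework performs transparently.

```python
from collections import defaultdict
from typing import Iterable, Iterator, Tuple

# Map phase: emit an intermediate (key, value) pair for each word.
def map_fn(document: str) -> Iterator[Tuple[str, int]]:
    for word in document.split():
        yield (word, 1)

# Reduce phase: combine all values that share the same key.
def reduce_fn(key: str, values: Iterable[int]) -> Tuple[str, int]:
    return (key, sum(values))

def map_reduce(documents):
    # Group intermediate pairs by key; in a real distributed run the
    # framework's shuffle stage does this across machines.
    groups = defaultdict(list)
    for doc in documents:
        for key, value in map_fn(doc):
            groups[key].append(value)
    return dict(reduce_fn(k, vs) for k, vs in groups.items())

counts = map_reduce(["big data", "big clusters"])
# counts == {"big": 2, "data": 1, "clusters": 1}
```

Because the user supplies only the map and reduce functions, the framework is free to partition the input, schedule the map and reduce tasks across machines, and re-execute failed tasks without any change to the application code.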
© 2016 The Author(s)
Sakr, S. (2016). General-Purpose Big Data Processing Systems. In: Big Data 2.0 Processing Systems. SpringerBriefs in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-319-38776-5_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-38775-8
Online ISBN: 978-3-319-38776-5