Programming Internals of Scalding and Spark

Srinivasa, K.G.; Muppalla, Anil Kumar

doi:10.1007/978-3-319-13497-0_4


Chapter
First Online: 01 January 2015

Part of the book series: Computer Communications and Networks (CCN)

Abstract

Scalding is a Scala-based library built on top of Cascading, a Java library that provides an abstraction over the low-level Hadoop API. It is comparable to Pig, but brings the advantages of Scala to building MapReduce jobs [1].

References

Scala, "The Scala Programming Language," 2002. [Online]. Available: http://www.scalalang.org/.
Twitter, "Scalding," 2011. [Online]. Available: https://github.com/twitter/scalding.
Wensel, C. K., "Cascading: Defining and executing complex and fault tolerant data processing workflows on a Hadoop cluster," 2008.
Cascading, "Cascading: Application Platform for Enterprise Big Data." [Online]. Available: http://www.cascading.org/
Zaharia, M., et al., "Spark: Cluster computing with working sets," in Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing, 2010.
Hindman, B., Konwinski, A., Zaharia, M., and Stoica, I., "A common substrate for cluster computing," in Workshop on Hot Topics in Cloud Computing (HotCloud), 2009.
Apache Spark. [Online]. Available: http://spark.incubator.apache.org/docs/latest/

Author information

Authors and Affiliations

M.S. Ramaiah Institute of Technology, Bangalore, Karnataka, India
K.G. Srinivasa & Anil Kumar Muppalla

Corresponding author

Correspondence to K.G. Srinivasa.

About this chapter

Cite this chapter

Srinivasa, K., Muppalla, A.K. (2015). Programming Internals of Scalding and Spark. In: Guide to High Performance Distributed Computing. Computer Communications and Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-13497-0_4

DOI: https://doi.org/10.1007/978-3-319-13497-0_4
Published: 10 February 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13496-3
Online ISBN: 978-3-319-13497-0
eBook Packages: Computer Science, Computer Science (R0)
