Abstract
Hadoop 2.0 allows a developer to plug in to other frameworks. A large number of frameworks have developed around data stored in the HDFS, and some have been covered in this book (HAMA and Spark, for example). Some of these frameworks were developed to overcome the limitations of using MapReduce for all types of problems. For example, the key limitation of MapReduce is that each MapReduce phase reads and writes data to the HDFS, so iterative algorithms run several times slower in MapReduce. Each iteration is a separate MapReduce job that reads the output of the earlier iteration’s MapReduce job from the HDFS. Frameworks such as HAMA and Spark were developed to address these limitations. They leveraged the HDFS, but existed as separate frameworks that had their own resource management capabilities. YARN or Hadoop 2.0 allows these frameworks to be integrated into the larger Hadoop framework, which centralizes resource management across all types of jobs in the cluster.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2014 Sameer Wadkar and Madhu Siddalingaiah
About this chapter
Cite this chapter
Wadkar, S., Siddalingaiah, M. (2014). Building a YARN Application. In: Pro Apache Hadoop. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4302-4864-4_17
Download citation
DOI: https://doi.org/10.1007/978-1-4302-4864-4_17
Published:
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4302-4863-7
Online ISBN: 978-1-4302-4864-4
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)