Skip to main content

Building a YARN Application

  • Chapter
  • First Online:
Pro Apache Hadoop
  • 3482 Accesses

Abstract

Hadoop 2.0 allows a developer to plug in to other frameworks. A large number of frameworks have developed around data stored in the HDFS, and some have been covered in this book (HAMA and Spark, for example). Some of these frameworks were developed to overcome the limitations of using MapReduce for all types of problems. For example, the key limitation of MapReduce is that each MapReduce phase reads and writes data to the HDFS, so iterative algorithms run several times slower in MapReduce. Each iteration is a separate MapReduce job that reads the output of the earlier iteration’s MapReduce job from the HDFS. Frameworks such as HAMA and Spark were developed to address these limitations. They leveraged the HDFS, but existed as separate frameworks that had their own resource management capabilities. YARN or Hadoop 2.0 allows these frameworks to be integrated into the larger Hadoop framework, which centralizes resource management across all types of jobs in the cluster.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 34.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Sameer Wadkar and Madhu Siddalingaiah

About this chapter

Cite this chapter

Wadkar, S., Siddalingaiah, M. (2014). Building a YARN Application. In: Pro Apache Hadoop. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4302-4864-4_17

Download citation

Publish with us

Policies and ethics