Name: Pro Apache Hadoop
ISBN: 978-1-4302-4864-4

Overview

Authors:

Sameer Wadkar,
Madhu Siddalingaiah


Pro Apache Hadoop, Second Edition brings you up to speed on Hadoop, the framework of big data, and helps you build resilient, reliable compute clusters that can analyze large volumes of data quickly.



Table of contents (20 chapters)

Front Matter

Pages i-xxvi

Motivation for Big Data
- Sameer Wadkar, Madhu Siddalingaiah
Pages 1-10
Hadoop Concepts
- Sameer Wadkar, Madhu Siddalingaiah
Pages 11-30
Getting Started with the Hadoop Framework
- Sameer Wadkar, Madhu Siddalingaiah
Pages 31-46
Hadoop Administration
- Sameer Wadkar, Madhu Siddalingaiah
Pages 47-72
Basics of MapReduce Development
- Sameer Wadkar, Madhu Siddalingaiah
Pages 73-106
Advanced MapReduce Development
- Sameer Wadkar, Madhu Siddalingaiah
Pages 107-150
Hadoop Input/Output
- Sameer Wadkar, Madhu Siddalingaiah
Pages 151-183
Testing Hadoop Programs
- Sameer Wadkar, Madhu Siddalingaiah
Pages 185-202
Monitoring Hadoop
- Sameer Wadkar, Madhu Siddalingaiah
Pages 203-215
Data Warehousing Using Hadoop
- Sameer Wadkar, Madhu Siddalingaiah
Pages 217-239
Data Processing Using Pig
- Sameer Wadkar, Madhu Siddalingaiah
Pages 241-269
HCatalog and Hadoop in the Enterprise
- Sameer Wadkar, Madhu Siddalingaiah
Pages 271-282
Log Analysis Using Hadoop
- Sameer Wadkar, Madhu Siddalingaiah
Pages 283-291
Building Real-Time Systems Using HBase
- Sameer Wadkar, Madhu Siddalingaiah
Pages 293-323
Data Science with Hadoop
- Sameer Wadkar, Madhu Siddalingaiah
Pages 325-342
Hadoop in the Cloud
- Sameer Wadkar, Madhu Siddalingaiah
Pages 343-356
Building a YARN Application
- Sameer Wadkar, Madhu Siddalingaiah
Pages 357-379
Installing Hadoop
- Sameer Wadkar, Madhu Siddalingaiah
Pages 381-390
Using Maven with Eclipse
- Sameer Wadkar, Madhu Siddalingaiah
Pages 391-398

About this book

Pro Apache Hadoop, Second Edition brings you up to speed on Hadoop – the framework of big data. Revised for Hadoop 2.0, the book covers the latest developments such as YARN (aka MapReduce 2.0), new HDFS high-availability features, and increased scalability in the form of HDFS Federation. The existing content has also been revised to reflect the current state of MapReduce, cluster design, the Hadoop Distributed File System, and more.

This book covers everything you need to build your first Hadoop cluster and begin analyzing and deriving value from your business and scientific data. Learn to solve big-data problems the MapReduce way: break a big problem into chunks and write small-scale solutions that can be distributed across thousands of nodes to analyze large data volumes in a short amount of wall-clock time. Hadoop takes care of distributing and parallelizing your software, so you can focus on the code.
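As a rough illustration of the "MapReduce way" described above, the following is a minimal single-process sketch in plain Python. It does not use Hadoop or any of the book's code; the function names (`map_chunk`, `shuffle`, `reduce_counts`, `word_count`) and the sample data are hypothetical and chosen only for this example. It shows the general shape of the idea: each chunk is processed independently, intermediate pairs are grouped by key, and each group is reduced to a result.

```python
from collections import defaultdict


def map_chunk(chunk):
    """Map step: emit a (word, 1) pair for every word in one chunk of text."""
    return [(word.lower(), 1) for word in chunk.split()]


def shuffle(pairs):
    """Group values by key, standing in for the framework's shuffle phase."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(grouped):
    """Reduce step: sum the values collected for each word."""
    return {word: sum(counts) for word, counts in grouped.items()}


def word_count(chunks):
    """Run map on each chunk independently, then shuffle and reduce."""
    pairs = []
    for chunk in chunks:  # on a real cluster, chunks would be mapped in parallel
        pairs.extend(map_chunk(chunk))
    return reduce_counts(shuffle(pairs))


# Hypothetical input: each string stands in for one chunk of a larger data set.
chunks = ["big data big clusters", "data nodes", "big nodes"]
result = word_count(chunks)
print(result)
```

Because `map_chunk` looks only at its own chunk, the map calls could run on separate machines without coordination. In this sketch they simply run one after another in a loop.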

Covers all that is new in Hadoop 2.0
Written by a professional involved in Hadoop since day one
Takes you quickly to the seasoned pro level on the hottest cloud-computing framework

About the authors

Jason Venner has more than 20 years of experience in software engineering, management, design, and coding. He has been a vice president, director, and consultant. Currently, his interests and expertise are in Java, Hadoop, cloud computing, and more. For more, visit www.prohadoopbook.com.

Bibliographic Information

Book Title: Pro Apache Hadoop
Authors: Sameer Wadkar, Madhu Siddalingaiah
DOI: https://doi.org/10.1007/978-1-4302-4864-4
Publisher: Apress Berkeley, CA
eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)
Copyright Information: Jason Venner and Sameer Wadkar and Madhu Siddalingaiah 2014
Softcover ISBN: 978-1-4302-4863-7
Softcover Published: 09 September 2014
eBook ISBN: 978-1-4302-4864-4
eBook Published: 18 September 2014
Edition Number: 2
Number of Pages: XXII, 444
Number of Illustrations: 70 b/w illustrations
Topics: Open Source, Data Mining and Knowledge Discovery
Industry Sectors: Electronics, IT & Software, Telecommunications
