Real-Time Analytics with Storm

Yadav, Vinit

doi:10.1007/978-1-4842-2869-2_7

Vinit Yadav²

Abstract

So far, you’ve seen how to work with batch data processing in Hadoop. Batch processing is used with data at rest. You typically generate a report at the end of the day. MapReduce, Hive, and HBase all help in implementing batch processing tasks. But there is another kind of data, which is in constant motion, called streams. To process such data, you need a real-time processing engine. A constant stream of click data for a campaign, user activity data, server logs, IoT, and sensor data—in all of these scenarios, data is constantly coming in and you need to process them in real time, perhaps within a window of time. Apache Storm is very well suited for real-time stream analytics. Storm is a distributed, fault-tolerant, open source computation system that processes data in real time and works on top of Hadoop.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 34.99; Price excludes VAT (USA)

Softcover Book: USD 44.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and Affiliations

Ahmedabad, Gujarat, India
Vinit Yadav

Authors

Vinit Yadav
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Yadav, V. (2017). Real-Time Analytics with Storm. In: Processing Big Data with Azure HDInsight. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-2869-2_7

Download citation

DOI: https://doi.org/10.1007/978-1-4842-2869-2_7
Published: 30 May 2017
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-4842-2868-5
Online ISBN: 978-1-4842-2869-2
eBook Packages: Professional and Applied ComputingApress Access BooksProfessional and Applied Computing (R0)

Publish with us

Policies and ethics