Abstract
Real-time processing of events plays an important role in data analytics. Recent advances in Internet have lead to the rise of social media with massively large data generated in real time. Analysis of such data leads to interesting scenarios where some of the business decisions can be made. In the previous chapters, data analytics with batch processing and primitive datasets were discussed. Apache Flume is one of the tools in the Hadoop ecosystem that provides a platform for real-time data analytics. In this chapter, an overview of Apache Flume and its architectural components with workflow is discussed. Later, the configuration of Flume with Twitter social network is discussed as an example for real-time analytics.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Liu, X., Iftikhar, N., & Xie, X. (2014, July). Survey of real-time processing systems for big data. In Proceedings of the 18th International Database Engineering and Applications Symposium (pp. 356–361). ACM.
Wang, C., Rayan, I. A. & Schwan, K. (2012). Faster, larger, easier: Reining realtime big data processing in cloud. In Proceedings of the Posters and Demo Track (p. 4). ACM.
Ranjan, R. (2014). Streaming big data processing in datacenter clouds. IEEE Cloud Computing, 1(1), 78–83.
Lin, J., & Kolcz, A. (2012). Large-scale machine learning at twitter. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data (pp. 793–804). ACM.
Khuc, V. N., Shivade, C., Ramnath, R., & Ramanathan, J. (2012). Towards building large-scale distributed systems for twitter sentiment analysis. In Proceedings of the 27th annual ACM symposium on applied computing (pp. 459–464). ACM.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this chapter
Cite this chapter
Srinivasa, K.G., G. M., S., H., S. (2018). Apache Flume. In: Network Data Analytics. Computer Communications and Networks. Springer, Cham. https://doi.org/10.1007/978-3-319-77800-6_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-77800-6_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-77799-3
Online ISBN: 978-3-319-77800-6
eBook Packages: Computer ScienceComputer Science (R0)