Big Data Analytics for Network Congestion Management Using Flow-Based Analysis
Because of the explosive growth in traffic volume, it is hard to accumulate Internet traffic on a single machine. This paper presents a Hadoop-based traffic analysis system that accepts input from multiple data traces. Hadoop provides scalable data processing and storage services on a distributed computing system. The system takes large trace files generated by a traffic measurement tool such as Wireshark and identifies the flows running on the network from these files. Flow characteristics describe the pattern of network traffic and help network operators with capacity planning, traffic engineering, and fault handling. The main objective is to design and implement a traffic flow identification system using Hadoop. Such a system should help network administrators monitor faults and plan for the future.
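As a rough illustration of flow identification, the sketch below groups packet records by the conventional 5-tuple (source address, source port, destination address, destination port, protocol) in a map step and summarizes each group in a reduce step. It is plain Python that only mimics the MapReduce pattern; it is not the system described in the paper. The record field names (`src_ip`, `src_port`, `dst_ip`, `dst_port`, `proto`, `ts`, `length`) and the function names are assumptions made for this example, and a real deployment would parse the trace files and run the two steps as Hadoop jobs.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple

# Assumed flow key: (src_ip, src_port, dst_ip, dst_port, proto).
FlowKey = Tuple[str, int, str, int, str]


def map_packet(pkt: Dict) -> Tuple[FlowKey, Tuple[float, int]]:
    """Map step: emit (flow key, (timestamp, packet length)) for one packet."""
    key = (pkt["src_ip"], pkt["src_port"],
           pkt["dst_ip"], pkt["dst_port"], pkt["proto"])
    return key, (pkt["ts"], pkt["length"])


def reduce_flow(key: FlowKey, values: List[Tuple[float, int]]) -> Dict:
    """Reduce step: summarize all packets that share one flow key."""
    timestamps = [ts for ts, _ in values]
    return {
        "flow": key,
        "packets": len(values),
        "bytes": sum(length for _, length in values),
        "duration": max(timestamps) - min(timestamps),
    }


def identify_flows(packets: Iterable[Dict]) -> List[Dict]:
    """Group packets by flow key (the shuffle), then reduce each group."""
    groups: Dict[FlowKey, List[Tuple[float, int]]] = defaultdict(list)
    for pkt in packets:
        key, value = map_packet(pkt)
        groups[key].append(value)
    return [reduce_flow(key, values) for key, values in groups.items()]
```

Calling `identify_flows` on a list of packet dictionaries returns one summary per flow, with packet count, byte count, and duration. These are the kind of flow characteristics an operator might use for capacity planning or fault handling.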
Keywords: Traffic analysis, Hadoop, Wireshark
The author wishes to thank the guide for their moral support.
- 1. Lee Y. Toward scalable Internet traffic measurement and analysis with Hadoop. ACM SIGCOMM Computer Communication Review, vol. 43; 2013, p. 5–13.
- 2. Lee Y, Kang W. A Hadoop-based packet trace processing tool. In: Traffic monitoring and analysis. Springer; 2011, p. 51–63.
- 3. Lee Y, Kang W, Son H. An Internet traffic analysis method with MapReduce. In: Network operations and management symposium workshops; 2010, p. 357–61.
- 4. Qian L, Wu B, Zhang RW. Characterization of 3G data-plane traffic and application towards centralized control and management for software defined networking. In: Big data (BigData Congress); 2013, p. 278–85.