Abstract
The unprecedented growth in mobile data usage is posing significant challenges to cellular operators. One key challenge is how to provide quality of service to subscribers when their residing cell is experiencing a significant amount of traffic, i.e. becoming a traffic hotspot. In this paper, we perform an empirical study on data hotspots in today’s cellular networks using a 9-week cellular dataset with 734K+ users and 5327 cell sites. Our analysis examines in details static and dynamic characteristics, predictability, and causes of data hotspots, and their correlation with call hotspots. We show that using standard machine learning methods, future hotspots can be accurately predicted from past observations. We believe the understanding of these key issues will lead to more efficient and responsive resource management and thus better QoS provision in cellular networks. To the best of our knowledge, our work is the first to empirically characterize traffic hotspots in today’s cellular networks.
Similar content being viewed by others
Notes
The use of haversine distance helps to cluster cells separated by a small physical distance.
References
Cisco Visual Networking Index. http://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual-networking-index-vni/white_paper_c11-520862.html
Nan E, Chu X, Guo W, Zhang J (2013) User data traffic analysis for 3G cellular networks. In: Proc. of CHINACOM, pp 468– 472
Paul U, Subramanian A, Buddhikot M, Das S (2012) Understanding spatial relationships in resource usage in cellular data networks. In: Proc. of INFOCOM workshops, pp 244–249
Laner M, Svoboda P, Schwarz S, Rupp M (2012) Users in cells: a data traffic analysis. In: Proc. of WCNC, pp 3063–3068
Paul U, Subramanian A, Buddhikot M, Das S (2011) Understanding traffic dynamics in cellular data networks. In: Proc. of INFOCOM, pp 882–890
Shafiq M Z, Ji L, Liu A Z, Pang J, Wang J (2012) Characterizing geospatial dynamics of application usage in a 3G cellular data network. In: Proc. of INFOCOM, pp 1341–1349
Shafiq M Z, Ji L, Liu A X, Wang J (2011) Characterizing and modeling Internet traffic dynamics of cellular devices. In: Proc. of SIGMETRICS, pp 305–316
Zhou X, Zhao Z, Li R, Zhou Y, Zhang H (2012) The predictability of cellular networks traffic. In: Proc. of ISCIT, pp 973–978
Mobile broadband with HSPA and LTE - capacity and cost aspects. http://www.developingtelecoms.com/business/opinion/123-white-papers-case-studies/3211-mobile-broadband-with-hspa-and-lte-capacity-and-cost-aspects.html
Sinnott R W (1984) Virtues of the haversine. Sky Telescop 68(2):159
Defays D (1977) An efficient algorithm for a complete link method. Comput J (Br Comput Soc) 20(4):364–366
Isaacman S, Becker R, Cáceres R, Kobourov S, Martonosi M, Rowland J, Varshavsky A (2011) Identifying important places in people’s lives from cellular network data. In: Proc. of 9th international conference on pervasive computing, pp 133– 151
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann R, Witten I H (2009) The WEKA data mining software: an update. SIGKDD Explor Newsl 11(1):10–18
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297
Breiman L (2001) Random forests. Mach Learn 45(1):5– 32
Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182
Mohri M, Rostamizadeh A, Talwalkar A (2012) Foundations of machine learning. The MIT Press
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Nika, A., Ismail, A., Zhao, B.Y. et al. Understanding and Predicting Data Hotspots in Cellular Networks. Mobile Netw Appl 21, 402–413 (2016). https://doi.org/10.1007/s11036-015-0648-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11036-015-0648-6