Abstract
We analyze geo-referenced high-dimensional data describing the use over time of the mobile-phone network in the urban area of Milan, Italy. Aim of the analysis is to identify subregions of the metropolitan area of Milan sharing a similar pattern along time, and possibly related to activities taking place in specific locations and/or times within the city. To tackle this problem, we develop a non-parametric method for the analysis of spatially dependent functional data, named Bagging Voronoi Treelet analysis. This novel approach integrates the treelet decomposition with a proper treatment of spatial dependence, obtained through a Bagging Voronoi strategy. The latter relies on the aggregation of different replicates of the analysis, each involving a set of functional local representatives associated to random Voronoi-based neighborhoods covering the investigated area. Results clearly point out some interesting temporal patterns interpretable in terms of population density mobility (e.g., daily work activities in the tertiary district, leisure activities in residential areas in the evenings and in the weekend, commuters movements along the highways during rush hours, and localized mob concentrations related to occasional events). Moreover we perform simulation studies, aimed at investigating the properties and performances of the method, and whose description is available online as Supplementary material.
Similar content being viewed by others
References
Banerjee S, Carlin B, Gelfand A (2004) Hierarchical modeling and analysis for spatial data. Monographs on statistics and applied probability. Chapman & Hall, London
Becker RA, Caceres R, Hanson K, Loh JM, Urbanek S, Varshavsky A, Volinsky C (2011) A tale of one city: using cellular network data for urban planning. IEEE Pervasive Comput 10(4):18–26
Calabrese F, Lorenzo GD, Liu L, Ratti C (2011) Estimating origin-destination flows using mobile phone location data. IEEE Pervasive Comput 10(4):36–44
James GM (2007) Curve alignment by moments. Ann Appl Stat 1:480–501
Kaziska D, Srivastava A (2007) Gait-based human recognition by classification of cyclostationary processes on nonlinear shape manifolds. J Am Stat Assoc 102:1114–1128
Ke C, Wang Y (2001) Semiparametric nonlinear mixed-effects models and their applications. J Am Stat Assoc 96:1272–1298
Kunsch H, Geman S, Kehagias A (1995) Hidden markov random fields. Ann Appl Probab 5(3):577–602
Lee AB, Nadler B, Wasserman L (2008) Treelets—an adaptive multi-scale basis for sparse unordered data. Ann Appl Stat 2(2):435–471
Mallat SG (1989) A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans Pattern Anal Mach Intell 11:674–693
Manfredini F, Pucci P, Secchi P, Tagliolato P, Vantini S, Vitelli V (2015) Treelet decomposition of mobile phone data for deriving city usage and mobility pattern in the Milan urban region. In: Paganoni AM, Secchi P (eds) Advances in complex data modeling and computational methods in statistics., Contributions to statisticsSpringer, Berlin, pp 133–147
OECD (2006a) OECD Territorial reviews: competitive cities in the global economy. OECD Publishing, Paris
OECD (2006b) OECD Territorial reviews: Milan, Italy. OECD Publishing, Paris
Ramsay JO, Li X (1998) Curve registration. J R Stat Soc Ser B Stat Methodol 60:351–363
Ramsay JO, Silverman BW (2005) Functional data analysis. Springer, Berlin
Sangalli LM, Secchi P, Vantini S, Veneziani A (2009) A case study in exploratory functional data analysis: geometrical features of the internal carotid artery. J Am Stat Assoc 104:37–48
Sangalli LM, Secchi P, Vantini S, Vitelli V (2010) K-mean alignment for curve clustering. Comput Stat Data Anal 54:1219–1233
Secchi P, Vantini S, Vitelli V (2013) Bagging voronoi classifiers for clustering spatial functional data. Int J Appl Earth Obs Geoinf 22:53–64
Secchi P, Vantini S, Zanini P (2014) Hierarchical independent component analysis: a multi-resolution non-orthogonal data-driven basis. In: Tech Rep 01/2014, MOX—Dipartimento di Matematica, Politecnico di Milano
Acknowledgments
This research has been carried out within the Green Move Project, a joint research program involving MOX Laboratory for Modeling and Scientific Computing (Department of Mathematics, Politecnico di Milano) and funded by Regione Lombardia. We thank Convenzione di Ricerca DiAP–Politecnico di Milano and Telecom Italia that provided the data. We would also like to thank Paola Pucci, Fabio Mafredini and Paolo Tagliolato (Department of Architecture and Urban Studies, Politecnico di Milano) for the interesting discussions on the interpretation of the outcomes of the statistical analysis described in this paper.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Secchi, P., Vantini, S. & Vitelli, V. Analysis of spatio-temporal mobile phone data: a case study in the metropolitan area of Milan. Stat Methods Appl 24, 279–300 (2015). https://doi.org/10.1007/s10260-014-0294-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10260-014-0294-3