Abstract
This work presents a functional clustering procedure applied to meteorological time series. Our proposal combines time series interpolation with smoothing penalized B-spline and the partitioning around medoids clustering algorithm. Our final goal is to obtain homogeneous climate zones of Italy. We compare this approach to standard methods based on a combination of principal component analysis and Cluster Analysis (CA) and we discuss it in relation to other functional clustering approaches based on Fourier analysis and CA. We show that a functional approach is simpler than the standard methods from a methodological and interpretability point of view. Indeed, it becomes natural to find a clear connection between mathematical results and physical variability mechanisms. We discuss how the choice of the basis expansion (splines, Fourier) affects the analysis and propose some comments on their use. The basis for classification is formed by monthly values of temperature and precipitation recorded during the period 1971–2000 over 95 and 94 Italian monitoring stations, respectively. An assessment based on climatic patterns is presented to prove the consistency of the clustering and a comparison of results obtained with different methods is used to judge the functional data approach.
Similar content being viewed by others
References
Abraham C, Cornillon PA, Matzner-Loeber E, Molinari N (2003) Unsupervised curve clustering using B-splines. Scand J Statist 30:581–595
Box GEP, Cox DR (1964) An analysis of transformations. J R Stat Soc, B 26:211–246
Brunetti M, Maugeri M, Monti F, Nanni T (2006) Temperature and precipitation variability in Italy in the last two centuries from homogenized instrumental time series. Int J Climatol 26:345–381
Ehrendorfer M (1987) A regionalization of Austria’s precipitation climate using principal component analysis. Int J Climatol 7(1): 71–89
Fovell RG, Fovell MYC (1993) Climate zones of the conterminous United States defined using cluster analysis. J Climate 6:2103–2135
Harnik N, Chang EKM (2003) Storm track variations as seen in radio- sonde observations and reanalysis data. J Climate 16:480–495
Hartigan JA, Wong MA (1979) A K-means clustering algorithm. Appl Stat 28:100–108
Kaufman L, Rousseeuw PJ (1990) Finding groups in data: an introduction to cluster analysis. Wiley, New York
Kim BR, Zhang L, Berg A, Fan J, Wu R (2008) A computational approach to the functional clustering of periodic gene-expression profiles. Genetics 180:821–834
Ignaccolo R, Ghigo S, Giovenali E (2008) Analysis of air quality monitoring networks by functional clustering. Environmetrics 19:672–686
Laguardia G (2011) Representing the precipitation regime by means of Fourier series. Int J Climatol 31(9):1398–1407
Mennella C (1972) Il clima d’Italia nelle sue caratteristiche e varieta’ quale fattore dinamico del paesaggi, vol II. Fratelli Conte Editore, Napoli
Preisendorfer RW, Mobley CD (1988) Principal component analysis in meteorology and oceanography. Elsevier, Amsterdam
R Development Core Team (2011) R: a language and environment for statistical computing. R foundation for statistical computing, Vienna, Austria. ISBN 3-900051-07-0, http://www.R-project.org
Ramsay JO, Silverman BW (1997) Functional data analysis. Springer, New York
Ramsay JO, Silverman BW (2002) Applied functional data analysis: methods and case studies. Springer, New York
Ramsay JO, Wickham H, Graves S, Hooker G (2011) Fda: functional data analysis. R package version 2.2.7. http://CRAN.R-project.org/package=fda
Reynolds A, Richards G, De La Iglesia B, Rayward-Smith V (1992) Clustering rules: a comparison of partitioning and hierarchical clustering algorithms. Journal of Mathematical Modelling and Algorithms 5:475–504. doi:10.1007/s10852-005-9022-1
Richman MB (1986) Rotation of principal components. J Climatol 6:293–335
Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65
Sprent P (1998) Data driven statistical methods. Chapman & Hall, London
Sprent P, Smeeton NC (2001) Applied nonparametric statistical methods, 3rd edn. Chapman & Hall/CRC, London
Toreti A, Fioravanti G, Perconti W, Desiato F (2009) Annual and seasonal precipitation over Italy from 1961 to 2006. Int J Climatol 29(13):1976–1987
Trigo R et al (2006) Relations between variability in theMediterranean region and mid-latitude variability. In: Lionello P, Malanotte-Rizzoli P, Boscolo R (eds) Mediterranean climate variability. Elsevier, Amsterdam, pp 179–226
Von Storch H, Zwiers FW (1999) Statistical analysis in climate research. Cambridge University Press, Cambridge
WMO (1989) Calculation of monthly and annual 30-year standard normals. WCDP n.10, WMO-TD/N.341, Geneva
Acknowledgments
This work has been developed within the context of Agroscenari project “Adaptation of Agricultural Management to climate change” funded by the Italian Ministry of Agriculture. The authors would like to thank two anonymous referees for the useful comments that helped in considerably improving the paper and Prof. M. MAugeri for his suggestions.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Di Giuseppe, E., Jona Lasinio, G., Esposito, S. et al. Functional clustering for Italian climate zones identification. Theor Appl Climatol 114, 39–54 (2013). https://doi.org/10.1007/s00704-012-0801-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00704-012-0801-0