Abstract
This study presents the application of clustering techniques to a real-life problem of studying the air quality of the Castilla y León region in Spain. The goal of this work is to analyze the level of air pollution in eight points of this Spanish region between years 2008 and 2015. The analyzed data were provided by eight acquisition stations from the regional network of air quality. The main pollutants recorded at these stations are analyzed in order to study the characterization of such stations, according to a zoning process, and their time evolution. Four cluster evaluation and a clustering technique, with the main distance measures, have been applied to the dataset under analysis.
References
Government of Spain - Aporta Project. http://administracionelectronica.gob.es
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. (CSUR) 31(3), 264–323 (1999)
Kassomenos, P., Vardoulakis, S., Borge, R., Lumbreras, J., Papaloukas, C., Karakitsios, S.: Comparison of statistical clustering techniques for the classification of modelled atmospheric trajectories. Theoret. Appl. Climatol. 102, 1–12 (2010)
Pires, J.C.M., Sousa, S.I.V., Pereira, M.C., Alvim-Ferraz, M.C.M., Martins, F.G.: Management of air quality monitoring using principal component and cluster analysis—Part I: SO2 and PM10. Atmos. Environ. 42(6), 1249–1260 (2008)
European Commission - Air Quality Standards. http://ec.europa.eu/environment/air/quality/standards.htm
Liu, Y., Li, Z., Xiong, H., Gao, X., Wu, J.: Understanding of internal clustering validation measures. In: IEEE International Conference on Data Mining, pp. 911–916 (2010)
Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recogn. Lett. 31, 651–666 (2010)
Barlow, H.: Unsupervised learning. Neural Comput. 1, 295–311 (1989)
Caliński, T., Harabasz, J.: A dendrite method for cluster analysis. Commun. Stat. Theory Methods 3, 1–27 (1974)
Rousseeuw, P.J.: Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)
Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. 1(2), 224–227 (1979)
Tibshirani, R., Walther, G., Hastie, T.: Estimating the number of clusters in a data set via the gap statistic. J. Roy. Stat. Soc.: Ser. B (Stat. Methodol.) 63, 411–423 (2001)
Ding, C., He, X.: K-means clustering via principal component analysis. In: Proceedings of the Twenty-First International Conference on Machine Learning, p. 29 (2004)
Danielsson, P.E.: Euclidean distance mapping. Comput. Graph. Image Process. 14, 227–248 (1980)
Government of Castilla y León - Zoning of the territory in Castilla y León. http://www.jcyl.es/
European Union Law - Directive 2008/50/EC of the European Parliament and of the Council of 21 May 2008 on ambient air quality and cleaner air for Europe. http://eur-lex.europa.eu/
Government of Castilla y León - Annual reports of the Air Quality. http://www.medioambiente.jcyl.es/
PubChem - PubChem Compounds. https://pubchem.ncbi.nlm.nih.gov/compound
ISO - International Organization for Standardization. PM10/PM2.5. https://www.iso.org/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Arroyo, Á., Tricio, V., Herrero, Á., Corchado, E. (2017). Time Analysis of Air Pollution in a Spanish Region Through k-means. In: Graña, M., López-Guede, J.M., Etxaniz, O., Herrero, Á., Quintián, H., Corchado, E. (eds) International Joint Conference SOCO’16-CISIS’16-ICEUTE’16. SOCO CISIS ICEUTE 2016 2016 2016. Advances in Intelligent Systems and Computing, vol 527. Springer, Cham. https://doi.org/10.1007/978-3-319-47364-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-47364-2_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47363-5
Online ISBN: 978-3-319-47364-2
eBook Packages: EngineeringEngineering (R0)