Abstract
Spatial clustering is an important component of spatial data analysis which aims in identifying the boundaries of domains and their number. It is commonly used in disease surveillance, spatial epidemiology, population genetics, landscape ecology, crime analysis and many other fields. In this paper, we focus on identifying homogeneous sub-regions in binary data, which indicate the presence or absence of a certain plant species which are observed over a two-dimensional lattice. To solve this clustering problem we propose to use the change-point methodology. we consider a Sequential Importance Sampling approach to change-point methodology using Monte Carlo simulation to find estimates of change-points as well as parameters on each domain. Numerical experiments illustrate the effectiveness of the approach. We applied this method to artificially generated data set and compared with the results obtained via binary segmentation procedure. We also provide example with real data set to illustrate the usefulness of this method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Anderson, C., Lee, D., Dean, N.: Bayesian cluster detection via adjacency modelling. Spat. Spatio-Temporal Epidemiol. 16, 11–20 (2016)
Beckage, B., Joseph, L., Belisle, P., Wolfson, D.B., Platt, W.J.: Bayesian change-point analyses in ecology. New Phytol. 174(2), 456–467 (2007)
Braun, J.V., Muller, H.-G.: Statistical methods for DNA sequence segmentation. Stat. Sci. 13(2), 142–162 (1998)
Chen, J., Gupta, A.K.: Parametric Statistical Change Point Analysis: With Applications to Genetics, Medicine, and Finance. Springer Science & Business Media (2011)
Chen, S.S., Gopalakrishnan, P.S.: Clustering via the bayesian information criterion with applications in speech recognition. In: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, 1998, vol. 2, pp. 645–648. IEEE (1998)
Cliff, A.D., Ord, J.K.: Spatial Processes: Models & Applications. Taylor & Francis (1981)
Cressie, N.: Statistics for Spatial Data. Wiley (2015)
Eckley, I.A., Fearnhead, P., Killick, R.: Analysis of changepoint models. In: Bayesian Time Series Models, pp. 205–224 (2011)
Fryzlewicz, P., et al.: Wild binary segmentation for multiple change-point detection. Ann. Stat. 42(6), 2243–2281 (2014)
Gangnon, R.E., Clayton, M.K.: Bayesian detection and modeling of spatial disease clustering. Biometrics 56(3), 922–935 (2000)
Helterbrand, J.D., Cressie, N., Davidson, J.L.: A statistical approach to identifying closed object boundaries in images. Adv. Appl. Probab. 26(04), 831–854 (1994)
Killick, R., Eckley, I.A., Jonathan, P., Chester, U.: Efficient detection of multiple changepoints within an oceano-graphic time series. In: Proceedings of the 58th World Science Congress of ISI (2011)
López, I., Gámez, M., Garay, J., Standovár, T., Varga, Z.: Application of change-point problem to the detection of plant patches. Acta Biotheor. 58(1), 51–63 (2010)
Manuguerra, M., Sofronov, G., Tani, M., Heller, G.: Monte Carlo methods in spatio-temporal regression modeling of migration in the EU. In: 2013 IEEE Conference on Computational Intelligence for Financial Engineering & Economics (CIFEr), pp. 128–134. IEEE (2013)
Olshen, A.B., Venkatraman, E., Lucito, R., Wigler, M.: Circular binary segmentation for the analysis of array-based DNA copy number data. Biostatistics 5(4), 557–572 (2004)
Polushina, T., Sofronov, G.: Change-point detection in binary Markov DNA sequences by the cross-entropy method. In: 2014 Federated Conference on Computer Science and Information Systems (FedCSIS), pp. 471–478. IEEE (2014)
Priyadarshana, W., Sofronov, G.: Multiple break-points detection in array CGH data via the cross-entropy method. IEEE/ACM Trans. Comput. Biol. Bioinf. (TCBB) 12(2), 487–498 (2015)
Raveendran, N., Sofronov, G.: Binary segmentation methods for identifying boundaries of spatial domains. Computer Science and Information Systems (FedCSIS) 13, 95–102 (2017)
Rubinstein, R.Y., Kroese, D.P.: Simulation and the Monte Carlo Method, vol. 10, 2nd edn. Wiley (2007)
Sofronov, G.: Change-point modelling in biological sequences via the bayesian adaptive independent sampler. Int. Proc. Comput. Sci. Inf. Technol. 5(2011), 122–126 (2011)
Sofronov, G.Y., Evans, G.E., Keith, J.M., Kroese, D.P.: Identifying change-points in biological sequences via sequential importance sampling. Environ. Model. & Assess. 14(5), 577–584 (2009)
Sofronov, G.Y., Glotov, N.V., Zhukova, O.V.: Statistical analysis of spatial distribution in populations of microspecies of Alchemilla l. In: Proceedings of the 30th International Workshop on Statistical Modelling, vol. 2, pp. 259–262. Statistical Modelling Society, Linz, Austria (2015)
Tripathi, S., Govindaraju, R.S.: Change detection in rainfall and temperature patterns over India. In: Proceedings of the Third International Workshop on Knowledge Discovery from Sensor Data, pp. 133–141. ACM (2009)
Tung, A.K., Hou, J., Han, J.: Spatial clustering in the presence of obstacles. In: Proceedings of 17th International Conference on Data Engineering, 2001, pp. 359–367. IEEE (2001)
Upton, G.J., Fingleton, B.: Spatial Data Analysis by Example, vol. 1: Point Pattern and Quantitative Data. Wiley, Chichester, 1 (1985)
Wang, Y.: Change curve estimation via wavelets. J. Am. Stat. Assoc. 93(441), 163–172 (1998)
Yang, T.Y., Swartz, T.B.: Applications of binary segmentation to the estimation of quantal response curves and spatial intensity. Biometrical J. 47(4), 489–501 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Raveendran, N., Sofronov, G. (2019). Identifying Clusters in Spatial Data Via Sequential Importance Sampling. In: Fidanova, S. (eds) Recent Advances in Computational Optimization. Studies in Computational Intelligence, vol 795. Springer, Cham. https://doi.org/10.1007/978-3-319-99648-6_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-99648-6_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-99647-9
Online ISBN: 978-3-319-99648-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)