Abstract
Swap randomization has been shown to be an effective technique for assessing the significance of data mining results such as Boolean matrices, frequent itemsets, correlations or clusterings. Basically, instead of applying statistical tests on selected attributes, the global structure of the actual dataset is taken into account by checking whether obtained results are likely or not to occur in randomized datasets whose column and row margins are equal to the ones of the actual dataset. In this paper, a swap randomization approach for bases of sequences is proposed with the aim of assessing sequential patterns extracted from Satellite Image Time Series (SITS). This assessment relies on the spatiotemporal locations of the extracted patterns. Using an entropy-based measure, the locations obtained on the actual dataset and a single swap randomized dataset are compared. The potential and generality of the proposed approach is evidenced by experiments on both optical and radar SITS.
Chapter PDF
References
Agrawal, R., Srikant, R.: Mining sequential patterns. In: Proceedings of the Eleventh International Conference on Data Engineering, Taipei, Taiwan, pp. 3–14, March 1995
Besag, J.: Markov chain monte carlo methods for statistical inference (2004). http://www-users.mat.umk.pl/~wniem/SemMgr/besag_MCMC.pdf
Besag, J., Clifford, P.: Generalized monte carlo significance tests. Biometrika 76(4), 633–642 (1989)
Chuvieco, E., Huete, A.: Fundamentals of Satellite Remote Sensing. CRC Press, Boca Raton (2009)
Cobb, G.W., Chen, Y.: An application of markov chain monte carlo to community ecology. The American Mathematical Monthly 110(4), 265–288 (2003)
Cover, T.M., Thomas, J.A.: Elements of Information Theory (Wiley Series in Telecommunications and Signal Processing). Wiley-Interscience (2006)
Doin, M., Lodge, F., Guillaso, S., Jolivet, R., Lasserre, C., Ducret, G., Grandin, R., Pathier, E., Pinel, V.: Presentation of the small baseline nsbas processing chain on a case example: the etna deformation monitoring from 2003 to 2010 using envisat data. In: Proceedings of the Fringe Symposium, pp. 3434–3437. ESA SP-697, ESA Communications, Frascati, September 2011
Fisher, R., Dawson-Howe, K., Fitzgibbon, A., Robertson, C., Trucco, E.: Dictionary of Computer Vision and Image Processing. John Wiley and Sons, New York (2005)
Gionis, A., Mannila, H., Mielikäinen, T., Tsaparas, P.: Assessing data mining results via swap randomization. In: Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Philadelphia, PA, USA, pp. 167–176, August 2006
Gionis, A., Mannila, H., Mielikäinen, T., Tsaparas, P.: Assessing data mining results via swap randomization. TKDD 1(3) (2007)
Gonzalez, R.C., Woods, R.E.: Digital Image Processing, 3rd edn. Prentice-Hall Inc., Upper Saddle River (2006)
Good, P.: Permutation tests : a practical guide to resampling methods for testing hypotheses. Springer series in statistics. Springer, New York (2000)
Gueguen, L., Datcu, M.: Image time-series data mining based on the information-bottleneck principle. IEEE Trans. Geoscience and Remote Sensing 45(4), 827–838 (2007)
Guttler, F., Ienco, D., Teisseire, M., Nin, J., Poncelet, P.: Towards the use of sequential patterns for detection and characterization of natural and agricultural areas. In: Laurent, A., Strauss, O., Bouchon-Meunier, B., Yager, R.R. (eds.) IPMU 2014, Part I. CCIS, vol. 442, pp. 97–106. Springer, Heidelberg (2014)
Hanhijärvi, S., Ojala, M., Vuokko, N., Puolamäki, K., Tatti, N., Mannila, H.: Tell me something I don’t know: randomization strategies for iterative data mining. In: Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, pp. 379–388, June-July 2009
Honda, R., Konishi, O.: Temporal rule discovery for time-series satellite images and integration with RDB. In: Siebes, A., De Raedt, L. (eds.) PKDD 2001. LNCS (LNAI), vol. 2168, p. 204. Springer, Heidelberg (2001)
Julea, A., Méger, N., Bolon, P., Rigotti, C., Doin, M., Lasserre, C., Trouvé, E., Lazarescu, V.: Unsupervised spatiotemporal mining of satellite image time series using grouped frequent sequential patterns. IEEE Trans. Geoscience and Remote Sensing 49(4), 1417–1430 (2011)
Julea, A., Méger, N., Rigotti, C., Trouvé, E., Jolivet, R., Bolon, P.: Efficient spatio-temporal mining of satellite image time series for agricultural monitoring. Trans. MLDM 5(1), 23–44 (2012)
Luo, C., Chung, S.M.: Efficient mining of maximal sequential patterns using multiple samples. In: Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005, Newport Beach, CA, USA, pp. 415–426. SIAM, April 2005
Pei, J., Han, J., Wang, W.: Constraint-based sequential pattern mining: the pattern-growth methods. J. Intell. Inf. Syst. 28(2), 133–160 (2007)
Petitjean, F., Inglada, J., Gançarski, P.: Satellite image time series analysis under time warping. IEEE Trans. Geoscience and Remote Sensing 50(8), 3081–3095 (2012)
Rigotti, C., Lodge, F., Méger, N., Pothier, C., Jolivet, R., Lasserre, C.: Monitoring of tectonic deformation by mining satellite image time series. In: Reconnaissance de Formes et Intelligence Artificielle (RFIA), Rouen, France, June 2014
Ryser, H.J.: Combinatorial properties of matrices of zeros and ones. Canadian Journal of Mathematics 9, 371–377 (1957)
Vuokko, N., Kaski, P.: Significance of patterns in time series collections. In: Proceedings of the Eleventh SIAM International Conference on Data Mining, SDM 2011, April 28–30, Mesa, Arizona, USA, pp. 676–686 (2011)
Zaki, M.J.: Sequence mining in categorical domains: Incorporating constraints. In: Proceedings of the Ninth International Conference on Information and Knowledge Management, CIKM 2000, pp. 422–429. ACM, New York, November 2000
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Méger, N., Rigotti, C., Pothier, C. (2015). Swap Randomization of Bases of Sequences for Mining Satellite Image Times Series . In: Appice, A., Rodrigues, P., Santos Costa, V., Gama, J., Jorge, A., Soares, C. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2015. Lecture Notes in Computer Science(), vol 9285. Springer, Cham. https://doi.org/10.1007/978-3-319-23525-7_12
Download citation
DOI: https://doi.org/10.1007/978-3-319-23525-7_12
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23524-0
Online ISBN: 978-3-319-23525-7
eBook Packages: Computer ScienceComputer Science (R0)