Abstract
Time series clustering by k-Means algorithm still has to overcome the dilemma of choosing the initial cluster centers. In this paper, we present a new method for initializing the k-Means clustering algorithm of time series data. Our initialization method hinges on the use of time series motif information detected by a previous task in choosing k time series in the database to be the seeds. Experimental results show that our proposed clustering approach performs better than ordinary k-Means in terms of clustering quality, robustness and running time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Buza, K., Thieme, L.S.: Motif-based classification of time series with Bayesian networks and SVMs. In: Fink, A., et al. (eds.) Advances in Data Analysis, Data Handling and Business Intelligences, Studies in Classification, Data Analysis, Knowledge Organization, pp. 105–114. Springer, Heidelberg (2010)
Chiu, B., Keogh, E., Lonardi, S.: Probabilistic discovery of time series motifs. In: Proc. of 9th Int. Conf. on Knowledge Discovery and Data Mining (KDD 2003), pp. 493–498 (2003)
Jiang, Y., Li, C., Han, J.: Stock temporal prediction based on time series motifs. In: Proc. of 8th Int. Conf. on Machine Learning and Cybernetics, Baoding, July 12-15 (2009)
Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Dimensionality reduction for fast similarity search in large time series database. Journal of Knowledge and Information Systems, 263–286 (2000)
Keogh, E., Folias, T.: The UCR time series data mining archive (2002), http://www.cs.ucr.edu/~eamonn/TSDMA/index.html
Le, P.: Motif-based method for k-Means clustering of time series data. Master Thesis, Faculty of Computer Science & Engineering, Ho Chi Minh City University of Technology, Vietnam (June 2011)
Lin, J., Keogh, E., Lonardi, S., Patel, P.: Finding motifs in time series. In: Proc. of 2nd Workshop on Temporal Data Mining, KDD 2002 (2002)
Lin, J., Vlachos, M., Keogh, E.J., Gunopulos, D.: Iterative Incremental Clustering of Time Series. In: Hwang, J., Christodoulakis, S., Plexousakis, D., Christophides, V., Koubarakis, M., Böhm, K. (eds.) EDBT 2004. LNCS, vol. 2992, pp. 106–122. Springer, Heidelberg (2004)
Mueen, A., Keogh, E., Zhu, Q., Cash, S., Westover, B.: Exact discovery of time series motifs. In: Proc. of SDM (2009)
Historical Data for S&P 500 Stocks, http://kumo.swcp.com/stocks/
Zhang, H., Ho, T.B., Zhang, Y., Lin, M.S.: Unsupervised feature extraction for time series clustering using orthogonal wavelet transform. Journal Informatica 30(3), 305–319 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Phu, L., Anh, D.T. (2011). Motif-Based Method for Initialization the K-Means Clustering for Time Series Data. In: Wang, D., Reynolds, M. (eds) AI 2011: Advances in Artificial Intelligence. AI 2011. Lecture Notes in Computer Science(), vol 7106. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25832-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-642-25832-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25831-2
Online ISBN: 978-3-642-25832-9
eBook Packages: Computer ScienceComputer Science (R0)