Abstract
The comparison of two time series and the extraction of subsequences that are common to the two is a complex data mining problem. Many existing techniques, like the Discrete Fourier Transform (DFT), offer solutions for comparing two whole time series. Often, however, the important thing is to analyse certain regions, known as events, rather than the whole times series. This applies to domains like the stock market, seismography or medicine. In this paper, we propose a method for comparing two time series by analysing the events present in the two. The proposed method is applied to time series generated by stabilometric and posturographic systems within a branch of medicine studying balance-related functions in human beings.
This work was funded by the Spanish Ministry of Education and Science as part of the 2004-2007 National R&D&I Plan through the VIIP Project (DEP2005-00232-C03).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Faloutsos, C., Swami, A.: Efficient Similarity Search In Sequence Databases. In: FODO. Evanston, Illinois (1993)
Chan, K., Fu, A.W.: Efficient Time Series Matching by Wavelets. In: ICDE, pp. 126–133. Sydney-AUS (1999)
Povinelli, R.: Time Series Data Mining: identifying temporal patterns for characterization and prediction of time series, PhD. Thesis. Milwaukee (1999)
Chaovalitwongse, W.A., Fan, Y., Sachdeo, R.C.: On the Time Series K-Nearest Neighbor Classification of Abnormal Brain Activity. IEEE Transactions on Systems, Man, and Cybernetics-Part A: Systems and Humans 1 (2007)
Lee, C.L., Liu, A., Chen, W.: Pattern Discovery of Fuzzy Time Series for Financial Prediction. IEEE Transactions on Knowledge and Data Engineering 18(5) (2006)
Yin, J., Zhou, D., Xie, Q.: A Clustering Algorithm for Time Series Data. In: Proceedings of the Seventh International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2006). IEEE, Los Alamitos (2006)
Tseng, V.S., Chen, C., Chen, C., Hong, T.: Segmentation of Time Series by the Clustering and Genetic Algorithms. In: Sixth IEEE International Conference on Data Mining - Workshops ICDMW (2006)
Kumar, R.P., Nagabhushan, P., Chouakria-Douzal, A.: WaveSim and Adaptive WaveSim Transform for Subsequence Time-Series Clustering. In: 9th International Conference on Information Technology, ICIT (2006)
Perng, C., Wang, H., Zhang, S.R., Parker, D.S.: Landmarks: A New Model for Similarity-Based Pattern Querying in Time Series Databases. In: ACDE, San Diego, USA, pp. 33–44 (2000)
Rafiei, D., Mendelzon, A.: Similarity-Based Queries for Time Series Data. In: ACM SIGMOD, Tucson, AZ, pp. 13–25 (1997)
Park, S., Chu, W., Yoon, J., Hsu, C.: Efficient Searches for Similar Subsequences of Different Lengths in Sequence Databases. In: ICDE, San Diego, USA, pp. 23–32 (2000)
Lee, S., Chun, S., Kim, D., Lee, J., Chung, C.: Similarity Search for Multidimensional Data Sequences. In: ICDE, San Diego, USA, pp. 599–610 (2000)
Alonso, F., Martínez, L., Pérez, A., Santamaría, A., Caraça-Valente, J.P.: Integrating Expert Knowledge and Data Mining for Medical Diagnosis. In: Expert Systems Research Trends, cap. 3, pp. 113–137. Nova Science Ed. (2007)
Povinelli, R., Feng, X.: A New Temporal Pattern Identification Method for Characterization and Prediction of Complex Time Series Events. IEEE Transactions on Knowledge and Data Engineering 15(2) (2003)
Vilalta, R., Sheng, M.: Predicting rare events in temporal domain. In: IEEE International Conference on Data Mining, pp. 474– 481 (2002)
Faloutsos, C., Ranganathan, M., Manolopoulos, Y.: Fast Subsequence Matching in Time-Series Databases, pp. 4190–429. ACM SIGMOD (1994)
Kahveci, T., Singh, A., Gürel, A.: Shift and scale invariant search of multi-attribute time sequences, Technical report, UCSB (2001)
Wang, Y., Zhou, L., Feng, J., Wang, J., Liu, Z.: Mining Complex Time-Series Data by Learning Markovian Models. In: Proceedings of the Sixth International Conference on Data Mining ICDM 2006. IEEE, Los Alamitos (2006)
Jan, L., Vasileios, L., Qiang, M., Lakaemper, W.R., Ratanamahatana, C.A., Keogh, E.: Partial Elastic Matching of Time Series. In: Proceedings of the Fifth IEEE International Conference on Data Mining (2005)
Dong, X., Gu, C., Wang, Z.: Research On Shape-Based Time Series Similarity Measure. In: Proceedings of the IEEE Fifth International Conference on Machine Learning and Cybernetics. Dalian (2006)
Lederberg, J.: How Dendral Was Conceived and Born. In: ACM Symposium on the History of Medical Informatics, November 05 1987. National Library of Medicine, Rockefeller University (1987)
Shortliffe, E.H.: Computer Based Medical Consultations: MYCIN. American Elsevier, Amsterdam (1976)
Edberg, S.C.: Global infectious diseases and epidemiology network (GIDEON): A world wide web-based program for diagnosis and informatics in infectious diseases, Clinical Infectious Diseases. official Publication of the Infectious Diseases Society of America 40(1), 123–126 (2005)
Gil, D., Soriano, A., Ruiz, D., Montejo, C.A.: Embedded systems for diagnosing dysfunctions in the lower urinary tract. In: Proceedings of the 22nd Annual ACM Symposium on Applied Computing (2007)
Alonso, F., Caraça-Valente, J.P., González, A.L., Montes, C.: Combining expert knowledge and data mining in a medical domain. Expert Systems with Applications 23, 367–375 (2002)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lara, J.A., Pérez, A., Valente, J.P., López-Illescas, Á. (2009). Comparing Time Series through Event Clustering. In: Corchado, J.M., De Paz, J.F., Rocha, M.P., Fernández Riverola, F. (eds) 2nd International Workshop on Practical Applications of Computational Biology and Bioinformatics (IWPACBB 2008). Advances in Soft Computing, vol 49. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85861-4_1
Download citation
DOI: https://doi.org/10.1007/978-3-540-85861-4_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85860-7
Online ISBN: 978-3-540-85861-4
eBook Packages: EngineeringEngineering (R0)