
An Efficient Method for Discovering Motifs in Streaming Time Series Data

  • Cao Duy Truong
  • Duong Tuan Anh
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 244)

Abstract

The discovery of time series motifs, that is, repeated subsequences, is useful for several higher-level data mining tasks, including classification, clustering, forecasting, and rule discovery. In recent years, significant research effort has gone into discovering these motifs efficiently in static time series data. For many applications, however, the streaming nature of time series demands a new kind of method for motif discovery. In this paper, we develop a new method for motif discovery in streaming time series. The method uses significant extreme points to determine motif candidates and then clusters the candidates with the BIRCH algorithm. It is effective not only for large time series data but also for streaming environments, since it needs only a single scan through the data.
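
The two stages named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions made only for illustration: an extreme point counts as "significant" when the series afterwards reverses by more than a threshold `delta`; a candidate spans three consecutive significant extrema; candidates are linearly resampled to a fixed `length` and mean-centred; and the clustering step keeps a flat list of BIRCH-style cluster features (count, linear sum, squared sum) instead of a full CF-tree. All function and parameter names are hypothetical.

```python
import math

def significant_extrema(stream, delta):
    """Single pass over `stream`; yield (index, value) of each extreme point
    that is followed by a reversal larger than `delta` (assumed criterion)."""
    hi_v, hi_i = -math.inf, -1
    lo_v, lo_i = math.inf, -1
    look_for_max = True
    for i, v in enumerate(stream):
        if v > hi_v:
            hi_v, hi_i = v, i
        if v < lo_v:
            lo_v, lo_i = v, i
        if look_for_max and v < hi_v - delta:
            yield hi_i, hi_v              # maximum confirmed by the drop
            lo_v, lo_i = v, i
            look_for_max = False
        elif not look_for_max and v > lo_v + delta:
            yield lo_i, lo_v              # minimum confirmed by the rise
            hi_v, hi_i = v, i
            look_for_max = True

def resample(values, length):
    """Linearly interpolate `values` (len >= 2) onto `length` evenly spaced points."""
    out = []
    for k in range(length):
        pos = k * (len(values) - 1) / (length - 1)
        j = min(int(pos), len(values) - 2)
        frac = pos - j
        out.append(values[j] * (1 - frac) + values[j + 1] * frac)
    return out

class ClusterFeature:
    """BIRCH-style summary (n, linear sum, squared sum) of a set of vectors."""
    def __init__(self, dim):
        self.n, self.ls, self.ss = 0, [0.0] * dim, 0.0
        self.starts = []                  # start index of each member candidate

    def centroid(self):
        return [s / self.n for s in self.ls]

    def radius_if_added(self, x):
        n = self.n + 1
        ls = [a + b for a, b in zip(self.ls, x)]
        ss = self.ss + sum(b * b for b in x)
        var = ss / n - sum((a / n) ** 2 for a in ls)
        return math.sqrt(max(var, 0.0))   # guard against rounding below zero

    def add(self, x, start):
        self.n += 1
        self.ls = [a + b for a, b in zip(self.ls, x)]
        self.ss += sum(b * b for b in x)
        self.starts.append(start)

def discover_motif(series, delta, length=8, threshold=0.5):
    """Return the largest cluster of candidates, or None if there are none."""
    clusters, recent = [], []
    for idx, _ in significant_extrema(series, delta):
        recent = (recent + [idx])[-3:]    # keep the last three extrema
        if len(recent) < 3:
            continue
        start, end = recent[0], recent[2]
        x = resample(series[start:end + 1], length)
        mean = sum(x) / length
        x = [v - mean for v in x]
        best = min(clusters, key=lambda c: math.dist(c.centroid(), x), default=None)
        if best is not None and best.radius_if_added(x) <= threshold:
            best.add(x, start)            # absorb into the nearest summary
        else:
            cf = ClusterFeature(length)   # otherwise start a new summary
            cf.add(x, start)
            clusters.append(cf)
    return max(clusters, key=lambda c: c.n, default=None)
```

For brevity the sketch slices a list to build each candidate; a genuinely streaming version would instead buffer only the points from the oldest pending extreme onward. Because each cluster feature is updated incrementally, no candidate needs to be revisited after it has been absorbed.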

Keywords

Time Series, Extreme Point, Time Series Data, Static Time Series, Cluster Feature
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  1. Faculty of Computer Science and Engineering, Ho Chi Minh City University of Technology, Ho Chi Minh, Vietnam
