Skip to main content

Finding Persisting States for Knowledge Discovery in Time Series

  • Conference paper
From Data and Information Analysis to Knowledge Engineering

Abstract

Knowledge Discovery in time series usually requires symbolic time series. Many discretization methods that convert numeric time series to symbolic time series ignore the temporal order of values. This often leads to symbols that do not correspond to states of the process generating the time series. We propose a new method for meaningful unsupervised discretization of numeric time series called “Persist”, based on the Kullback-Leibler divergence between the marginal and the self-transition probability distributions of the discretization symbols. In evaluations with artificial and real life data it clearly outperforms existing methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 159.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • DAW, C.S., FINNEY, C.E.A., and TRACY, E.R. (2003): A review of symbolic analysis of experimental data. Review of Scientific Instruments, 74:0 916–930.

    Google Scholar 

  • GUIMARAES, G. and ULTSCH, A. (1999): A method for temporal knowledge conversion In Proc. 3rd Int. Symp. Intelligent Data Analysis, 369–380.

    Google Scholar 

  • HARMS, S. K. and DEOGUN, J. (2004): Sequential association rule mining with time lags. Journal of Intelligent Information Systems (JIIS), 22:1, 7–22.

    Google Scholar 

  • HETLAND, M.L. and SAETROM, P. (2003): The role of discretization parameters in sequence rule evolution. In Proc. 7th Int. KES Conf., 518–525.

    Google Scholar 

  • HÖPPNER, F. (2002): Learning dependencies in multivariate time series. Proc. ECAI Workshop, Lyon, France, 25–31.

    Google Scholar 

  • KEOGH, E. (2002): The UCR Time Series Data Mining Archive http://www.cs.ucr.edu/~eamonn/TSDMA/index.html

    Google Scholar 

  • KEOGH, E., LONARDI, S., and CHIU, B. (2002): Finding Surprising Patterns in a Time Series Database in Linear Time and Space In Proc. 8th ACM SIGKDD, 550–556.

    Google Scholar 

  • KEOGH, E., CHU, S., HART, D., and PAZZANI, M. (2004): Segmenting time series: A survey and novel approach. Data Mining in Time Series Databases, World Scientific, 1–22.

    Google Scholar 

  • KULLBACK, S. and LEIBLER, R.A. (1951): On information and sufficiency Annals of Mathematical Statistics, 22, 79–86.

    MathSciNet  Google Scholar 

  • LIN, J., KEOGH, E., LONARDI, S., and CHIU, B. (2003): A symbolic representation of time series, with implications for streaming algorithms. In Proc. 8th ACM SIGMOD, DMKD workshop, 2–11.

    Google Scholar 

  • LIU, H., HUSSAIN, F., TAN, C.L., and DASH, M. (2002): Discretization: An Enabling Technique. Data Mining and Knowledge Discovery, 4:6, 393–423.

    MathSciNet  Google Scholar 

  • MÖRCHEN, F. and ULTSCH, A. (2004): Discovering Temporal Knowlegde in Multivariate Time Series In Proc. GfKl, Dortmund, Germany, 272–279.

    Google Scholar 

  • MÖRCHEN, F., ULTSCH, A., and HOOS, O. (2005): Extracting interpretable muscle activation patterns with time series knowledge mining. Intl. Journal of Knowledge-Based & Intelligent Engineering Systems (to appear).

    Google Scholar 

  • RODRIGUEZ, J.J., ALSONSO, C.J., and BOSTRÖM, H. (2000): Learning First Order Logic Time Series Classifiers In Proc. 10th Intl. Conf. on Inductive Logic Programming, 260–275.

    Google Scholar 

  • RABINER, L. R. (1989): A tutorial on hidden markov models and selected applications in speech recognition. In Proc. of IEEE, 77(2):0 257–286.

    Google Scholar 

  • ULTSCH, A. (2003): Pareto Density Estimation: Probability Density Estimation for Knowledge Discovery. In Proc. GfKl, Cottbus, Germany, 91–102.

    Google Scholar 

  • VAN WIJK, J. J., VAN SELOW, E. R. (1999): Cluster and Calendar Based Visualization of Time Series Data. In Proc. INFOVIS, 4–9.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer Berlin · Heidelberg

About this paper

Cite this paper

Mörchen, F., Ultsch, A. (2006). Finding Persisting States for Knowledge Discovery in Time Series. In: Spiliopoulou, M., Kruse, R., Borgelt, C., Nürnberger, A., Gaul, W. (eds) From Data and Information Analysis to Knowledge Engineering. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31314-1_33

Download citation

Publish with us

Policies and ethics