Abstract
Given a set of time series, our goal is to identify prototypes that cover the maximum possible amount of occurring subsequences regardless of their order. This scenario appears in the context of the automotive industry, where the goal is to determine operational profiles that comprise frequently recurring driving behavior patterns. This problem can be solved by clustering, however, standard distance measures such as the dynamic time warping distance might not be suitable for this task, because they aim at capturing the cost of aligning two time series rather than rewarding pairwise recurring patterns. In this contribution, we propose a novel time series distance measure, based on the notion of recurrence plots, which enables us to determine the (dis)similarity of multivariate time series that contain segments of similar trajectories at arbitrary positions. We use recurrence quantification analysis to measure the structures observed in recurrence plots and to investigate dynamical properties, such as determinism, which reflect the pairwise (dis)similarity of time series. In experiments on real-life test drives from Volkswagen, we demonstrate that clustering multivariate time series using the proposed recurrence plot-based distance measure results in prototypical test drives that cover significantly more recurring patterns than using the same clustering algorithm with dynamic time warping distance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Keogh, E.J., Zhu, Q., Hu, B., Hao, Y., Xi, X., Wei, L., Ratanamahatana, C.A.: The (UCR) time series classification/clustering homepage, www.cs.ucr.edu/eamonn/time_series_data/ (2011)
Kumar, M., Patel, N.R., Woo, J.: Clustering seasonality patterns in the presence of errors. In: KDD (2002)
Lines, J., Bagnall, A., Caiger-Smith, P., Anderson, S.: Classification of household devices by electricity usage profiles. In: IDEAL, pp. 403–412 (2011)
Moeller-Levet, C.S., Klawonn, F., Cho, K.-H., Wolkenhauer, O.: Fuzzy clustering of short time-series and unevenly distributed sampling points. In: IDA, pp. 28–30 (2003)
Axel, W., Oliver, L., Dersch, D.R., Leinsinger, G.L., Klaus, H., Benno, P., Dorothee, A.: Cluster analysis of biomedical image time-series. Int. J. Comput. Vision 46(2), 103–128 (2002)
Gustavo, E.A., Batista, P.A., Wang, X., Keogh, E.J.: A complexity-invariant distance measure for time series. In: SDM, pp. 699–710 (2011)
Liao, T.W.: Clustering of time series data—a survey. J. Pattern Recognit. 38(11), 1857–1874 (2005)
Ding, H., Trajcevski, G., Scheuermann, P., Wang, X., Keogh, E.J.: Querying and mining of time series data: experimental comparison of representations and distance measures. PVLDB 1(2), 1542–1552 (2008)
Keogh, E.J., Kasetty, S.: On the need for time series data mining benchmarks: a survey and empirical demonstration. Data Min. Knowl. Discov. 7(4), 349–371 (2003)
Rakthanmanon, T., Campana, B.J.L., Mueen, A., Batista, G., Westover, M.B., Zhu, Q., Zakaria, J., Keogh, E.J.: Searching and mining trillions of time series subsequences under dynamic time warping. In: KDD, pp. 262–270 (2012)
Chiu, B.Y.-c., Keogh, E.J., Lonardi, S.: Probabilistic discovery of time series motifs. In: KDD, pp. 493–498 (2003)
Lin, J., Keogh, E.J., Lonardi, S., Patel, P.: Finding motifs in time series. In: KDD (2002)
Rakthanmanon, T., Keogh, E.J.: Fast-shapelets: a scalable algorithm for discovering time series shapelets. In: SDM (2013)
Zakaria, J., Mueen, A., Keogh, E.J.: Clustering time series using unsupervised-shapelets. In: ICDM, pp. 785–794 (2012)
Stephan, S., Johannes, J.B., William, De L.E., Sahin, A.: Pattern recognition in multivariate time series: dissertation proposal. In: PIKM, pp. 27–34 (2011)
Stephan, S., Julia, G., Andreas, L., Ernesto, De L., Sahin, A.: Pattern recognition and classification for multivariate time series. In: SensorKDD, pp. 34–42 (2011)
Spiegel, S., Albayrak, S.: An order-invariant time series distance measure—Position on recent developments in time series analysis. In: KDIR, pp. 264–268 (2012)
Bing, H., Chen, Y., Keogh, E.J.: Time series classification under more realistic assumptions. In: SDM (2013)
Keogh, E.J., Lin, J.: Clustering of time-series subsequences is meaningless: implications for previous and future research. Knowl. Inf. Syst. 8(2), 154–177 (2005)
Keogh, E.J., Lin, J., Fu, A.W.-C.: HOT SAX: efficiently finding the most unusual time series subsequence. In: ICDM, pp. 226–233 (2005)
Marwan, N.: Encounters with neighbours: current developments of concepts based on recurrence plots and their applications. University of Potsdam (2003)
Marwan, N., Romano, M., Thiel, M., Kurths, J.: Recurrence plots for the analysis of complex systems. Phys. Rep. 438(5–6), 237–329 (2007)
Marwan, N.: A historical review of recurrence plots. Eur. Phys. J. Special Topics 164(1), 3–12 (2008)
Marwan, N., Romano, M., Thiel, M.: Recurrence plots and cross recurrence plots. www.recurrence-plot.tk
Marwan, N., Schinkel, S., Kurths, J.: Recurrence plots 25 years later—gaining confidence in dynamical transitions. Europhys. Lett., 101(2) (2013)
Marwan, N.: How to avoid potential pitfalls in recurrence plot based data analysis. I. J. Bifurcat. Chaos 21(4), 1003–1017 (2011)
Schultz, A.P., Zou, Y., Marwan, N., Turvey, M.T.: Local minima-based recurrence plots for continuous dynamical systems. I. J. Bifurcat. Chaos 21(4), 1065–1075 (2011)
Webber, C.L., Marwan, N., Facchini, A., Giuliani, A.: Simpler methods do it better: success of recurrence quantification analysis as a general purpose data analysis tool. Phys. Lett. A 373(41), 3753–3756 (2009)
Vlahogianni, E.I., Karlaftis, M.G.: Comparing traffic flow time-series under fine and adverse weather conditions using recurrence-based complexity measures. Nonlinear Dyn. 69(4), 1949–1963 (2012)
Choi, J.M., Bae, B.H., Kim, S.Y.: Divergence in perpendicular recurrence plot; quantification of dynamical divergence from short chaotic time series. Phys. Lett. A 263(4–6), 299–306 (1999)
Maulik, U., Bandyopadhyay, S.: Performance evaluation of some clustering algorithms and validity indices. IEEE Trans. Pattern Anal. Mach. Intell. 24(12), 1650–1654 (2002)
Acknowledgments
The proposed recurrence plot-based distance measure for clustering multivariate time series was developed in cooperation with the Volkswagen AG, Wolfsburg. Thanks to Bernd Werther and Matthias Pries for their contribution of expert knowledge and their help in recording vehicular sensor data.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Spiegel, S., Jain, JB., Albayrak, S. (2014). A Recurrence Plot-Based Distance Measure. In: Marwan, N., Riley, M., Giuliani, A., Webber, Jr., C. (eds) Translational Recurrences. Springer Proceedings in Mathematics & Statistics, vol 103. Springer, Cham. https://doi.org/10.1007/978-3-319-09531-8_1
Download citation
DOI: https://doi.org/10.1007/978-3-319-09531-8_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-09530-1
Online ISBN: 978-3-319-09531-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)