Abstract
We develop a highly efficient access method, called Delta-Top-Index, to answer top-k subsequence matching queries over a time series data set. Compared to a naïve implementation, our index has a storage cost that is up to two orders of magnitude smaller, while providing answers within microseconds. We demonstrate the efficiency and effectiveness of our technique in an experimental evaluation with real-world data.
Notes
1. Altogether there are around twenty different parameters.
2. We use integers to simplify matters; the actual temperatures are represented by real numbers.
3. Remember that \(P^{\varvec{x}\varvec{y}}[i,0] := 0\) and \(P^{\varvec{x}\varvec{y}}[0,j] := 0\).
4. We avoid race conditions by protecting top list modifications with a critical section.
5. We will look at a more sophisticated implementation in the following section.
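The note on race conditions can be illustrated with a minimal sketch. It is not the authors' implementation: the `TopList` class, its `offer` and `results` methods, the `scan` helper, and the sample distances are all hypothetical, and a Python `threading.Lock` stands in for whatever critical-section mechanism the paper actually uses. The sketch only shows the general idea of several workers feeding candidate matches into one shared top-k list while every modification happens under a lock.

```python
import heapq
import threading


class TopList:
    """Hypothetical shared list holding the k smallest-distance matches."""

    def __init__(self, k):
        self.k = k
        self._heap = []  # max-heap on distance, stored as negated values
        self._lock = threading.Lock()

    def offer(self, distance, position):
        # Critical section: only one thread may modify the heap at a time.
        with self._lock:
            if len(self._heap) < self.k:
                heapq.heappush(self._heap, (-distance, position))
            elif -self._heap[0][0] > distance:
                # The new candidate beats the current worst entry.
                heapq.heapreplace(self._heap, (-distance, position))

    def results(self):
        # Return (distance, position) pairs, best match first.
        with self._lock:
            return sorted((-d, p) for d, p in self._heap)


def scan(top, candidates):
    """Worker: offer every (distance, position) candidate to the shared list."""
    for distance, position in candidates:
        top.offer(distance, position)


# Made-up candidates, split into three chunks processed in parallel.
chunks = [[(5.0, 0), (1.0, 1)], [(4.0, 2), (0.5, 3)], [(2.0, 4), (9.0, 5)]]
top = TopList(k=3)
threads = [threading.Thread(target=scan, args=(top, c)) for c in chunks]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(top.results())
```

Because the comparison and the replacement happen inside the same locked region, no thread can evict an entry based on a stale view of the current worst match.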
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Piatov, D., Helmer, S., Gamper, J. (2017). Interactive Time Series Subsequence Matching. In: Kirikova, M., Nørvåg, K., Papadopoulos, G. (eds) Advances in Databases and Information Systems. ADBIS 2017. Lecture Notes in Computer Science, vol 10509. Springer, Cham. https://doi.org/10.1007/978-3-319-66917-5_6
DOI: https://doi.org/10.1007/978-3-319-66917-5_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66916-8
Online ISBN: 978-3-319-66917-5
eBook Packages: Computer Science, Computer Science (R0)