Skip to main content

Impact of the Sakoe-Chiba Band on the DTW Time Series Distance Measure for kNN Classification

  • Conference paper
Knowledge Science, Engineering and Management (KSEM 2014)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8793))

Abstract

For classification of time series, the simple 1-nearest neighbor (1NN) classifier in combination with an elastic distance measure such as Dynamic Time Warping (DTW) distance is considered superior in terms of classification accuracy to many other more elaborate methods, including k-nearest neighbor (kNN) with neighborhood size k > 1. In this paper we revisit this apparently peculiar relationship and investigate the differences between 1NN and kNN classifiers in the context of time-series data and constrained DTW distance. By varying neighborhood size k, constraint width r, and evaluating 1NN and kNN with and without distance-based weighting in different schemes of cross-validation, we show that the first nearest neighbor indeed has special significance in labeled time-series data, but also that weighting can drastically improve the accuracy of kNN. This improvement is manifested by better accuracy of weighted kNN than 1NN for small values of k (3–4), better accuracy of weighted kNN than unweighted kNN in general, and reduced need to use large values of constraint r with weighted kNN.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Faloutsos, C., Ranganathan, M., Manolopoulos, Y.: Fast subsequence matching in time-series databases. ACM SIGMOD Rec. 23, 419–429 (1994)

    Article  Google Scholar 

  2. Berndt, D., Clifford, J.: Using dynamic time warping to find patterns in time series. In: KDD Workshop, Seattle, WA, pp. 359–370 (1994)

    Google Scholar 

  3. Vlachos, M., Kollios, G., Gunopulos, D.: Discovering similar multidimensional trajectories. In: Proceedings of the 18th International Conference on Data Engineering (ICDE), pp. 673–684. IEEE Comput. Soc. (2002)

    Google Scholar 

  4. Chen, L., Ng, R.: On The Marriage of Lp-norms and Edit Distance. In: Proceedings of the 30th International Conference on Very Large Data Bases (VLDB), pp. 792–803. VLDB Endowment (2004)

    Google Scholar 

  5. Chen, L., Özsu, M.T., Oria, V.: Robust and fast similarity search for moving object trajectories. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 491–502. ACM, New York (2005)

    Google Scholar 

  6. Keogh, E., Kasetty, S.: On the need for time series data mining benchmarks: A survey and empirical demonstration. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 102–111. ACM, New York (2002)

    Google Scholar 

  7. Ding, H., Trajcevski, G., Scheuermann, P., Wang, X., Keogh, E.: Querying and mining of time series data: Experimental comparison of representations and distance measures. Proc. VLDB Endow. 1, 1542–1552 (2008)

    Article  Google Scholar 

  8. Esling, P., Agon, C.: Time-series Data Mining. ACM Comput. Surv. 45, 12:1–12:34 (2012)

    Google Scholar 

  9. Xi, X., Keogh, E., Shelton, C., Wei, L., Ratanamahatana, C.A.: Fast time series classification using numerosity reduction. In: Proceedings of the 23rd International Conference on Machine Learning (ICML), pp. 1033–1040. ACM, New York (2006)

    Chapter  Google Scholar 

  10. Keogh, E., Zhu, Q., Hu, B., Y., H., Xi, X., Wei, L., Ratanamahatana, C.A.: The UCR Time Series Classification/Clustering Homepage, www.cs.ucr.edu/~eamonn/time_series_data/,citeulike-article-id:2139261

  11. Agrawal, R., Faloutsos, C., Swami, A.: Efficient similarity search in sequence databases. In: Lomet, D.B. (ed.) FODO 1993. LNCS, vol. 730, pp. 69–84. Springer, Heidelberg (1993)

    Chapter  Google Scholar 

  12. Chan, K.-P., Fu, A.W.-C.: Efficient time series matching by wavelets. In: Proceedings of the 15th International Conference on Data Engineering (ICDE), pp. 126–133. IEEE Comput. Soc. (1999)

    Google Scholar 

  13. Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Dimensionality reduction for fast similarity search in large time series databases. Knowl. Inf. Syst. 3, 263–286 (2001)

    Article  MATH  Google Scholar 

  14. Keogh, E., Chakrabarti, K., Pazzani, M., Mehrotra, S.: Locally adaptive dimensionality reduction for indexing large time series databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 151–162. ACM, New York (2001)

    Google Scholar 

  15. Ratanamahatana, C.A., Keogh, E.: Three myths about dynamic time warping data mining. In: Proceedings of the 5th SIAM International Conference on Data Mining (SDM), pp. 506–510 (2005)

    Google Scholar 

  16. Keogh, E.: Exact indexing of dynamic time warping. In: Proceedings of the 28th International Conference on Very Large Data Bases (VLDB), pp. 406–417. VLDB Endowment (2002)

    Google Scholar 

  17. Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE T. Acoust. Speech Signal Process. 26, 43–49 (1978)

    Article  MATH  Google Scholar 

  18. Kurbalija, V., Radovanović, M., Geler, Z., Ivanović, M.: The influence of global constraints on DTW and LCS similarity measures for time-series databases. In: Dicheva, D., Markov, Z., Stefanova, E. (eds.) Third International Conference on Software, Services and Semantic Technologies S3T 2011. AISC, vol. 101, pp. 67–74. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  19. Kurbalija, V., Radovanović, M., Geler, Z., Ivanović, M.: The influence of global constraints on similarity measures for time-series databases. Knowl-Based Syst. 56, 49–67 (2014)

    Article  Google Scholar 

  20. Radovanović, M., Nanopoulos, A., Ivanović, M.: Time-series classification in many intrinsic dimensions. In: Proceedings of the 10th SIAM International Conference on Data Mining (SDM), Columbus, Ohio, USA, pp. 677–688 (2010)

    Google Scholar 

  21. Dudani, S.A.: The distance-weighted k-nearest-neighbor rule. IEEE T. Syst. Man Cy. 6, 325–327 (1976)

    Article  Google Scholar 

  22. Macleod, J.E.S., Luk, A., Titterington, D.M.: A re-examination of the distance-weighted k-nearest neighbor classification rule. IEEE T. Syst. Man Cy. 17, 689–696 (1987)

    Article  Google Scholar 

  23. Gou, J., Xiong, T., Kuang, Y.: A novel weighted voting for k-nearest neighbor rule. J. Comput. 6, 833–840 (2011)

    Article  Google Scholar 

  24. Gou, J., Du, L., Zhang, Y., Xiong, T.: A new distance-weighted k-nearest neighbor classifier. J. Inf. Comput. Sci. 9, 1429–1436 (2012)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Geler, Z., Kurbalija, V., Radovanović, M., Ivanović, M. (2014). Impact of the Sakoe-Chiba Band on the DTW Time Series Distance Measure for kNN Classification. In: Buchmann, R., Kifor, C.V., Yu, J. (eds) Knowledge Science, Engineering and Management. KSEM 2014. Lecture Notes in Computer Science(), vol 8793. Springer, Cham. https://doi.org/10.1007/978-3-319-12096-6_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12096-6_10

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12095-9

  • Online ISBN: 978-3-319-12096-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics