Skip to main content

A Comparative Study of Clustering Methods for Long Time-Series Medical Databases

  • Conference paper
Book cover Modeling Decisions for Artificial Intelligence (MDAI 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3131))

Abstract

This paper presents a comparative study of methods for clustering long-term temporal data. We split a clustering procedure into two processes: similarity computation and grouping. As similarity computation methods, we employed dynamic time warping (DTW) and multiscale matching. As grouping methods, we employed conventional agglomerative hierarchical clustering (AHC) and rough sets-based clustering (RC). Using various combinations of these methods, we performed clustering experiments of the hepatitis data set and evaluated validity of the results. The results suggested that (1) complete-linkage (CL) criterion outperformed average-linkage (AL) criterion in terms of the interpret-ability of a dendrogram and clustering results, (2) combination of DTW and CL-AHC constantly produced interpretable results, (3) combination of DTW and RC would be used to find the core sequences of the clusters, (4) multiscale matching may suffer from the treatment of ’no-match’ pairs, however, the problem may be eluded by using RC as a subsequent grouping method.

This work was supported in part by the Grant-in-Aid for Scientific Research on Priority Area (B)(No.759) “Implementation of Active Mining in the Era of Information Flood” by the Ministry of Education, Culture, Science and Technology of Japan.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Keogh, E.: Mining and Indexing Time Series Data. Tutorial at the, IEEE International Conference on Data Mining (2001)

    Google Scholar 

  2. Chu, S., Keogh, E., Hart, D., Pazzani, M.: Iterative Deepening Dynamic Time Warping for Time Series. In: proceedings of the second SIAM International Conference on Data Mining (2002)

    Google Scholar 

  3. Berndt, D.J., Clifford, J.: Using dynamic time warping to find patterns in time series.In: Proceedings of AAAI Workshop on Knowledge Discovery in Databases pp. 359-370 (1994)

    Google Scholar 

  4. Hirano, S., Tsumoto, S.: Mining Similar Temporal Patterns in Long Time-series Data and Its Application to Medicine. In: Proceedings of the IEEE 2002 International Conference on Data Mining, pp. 219–226 (2002)

    Google Scholar 

  5. Ueda, N., Suzuki, S.: A Matching Algorithm of Deformed Planar Curves Using Multiscale Convex/Concave Structures. IEICE Transactions on Information and Systems J73-D-II(7), 992–1000 (1990)

    Google Scholar 

  6. Mokhtarian, F., Mackworth, A.K.: Scale-based Description and Recognition of planar Curves and Two Dimensional Shapes. IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI-8(1), 24–43 (1986)

    Article  Google Scholar 

  7. http://lisp.vse.cz/challenge/ecmlpkdd2003/

  8. Everitt, B.S., Landau, S., Leese, M.: Cluster Analysis, 4th edn. Arnold Publishers (2001)

    Google Scholar 

  9. Hirano, S., Tsumoto, S.: An Indiscernibility-based Clustering Method with Iterative Refinement of Equivalence Relations. Journal of Advanced Computational Intelligence and Intelligent Informatics (2003) (in press)

    Google Scholar 

  10. Lowe, D.G.: Organization of Smooth Image Curves at Multiple Scales. International Journal of Computer Vision 3, 119–130 (1980)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tsumoto, S., Hirano, S. (2004). A Comparative Study of Clustering Methods for Long Time-Series Medical Databases. In: Torra, V., Narukawa, Y. (eds) Modeling Decisions for Artificial Intelligence. MDAI 2004. Lecture Notes in Computer Science(), vol 3131. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27774-3_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27774-3_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22555-3

  • Online ISBN: 978-3-540-27774-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics