Incremental OPTICS: Efficient Computation of Updates in a Hierarchical Cluster Ordering

Kriegel, Hans-Peter; Kröoger, Peer; Gotlibovich, Irina

doi:10.1007/978-3-540-45228-7_23

Hans-Peter Kriegel⁷,
Peer Kröoger⁷ &
Irina Gotlibovich⁷

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2737))

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

491 Accesses
13 Citations

Abstract

Data warehouses are a challenging field of application for data mining tasks such as clustering. Usually, updates are collected and applied to the data warehouse periodically in a batch mode. As a consequence, all mined patterns discovered in the data warehouse (e.g. clustering structures) have to be updated as well. In this paper, we present a method for incrementally updating the clustering structure computed by the hierarchical clustering algorithm OPTICS. We determine the parts of the cluster ordering that are affected by update operations and develop efficient algorithms that incrementally update an existing cluster ordering. A performance evaluation of incremental OPTICS based on synthetic datasets as well as on a real-world dataset demonstrates that incremental OPTICS gains significant speed-up factors over OPTICS for update operations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

McQueen, J.: Some Methods for Classification and Analysis of Multivariate Observations. In: 5th Berkeley Symp. Math. Statist. Prob. vol. 1, pp. 281–297 (1967)
Google Scholar
Ng, R., Han, J.: Efficient and Affective Clustering Methods for Spatial Data Mining. In: Proc. 20th Int. Conf. on Very Large Databases (VLDB 1994), Santiago, Chile, pp. 144–155 (1994)
Google Scholar
Zhang, T., Ramakrishnan, R. Livny, M.: BIRCH: An Efficient Data Clustering Method for Very Large Databases. In: Proc. ACM SIGMOD Int. Conf. on Management of Data (SIGMOD 1996), Montreal, Canada, pp. 103–114 (1996)
Google Scholar
Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise. In: Proc. 2nd Int. Conf. on Knowledge Discovery and Data Mining (KDD 1996), Portland, OR, pp. 291–316. AAAI Press, Menlo Park (1996)
Google Scholar
Ankerst, M., Breunig, M.M., Kriegel, H.P., Sander, J.: OPTICS: Ordering Points to Identify the Clustering Structure. In: Proc. ACM SIGMOD Int. Conf. on Management of Data (SIGMOD 1999), Philadelphia, PA, pp. 49–60 (1999)
Google Scholar
Ester, M., Kriegel, H.P., Sander, J., Wimmer, M., Xu, X.: Incremental Clustering for Mining in a Data Warehousing Environment. In: Proc. 24th Int. Conf. on Very Large Databases (VLDB 1998), pp. 323–333 (1998)
Google Scholar
Feldman, R., Aumann, Y., Amir, A., Mannila, H.: Efficient Algorithms for Discovering Frequent Sets in Incremental Databases. In: Proc. ACM SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery, Tucson, AZ, pp. 59–66 (1997)
Google Scholar
Ester, M., Wittmann, R.: Incremental Generalization for Mining in a Data Warehousing Environment. In: Schek, H.-J., Saltor, F., Ramos, I., Alonso, G. (eds.) EDBT 1998. LNCS, vol. 1377, pp. 135–152. Springer, Heidelberg (1998)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Institute for Computer Science, University of Munich, Germany
Hans-Peter Kriegel, Peer Kröoger & Irina Gotlibovich

Authors

Hans-Peter Kriegel
View author publications
You can also search for this author in PubMed Google Scholar
Peer Kröoger
View author publications
You can also search for this author in PubMed Google Scholar
Irina Gotlibovich
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Graduate School of Informatics, Kyoto University, Yoshida-Honmachi, Sakyo, 606-8501, Kyoto, Japan
Yahiko Kambayashi
I.B.M. India Research Lab, India
Mukesh Mohania
Institute for Application Oriented Knowledge Processing (FAW), Johannes Kepler University Linz, Austria
Wolfram Wöß

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kriegel, HP., Kröoger, P., Gotlibovich, I. (2003). Incremental OPTICS: Efficient Computation of Updates in a Hierarchical Cluster Ordering. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2003. Lecture Notes in Computer Science, vol 2737. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45228-7_23

Download citation

DOI: https://doi.org/10.1007/978-3-540-45228-7_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40807-9
Online ISBN: 978-3-540-45228-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics