Partition-Based Clustering Using Constraint Optimization

Grossi, Valerio; Guns, Tias; Monreale, Anna; Nanni, Mirco; Nijssen, Siegfried

doi:10.1007/978-3-319-50137-6_11

Partition-Based Clustering Using Constraint Optimization

Valerio Grossi¹⁹,
Tias Guns²¹,
Anna Monreale¹⁹,
Mirco Nanni²⁰ &
…
Siegfried Nijssen^21,22

Chapter
First Online: 03 December 2016

1546 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10101))

Abstract

Partition-based clustering is the task of partitioning a dataset in a number of groups of examples, such that examples in each group are similar to each other. Many criteria for what constitutes a good clustering have been identified in the literature; furthermore, the use of additional constraints to find more useful clusterings has been proposed. In this chapter, it will be shown that most of these clustering tasks can be formalized using optimization criteria and constraints. We demonstrate how a range of clustering tasks can be modelled in generic constraint programming languages with these constraints and optimization criteria. Using the constraint-based modeling approach we also relate the DBSCAN method for density-based clustering to the label propagation technique for community discovery.

Siegfried Nijssen can currently be reached at the Institute of Information and Communication Technologies, Electronics and Applied Mathematics, UC Louvain, Belgium.

Tias Guns can currently be reached at the Vrije Universiteit Brussel.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

Aggarwal, C.C., Reddy, C.K.: Data Clustering: Algorithms and Applications, 1st edn. Chapman & Hall/CRC, Boca Raton (2013)
MATH Google Scholar
Babaki, B., Guns, T., Nijssen, S.: Constrained clustering using column generation. In: Simonis, H. (ed.) CPAIOR 2014. LNCS, vol. 8451, pp. 438–454. Springer, Heidelberg (2014). doi:10.1007/978-3-319-07046-9_31
Chapter Google Scholar
Basu, S., Davidson, I., Wagstaff, K., Clustering, C.: Advances in Algorithms, Theory, and Applications, 1st edn. Chapman & Hall/CRC, Boca Raton (2008)
Google Scholar
Berthold, M.R., Borgelt, C., Hppner, F., Klawonn, F.: Guide to Intelligent Data Analysis: How to Intelligently Make Sense of Real Data, 1st edn. Springer, Heidelberg (2010)
Book MATH Google Scholar
Coscia, M., Giannotti, F., Pedreschi, D.: A classification for community discovery methods in complex networks. Stat. Anal. Data Min. 4(5), 512–546 (2011)
Article MathSciNet Google Scholar
Dao, T., Duong, K., Vrain, C.: A filtering algorithm for constrained clustering with within-cluster sum of dissimilarities criterion. In: 2013 IEEE 25th International Conference on Tools with Artificial Intelligence, Herndon, VA, USA, 4–6 November 2013, pp. 1060–1067 (2013)
Google Scholar
Dao, T.-B.-H., Duong, K.-C., Vrain, C.: A declarative framework for constrained clustering. In: Blockeel, H., Kersting, K., Nijssen, S., Železný, F. (eds.) ECML PKDD 2013. LNCS (LNAI), vol. 8190, pp. 419–434. Springer, Heidelberg (2013). doi:10.1007/978-3-642-40994-3_27
Chapter Google Scholar
Dao, T.-B.-H., Duong, K.-C., Vrain, C.: Constrained clustering by constraint programming. Artif. Intell. (2015)
Google Scholar
Davidson, I., Ravi, S.: The complexity of non-hierarchical clustering with instance and cluster level constraints. Data Min. Knowl. Disc. 14(1), 25–61 (2007)
Article MathSciNet Google Scholar
Davidson, I., Ravi, S.S.: Clustering with constraints: feasibility issues and the k-means algorithm. In: Proceedings of the 2005 SIAM International Conference on Data Mining, SDM 2005, Newport Beach, CA, USA, 21–23 April 2005, pp. 138–149 (2005)
Google Scholar
du Merle, O., Hansen, P., Jaumard, B., Mladenovic, N.: An interior point algorithm for minimum sum-of-squares clustering. SIAM J. Sci. Comput. 21(4), 1485–1505 (1999)
Article MathSciNet MATH Google Scholar
Ester, M., Kriegel, H.-P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Simoudis, E., Han, J., Fayyad, U.M. (eds.) KDD, pp. 226–231. AAAI Press, Menlo Park (1996)
Google Scholar
Gonzalez, T.F.: Clustering to minimize the maximum intercluster distance. Theor. Comput. Sci. 38, 293–306 (1985)
Article MathSciNet MATH Google Scholar
Hansen,P., Aloise, D.: A survey on exact methods for minimum sum-of-squares clustering, pp. 1–2, January 2009. http://www.math.iit.edu/Buck65files/msscStLouis.pdf
Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. 31(3), 264–323 (1999)
Article Google Scholar
Mueller, M., Kramer, S.: Integer linear programming models for constrained clustering. In: Pfahringer, B., Holmes, G., Hoffmann, A. (eds.) DS 2010. LNCS (LNAI), vol. 6332, pp. 159–173. Springer, Heidelberg (2010). doi:10.1007/978-3-642-16184-1_12
Chapter Google Scholar
Raghavan, U.N., Albert, R., Kumara, S.: Near linear time algorithm to detect community structures in large-scale networks. Phys. Rev. E 76(2), 036106+ (2007)
Article Google Scholar
Wagstaff, K., Cardie,C.: Clustering with instance-level constraints. In: Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2000), Stanford University, Stanford, CA, USA, June 29–July 2 2000, pp. 1103–1110 (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Pisa, Pisa, Italy
Valerio Grossi & Anna Monreale
ISTI - CNR, Pisa, Italy
Mirco Nanni
DTAI, KU Leuven, Leuven, Belgium
Tias Guns & Siegfried Nijssen
LIACS, Universiteit Leiden, Leiden, The Netherlands
Siegfried Nijssen

Authors

Valerio Grossi
View author publications
You can also search for this author in PubMed Google Scholar
Tias Guns
View author publications
You can also search for this author in PubMed Google Scholar
Anna Monreale
View author publications
You can also search for this author in PubMed Google Scholar
Mirco Nanni
View author publications
You can also search for this author in PubMed Google Scholar
Siegfried Nijssen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Siegfried Nijssen .

Editor information

Editors and Affiliations

Université Montpellier 2, Montpellier, France
Christian Bessiere
KU Leuven, Heverlee, Belgium
Luc De Raedt
University of British Columbia, Vancouver, British Columbia, Canada
Lars Kotthoff
Université Catholique de Louvain, Louvain-la-Neuve, Belgium
Siegfried Nijssen
University College Cork, Cork, Ireland
Barry O'Sullivan
University of Pisa, Pisa, Italy
Dino Pedreschi

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Grossi, V., Guns, T., Monreale, A., Nanni, M., Nijssen, S. (2016). Partition-Based Clustering Using Constraint Optimization. In: Bessiere, C., De Raedt, L., Kotthoff, L., Nijssen, S., O'Sullivan, B., Pedreschi, D. (eds) Data Mining and Constraint Programming. Lecture Notes in Computer Science(), vol 10101. Springer, Cham. https://doi.org/10.1007/978-3-319-50137-6_11

Download citation

DOI: https://doi.org/10.1007/978-3-319-50137-6_11
Published: 03 December 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-50136-9
Online ISBN: 978-3-319-50137-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics