Abstract
We consider the problem of clustering a given dataset into k clusters subject to an additional set of constraints on relative distance comparisons between the data items. The additional constraints are meant to reflect side information that is not directly expressed in the feature vectors. Relative comparisons can express structure at a finer level of detail than the must-link (ML) and cannot-link (CL) constraints commonly used for semi-supervised clustering. Relative comparisons are particularly useful in settings where giving an ML or a CL constraint is difficult because the granularity of the true clustering is unknown.
Our main contribution is an efficient algorithm for learning a kernel matrix using the log-determinant divergence (a variant of the Bregman divergence) subject to a set of relative distance constraints. Given the learned kernel matrix, a clustering can be obtained by any suitable algorithm, such as kernel k-means. We show empirically that kernels found by our algorithm yield clusterings of higher quality than existing approaches that either use ML/CL constraints or a different mechanism for incorporating supervision from relative comparisons.
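To make the pipeline concrete, the sketch below shows the downstream step only: running kernel k-means on a precomputed kernel matrix, and checking a relative comparison (i, j, k) — "i is closer to j than to k" — via kernel-induced distances. It does not implement the paper's LogDet kernel-learning step; a plain linear kernel stands in for the learned kernel, and the function names are illustrative.

```python
import numpy as np

def kernel_dist2(K, i, j):
    """Squared distance between items i and j in the feature space induced by kernel K."""
    return K[i, i] + K[j, j] - 2.0 * K[i, j]

def kernel_kmeans(K, k, n_iter=100, seed=0):
    """Lloyd-style kernel k-means on a precomputed kernel matrix K (n x n)."""
    n = K.shape[0]
    labels = np.random.default_rng(seed).integers(0, k, size=n)
    diag = np.diag(K)
    for _ in range(n_iter):
        dist = np.full((n, k), np.inf)
        for c in range(k):
            mask = labels == c
            m = mask.sum()
            if m == 0:
                continue  # empty cluster: leave its column at infinity
            # ||phi(x_i) - mu_c||^2 expanded in kernel entries:
            # K_ii - (2/m) sum_{j in c} K_ij + (1/m^2) sum_{j,l in c} K_jl
            dist[:, c] = (diag
                          - 2.0 * K[:, mask].sum(axis=1) / m
                          + K[np.ix_(mask, mask)].sum() / m ** 2)
        new = dist.argmin(axis=1)
        if np.array_equal(new, labels):
            break
        labels = new
    return labels
```

A relative constraint (i, j, k) is satisfied in the learned space exactly when `kernel_dist2(K, i, j) < kernel_dist2(K, i, k)`; in the paper's setting, K would be the output of the LogDet-based learning step rather than a fixed linear kernel.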
Copyright information
© 2015 Springer International Publishing Switzerland
Cite this paper
Amid, E., Gionis, A., Ukkonen, A. (2015). A Kernel-Learning Approach to Semi-supervised Clustering with Relative Distance Comparisons. In: Appice, A., Rodrigues, P., Santos Costa, V., Soares, C., Gama, J., Jorge, A. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2015. Lecture Notes in Computer Science, vol 9284. Springer, Cham. https://doi.org/10.1007/978-3-319-23528-8_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23527-1
Online ISBN: 978-3-319-23528-8