Skip to main content

Outlier Preserving Clustering for Structured Data Through Kernels

  • Conference paper
From Data and Information Analysis to Knowledge Engineering

Abstract

In this paper, we propose a kernel-based clustering algorithm that highlights both the major trends and the atypical behaviours present in a dataset, so as to provide a complete characterisation of the data; thanks to the kernel framework, the algorithm can be applied independently of the data nature without requiring any adaptation. We apply it to xml data describing student results to several exams: we propose a kernel to handle such data and present the results obtained with a real dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 159.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • COLLINS, M. and DUFFY, N (2002): Convolution kernels for natural language. In Advances in Neural Information Processing Systems, NIPS 14, pages 625–632.

    Google Scholar 

  • KASHIMA, H. and KOYANAGI, T. (2002): Kernels for semi-structured data. In Proc. of ICML’02, pages 291–298.

    Google Scholar 

  • KLAWONN, F., KRUSE, R. and TIMM, H. (1997): Fuzzy shell cluster analysis. In G. della Riccia, H. Lenz, and R. Kruse, editors, Learning, networks and statistics, pages 105–120. Springer.

    Google Scholar 

  • LESOT, M.-J. and BOUCHON-MEUNIER, B. (2004): Descriptive concept extraction with exceptions by hybrid clustering. In Proc. of FUZZ-IEEE’04, pages 389–394.

    Google Scholar 

  • SCHÖLKOPF, B. and SMOLA, A. (2002): Learning with kernels. MIT Press.

    Google Scholar 

  • VAPNIK, V. (1995): The nature of statistical learning theory. Springer, New York.

    Google Scholar 

  • WU, Z., XIE, W. and YU, J. (2003): Fuzzy c-means clustering algorithm based on kernel method. In Proc. of ICCIMA’03, pages 1–6.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer Berlin · Heidelberg

About this paper

Cite this paper

Lesot, MJ. (2006). Outlier Preserving Clustering for Structured Data Through Kernels. In: Spiliopoulou, M., Kruse, R., Borgelt, C., Nürnberger, A., Gaul, W. (eds) From Data and Information Analysis to Knowledge Engineering. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31314-1_56

Download citation

Publish with us

Policies and ethics