Abstract
In this paper, we propose a kernel-based clustering algorithm that highlights both the major trends and the atypical behaviours present in a dataset, so as to provide a complete characterisation of the data; thanks to the kernel framework, the algorithm can be applied independently of the data nature without requiring any adaptation. We apply it to xml data describing student results to several exams: we propose a kernel to handle such data and present the results obtained with a real dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
COLLINS, M. and DUFFY, N (2002): Convolution kernels for natural language. In Advances in Neural Information Processing Systems, NIPS 14, pages 625–632.
KASHIMA, H. and KOYANAGI, T. (2002): Kernels for semi-structured data. In Proc. of ICML’02, pages 291–298.
KLAWONN, F., KRUSE, R. and TIMM, H. (1997): Fuzzy shell cluster analysis. In G. della Riccia, H. Lenz, and R. Kruse, editors, Learning, networks and statistics, pages 105–120. Springer.
LESOT, M.-J. and BOUCHON-MEUNIER, B. (2004): Descriptive concept extraction with exceptions by hybrid clustering. In Proc. of FUZZ-IEEE’04, pages 389–394.
SCHÖLKOPF, B. and SMOLA, A. (2002): Learning with kernels. MIT Press.
VAPNIK, V. (1995): The nature of statistical learning theory. Springer, New York.
WU, Z., XIE, W. and YU, J. (2003): Fuzzy c-means clustering algorithm based on kernel method. In Proc. of ICCIMA’03, pages 1–6.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer Berlin · Heidelberg
About this paper
Cite this paper
Lesot, MJ. (2006). Outlier Preserving Clustering for Structured Data Through Kernels. In: Spiliopoulou, M., Kruse, R., Borgelt, C., Nürnberger, A., Gaul, W. (eds) From Data and Information Analysis to Knowledge Engineering. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31314-1_56
Download citation
DOI: https://doi.org/10.1007/3-540-31314-1_56
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31313-7
Online ISBN: 978-3-540-31314-4
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)