Abstract
In this paper, we consider unsupervised clustering as a combinatorial optimization problem. We focus on the use of Local Search procedures to optimize an association coefficient whose aim is to construct a couple of conceptual partitions, one on the set of objects and the other one on the set of attribute-value pairs. We present a study of the variation of the function in order to decrease the complexity of local search and to propose stochastic local search. Performances of the given algorithms are tested on synthetic data sets and the real data set Vote taken from the UCI Irvine repository.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
J. N. Bhuyan, V. V. Raghavan, and V. K. Elayavalli. Genetic algorithm for clustering with ordered representation. In Richard K. Belew and Lashon B. Booker, editors, Proceedings of the Fourth International Conference on Genetic Algorithms, San Mateo, CA, 1991. Morgan Kaufmann Publishers.
G. Celeux, E. Diday, G. Govaert, Y. Lechevallier, and H. Ralambondrainy. Classification automatique des données. Dunod, paris, 1988.
R. M. Cole. Clustering with genetic algorithms. Master’s thesis, University of Western Australia, 1998.
P. Cheeseman and J. Stutz. Bayesian classi.cation (autoclass): Theory and results. Advances in Knowledge Discovery and Data Mining, 1996.
D. H. Fisher. Knowledge acquisition via incremental conceptual clustering. Machine Learning, 2:139–172, 1987.
D. H. Fisher. Iterative optimization and simplification of hierarchical clusterings. Journal of Artificial Intelligence Research, 4:147–180, 1996.
P. Fränti and J. Kivijärvi. Randomised local search algorithm for the clustering problem. Pattern Analysis and Applications, pages 358–369, 2000.
L. A. Goodman and W. H. Kruskal. Measures of association for cross classification. Journal of the American Statistical Association, 49:732–764, 1954.
M. Gyllenberg, T. Koski, T. Lund, and O. Nevalainen. Clustering by adaptive local search with multiple search operators. Pattern Analysis and Applications, pages 348–357, 2000.
G. Govaert. Classification simultanée de tableaux binaires. In E. Diday, M. Jambu, L. Lebart, J. Pages, and R. Tomassone, editors, Data analysis and informatics III, pages 233–236. North Holland, 1984.
A. K. Jain and R. C. Dubes. Algorithms for clustering data. Prentice Hall, Englewood cliffs, New Jersey, 1988.
I.C. Lerman and J. F. P. da Costa. Coefficients d’association et variables á trés grand nombre de catégories dans les arbres de décision: application á l’identification de la structure secondaire d’une protéine. Technical Report 2803, INRIA, février 1996.
G. Matthews and J. Hearne. Clustering without a metric. IEEE Transaction on pattern analysis and machine intelligence, 13(2):175–184, 1991.
W. T. McCormick, P. J. Schweitzer, and T. W. White. Problem decomposition and data reorganization by a clustering technique. Operations Research, 20(5):993–1009, 1972.
M. Olszak. Modélisation des relations de causalité entre variables qualitatives. PhD thesis, Université de Genéve, 1995.
R. Rakotomalala. Graphes d’induction. PhD thesis, Université Claude Bernard Lyon 1, 1997.
C. Robardet and F. Feschet. Comparison of three objective functions for conceptual clustering. In Proceedings of the 5th European Conference on Principles and Practice of Knowledge Discovery in Databases. Springer-Verlag, September 2001.
G. Rudolph. Convergence analysis of canonical genetic algorithms. IEEE Transactions on neuronal networks, 5(1):96–101, 1994.
J.R. Slagle, C. L. Chang, and S. R. Heller. A clustering and datareorganizing. IEEE Transactions On systems, Man and Cybernetics, pages 125–128, January 1975.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Robardet, C., Feschet, F. (2001). Effcient Local Search in Conceptual Clustering. In: Jantke, K.P., Shinohara, A. (eds) Discovery Science. DS 2001. Lecture Notes in Computer Science(), vol 2226. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45650-3_28
Download citation
DOI: https://doi.org/10.1007/3-540-45650-3_28
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42956-2
Online ISBN: 978-3-540-45650-6
eBook Packages: Springer Book Archive