Abstract
Using domain knowledge in unsupervised learning has shown to be a useful strategy when the set of examples of a given domain has not an evident structure or presents some level of noise. This background knowledge can be expressed as a set of classification rules and introduced as a semantic bias during the learning process.
In this work we present some experiments on the use of partial domain knowledge in conceptual clustering. The domain knowledge (or domain theory) is used to select a set of examples that will be used to start the learning process, this knowledge has not to be complete neither consistent. This bias will increase the quality of the final groups and reduce the effect of the order of the examples. Some measures of stability of classification are used as evaluation method.
The improvement of the acquired concepts can be used to improve and correct the domain knowledge. A set of heuristics to revise the original domain theory has been experimented, yielding to some interesting results.
This work has been financed by UPC grup precompetitiu PR99-09
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
J.Béjar. Adquisición de conocimiento en dominios poco estructurados. PhD thesis, Dep. of Software. Universitat Politécnica de Catalunya, 1995. 47, 48, 50, 51, 53
J. Béjar, U. Cortés, and R. Sangüesa. Experiments with Domain Knowledge in Knowledge Discovery 1st International Conference on the Practical Application of Knowledge Discovery and Data Mining, London, 1997. 47, 50
J. Béjar, U. Cortés. Experiments with Domain Knowledge in Unsupervised Learning: Using and Revising Theories. Revista Iberoamericana de Computación. Computación y Sistemas 1(3). pags 136–144. 1998. 47, 48, 49, 50, 52
D. Faith, L. Belbin. Comparison of classifications using measures intermediate between metric dissimilarity and consensus similarity. Journal of Classification, 3:257–280, 1986. 50
D. Fisher. Knowledge acquisition via incremental conceptual clustering. Machine Learning, 2:139–172, 1987. 47, 48
A. K. Jain, R. C. Dubes. Algorithms for clustering data. Prentice Hall, 1989. 48
R. Michalski, R. E. Steep. Learning from observation: Conceptual Clustering, Machine Learning an A.I. Perspective, pages 331–363. Ed. Tioga, Palo Alto, California, 1983. 47, 48
R. S. Michalski, R. L. Chilausky. Learning by being told and learning from examples: An experimental comparison of the two methods of knowledge acquisition in the context of developing an expert system for soybean disease diagnosis. International Journal of Policy Analysis and Information Systems, 4(2), 1980. 49
P. M. Murphy, D. W. Aha. UCI repository of machine learning databases. Irvine, University of California, Department of Information and Computer Science, 1994. 49
D. Ourston and R. J. Mooney. Theory refinement combining analytical and empirical methods. Artificial Intelligence, 1993. 50
E. Rojo. Aplicación del software LINNEO a la clasificación de transtornos mentales. PhD thesis, Facultat de Medicina. Universitat de Barcelona, 1993. 48
M. Sànchez, U. Cortés, J. Béjar, J. de Gracia, J. Lafuente, and M. Poch. Concept formation in WWWTP by means of classification techniques: A compared study. Applied Intelligence 7, pags 147–165, 1997. 48
L. Talavera, J. Bejar. Integrating Declarative Knowledge in Hierarchical Clustering Tasks. Third Symposium on Intelligent Data Analysis. Pages 211–222, Amsterdam, The Netherlands. LNCS vol. 1642. Springer Verlag, 1999. 50
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Béjar, J. (2000). Improving Knowledge Discovery Using Domain Knowledge in Unsupervised Learning. In: López de Mántaras, R., Plaza, E. (eds) Machine Learning: ECML 2000. ECML 2000. Lecture Notes in Computer Science(), vol 1810. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45164-1_6
Download citation
DOI: https://doi.org/10.1007/3-540-45164-1_6
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67602-7
Online ISBN: 978-3-540-45164-8
eBook Packages: Springer Book Archive