Disjunctive Learning with a Soft-Clustering Method

Cleuziou, Guillaume; Martin, Lionel; Vrain, Christel

doi:10.1007/978-3-540-39917-9_7

Guillaume Cleuziou⁸,
Lionel Martin⁸ &
Christel Vrain⁸

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2835))

Included in the following conference series:

International Conference on Inductive Logic Programming

378 Accesses
2 Citations

Abstract

In the case of concept learning from positive and negative examples, it is rarely possible to find a unique discriminating conjunctive rule; in most cases, a disjunctive description is needed. This problem, known as disjunctive learning, is mainly solved by greedy methods, iteratively adding rules until all positive examples are covered. Each rule is determined by discriminating properties, where the discriminating power is computed from the learning set. Each rule defines a subconcept of concept to be learned with these methods. The final set of sub-concepts is then highly dependent from both the learning set and the learning method.

In this paper, we propose a different strategy: we first build clusters of similar examples thus defining subconcepts, and then we characterize each cluster by a unique conjunctive definition. The clustering method relies on a similarity measure designed for examples described in first order logic. The main particularity of our clustering method is to build “soft clusters”, i.e. allowing some objects to belong to different groups. Once clusters have been built, we learn first-order rules defining the clusters, using a general-to-specific method: each step consists in adding a literal that covers all examples of a group and rejects as many negative examples as possible.

This strategy limits some drawbacks of greedy algorithms and induces a strong reduction of the hypothesis space: for each group (subconcept), the search space is reduced to the set of rules that cover all the examples of the group and reject the negative examples of the concept.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Baraldi, A., Blonda, P.: A survey of fuzzy clustering algorithms for pattern recognition. II. IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics) 29, 786–801 (1999)
Article Google Scholar
Bomze, I., Budinich, M., Pardalos, P., Pelillo, M.: The maximum clique problem. In: Du, D.-Z., Pardalos, P.M. (eds.) Handbook of Combinatorial Optimization, vol. 4, Kluwer Academic Publishers, Boston (1999)
Google Scholar
Ben-Dor, A., Shamir, R., Yakhini, Z.: Clustering gene expression patterns. Journal of Computational Biology 6(3/4), 281–297 (1999)
Article Google Scholar
Berkhin, P.: Survey of clustering data mining techniques. Technical report, Accrue Software, San Jose, CA (2002)
Google Scholar
Bisson, G.: Learning in FOL with a similarity measure. In: 11th National Conf. on Artificial Intelligence (AAAI), San Jose, CA, pp. 82–87. AAAI Press, Menlo Park (1992)
Google Scholar
Dzeroski, S., de Raedt, L., Lavrac, N.: Multiple predicate learning. In: Proceedings of the Thirteen International Joint Conference on Artificial Intelligence, Chambéry, France, pp. 1037–1043. Springer, Heidelberg (1993)
Google Scholar
Emde, W., Wettschereck, D.: Relational instance-based learning. In: Saitta, L. (ed.) 13th Int. Conf. on Machine Learning (ICML 1996), Bari, Italy, pp. 122–130. Morgan & Kaufmann, San Francisco (1996)
Google Scholar
Hoff, W.A., Michalski, R.S., Stepp, R.E.: INDUCE 2: A program for learning structural descriptions from examples. Technical Report 904, Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, Illinois (1983)
Google Scholar
Krishnapuram, R., Keller, J.: A possibilistic approach to clustering. IEEE Transactions on Fuzzy Systems 1(2), 98–110 (1993)
Article Google Scholar
Martin, L., Moal, F.: A language-based similarity measure. In: Flach, P.A., De Raedt, L. (eds.) ECML 2001. LNCS (LNAI), vol. 2167, pp. 336–347. Springer, Heidelberg (2001)
Chapter Google Scholar
Muggleton, S.: Inverse entailment and Progol. New Generation Computing, Special issue on Inductive Logic Programming 13(3-4), 245–286 (1995)
Google Scholar
Quinlan, J.R., Cameron-Jones, R.M.: Induction of logic programs: FOIL and related systems. New Generation Computing, Special issue on Inductive Logic Programming 13(3-4), 287–312 (1995)
Google Scholar
Semeraro, G., Brunk, C.A., Pazzani, M.J.: Traps and pitfalls when learning logical theories: A case study with FOIL and FOCL. Technical Report ICS-TR-93-33 (July 1993)
Google Scholar
Sebag, M.: Distance induction in first order logic. In: Džeroski, S., Lavrač, N. (eds.) ILP 1997. LNCS, vol. 1297, pp. 264–272. Springer, Heidelberg (1997)
Google Scholar
Sebag, M., Schoenauer, M.: A Rule-based Similarity Measure. In: Wess, S., Richter, M., Althoff, K.-D. (eds.) EWCBR 1993. LNCS, vol. 837, pp. 119–130. Springer, Heidelberg (1994)
Google Scholar

Download references

Author information

Authors and Affiliations

LIFO, Laboratoire d’Informatique Fondamentale d’Orléans, Faculté des Sciences, Rue Léonard de Vinci, B.P. 6759, 45067, Orléans cedex2, France
Guillaume Cleuziou, Lionel Martin & Christel Vrain

Authors

Guillaume Cleuziou
View author publications
You can also search for this author in PubMed Google Scholar
Lionel Martin
View author publications
You can also search for this author in PubMed Google Scholar
Christel Vrain
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Fraunhofer IAIS, Schloss Birlinghoven, Sankt Augustin, Germany
Tamás Horváth
Graduate School of Informatics, Kyoto University Yoshida Honmachi, 606-850, Sakyo-ku, Kyoto, Japan
Akihiro Yamamoto

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Cleuziou, G., Martin, L., Vrain, C. (2003). Disjunctive Learning with a Soft-Clustering Method. In: Horváth, T., Yamamoto, A. (eds) Inductive Logic Programming. ILP 2003. Lecture Notes in Computer Science(), vol 2835. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39917-9_7

Download citation

DOI: https://doi.org/10.1007/978-3-540-39917-9_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20144-1
Online ISBN: 978-3-540-39917-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics