Clustering Rules Using Empirical Similarity of Support Sets

Lele, Shreevardhan; Golden, Bruce; Ozga, Kimberly; Wasil, Edward

doi:10.1007/3-540-45650-3_39

Shreevardhan Lele³,
Bruce Golden³,
Kimberly Ozga⁴ &
…
Edward Wasil⁵

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2226))

Included in the following conference series:

International Conference on Discovery Science

383 Accesses
1 Citations

Abstract

We consider the problem of pruning a given set of if-then rules, such that the support of the pruned rule set is not much less than the support of the given rule set. An empirical measure of similarity between two rules is introduced. This similarity measure is proportional to the degree of overlap between the support sets of the two rules. Using this similarity measure, we cluster the given rule set via the complete linkage algorithm. Rules within a cluster are approximate substitutes for each other and, as such, they can be replaced by a single rule, which is chosen to be the rule whose individual support value is the largest in the cluster. The pruning procedure is demonstrated on a set of rules generated from a marketing data set.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

An innovative clustering approach utilizing frequent item sets

Article 26 April 2024

Decision Rule Clustering—Comparison of the Algorithms

Unsupervised Classifier Based on Heuristic Optimization and Maximum Entropy Principle

References

Agrawal, R., Mannila, H., Srikant, R., Toivonen, H., Verkamo, A.I.: Fast Discovery of Association Rules. In Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (eds.): Advances in Knowledge Discovery and Data Mining. AAAI Press, Menlo Park, California (1996) 307–328
Google Scholar
Han, J., Kamber, M.: Data Mining. Morgan Kaufmann, San Francisco, California (2001)
Google Scholar
Hand, D.: Construction and Assessment of Classification Rules. Wiley, Chichester, England (1997)
MATH Google Scholar
Kohavi, R., Provost, F.: Applications of Data Mining to Electronic Commerce. In Data Mining and Knowledge Discovery, Vol. 5 (2001) 5–10
Article MATH Google Scholar
Lent, B., Swami. A., Widom, J.: Clustering Association Rules. In Proc. 1997 Int. Conf. Data Engineering (ICDE’97), Birmingham, England (1997) 220–231
Google Scholar
Kaufman, L., Rousseeuw, P.J.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York (1990)
Google Scholar
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, California (1993)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Maryland, R.H. Smith School of Business, 20742, College Park, MD, USA
Shreevardhan Lele & Bruce Golden
University of Maryland, Department of Mathematics, 20742, College Park, MD, USA
Kimberly Ozga
American University, Kogod School of Business, 20016, Washington, DC, USA
Edward Wasil

Authors

Shreevardhan Lele
View author publications
You can also search for this author in PubMed Google Scholar
Bruce Golden
View author publications
You can also search for this author in PubMed Google Scholar
Kimberly Ozga
View author publications
You can also search for this author in PubMed Google Scholar
Edward Wasil
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

DFKI GmbH Saarbrücken, 66123, Saarbrücken, Germany
Klaus P. Jantke
Department of Informatics, Kyushu University, 6-10-1 Hakozaki, Higashi-ku, 812-8581, Fukuoka, Japan
Ayumi Shinohara

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lele, S., Golden, B., Ozga, K., Wasil, E. (2001). Clustering Rules Using Empirical Similarity of Support Sets. In: Jantke, K.P., Shinohara, A. (eds) Discovery Science. DS 2001. Lecture Notes in Computer Science(), vol 2226. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45650-3_39

Download citation

DOI: https://doi.org/10.1007/3-540-45650-3_39
Published: 20 December 2001
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42956-2
Online ISBN: 978-3-540-45650-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Clustering Rules Using Empirical Similarity of Support Sets

Abstract

Access this chapter

Preview

Similar content being viewed by others

An innovative clustering approach utilizing frequent item sets

Decision Rule Clustering—Comparison of the Algorithms

Unsupervised Classifier Based on Heuristic Optimization and Maximum Entropy Principle

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Clustering Rules Using Empirical Similarity of Support Sets

Abstract

Access this chapter

Preview

Similar content being viewed by others

An innovative clustering approach utilizing frequent item sets

Decision Rule Clustering—Comparison of the Algorithms

Unsupervised Classifier Based on Heuristic Optimization and Maximum Entropy Principle

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation