Itemset materializing for fast mining of association rules

Wojciechowski, Marek; Zakrzewicz, Maciej

doi:10.1007/BFb0057741

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1475)

Included in the following conference series:

East European Symposium on Advances in Databases and Information Systems

Abstract

Mining association rules is an important data mining problem. Association rules are usually mined repeatedly in different parts of a database. Current algorithms for mining association rules work in two steps. First, the most frequently occurring sets of items are discovered, then the sets are used to generate the association rules. The first step usually requires repeated passes over the analyzed database and determines the overall performance. In this paper, we present a new method that addresses the issue of discovering the most frequently occurring sets of items. Our method consists in materializing precomputed sets of items discovered in logical database partitions. We show that the materialized sets can be repeatedly used to efficiently generate the most frequently occurring sets of items. Using this approach, required association rules can be mined with only one scan of the database. Our experiments show that the proposed method significantly outperforms the well-known algorithms.
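
The idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a Partition-style scheme in which locally frequent itemsets are precomputed once per logical partition and stored, and a later query takes the union of the stored sets as candidates and counts them in a single scan of the selected partitions. The function names (`local_frequent_itemsets`, `materialize`, `mine`), the partition labels, and the toy data are all hypothetical, and a brute-force enumeration stands in for a real local miner such as Apriori.

```python
from collections import Counter
from itertools import combinations


def local_frequent_itemsets(transactions, min_support):
    """Brute-force local miner (illustrative stand-in for Apriori).

    Returns every itemset whose relative support inside this one
    partition is at least min_support.
    """
    counts = Counter()
    for t in transactions:
        items = sorted(set(t))
        for k in range(1, len(items) + 1):
            for combo in combinations(items, k):
                counts[frozenset(combo)] += 1
    threshold = min_support * len(transactions)
    return {s for s, c in counts.items() if c >= threshold}


def materialize(partitions, mat_support):
    """Precompute and store locally frequent itemsets for each partition."""
    return {
        name: local_frequent_itemsets(rows, mat_support)
        for name, rows in partitions.items()
    }


def mine(partitions, materialized, selected, min_support):
    """Answer a query over the selected partitions with one data scan.

    Assumes min_support is not lower than the support used when
    materializing: an itemset frequent in the union of partitions must
    then be locally frequent in at least one of them, so the union of
    the stored sets is a complete candidate set.
    """
    candidates = set().union(*(materialized[p] for p in selected))
    rows = [set(t) for p in selected for t in partitions[p]]
    counts = {c: 0 for c in candidates}
    for t in rows:  # the single scan over the selected data
        for c in candidates:
            if c <= t:
                counts[c] += 1
    threshold = min_support * len(rows)
    return {c: n for c, n in counts.items() if n >= threshold}


# Hypothetical toy data: two logical partitions of a sales table.
partitions = {
    "jan": [["bread", "milk"], ["bread", "butter"], ["bread", "milk", "butter"]],
    "feb": [["milk", "butter"], ["bread", "milk"], ["milk"]],
}
materialized = materialize(partitions, 0.5)
frequent = mine(partitions, materialized, ["jan", "feb"], 0.5)
```

In this sketch the expensive multi-pass step runs once per partition when the sets are materialized; repeated queries over different partition selections reuse the stored sets and pay only for the counting scan.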

Author information

Authors and Affiliations

Institute of Computing Science, Poznan University of Technology, ul. Piotrowo 3a, 60-965, Poznan, Poland
Marek Wojciechowski & Maciej Zakrzewicz

Editor information

Witold Litwin, Tadeusz Morzy, Gottfried Vossen

Cite this paper

Wojciechowski, M., Zakrzewicz, M. (1998). Itemset materializing for fast mining of association rules. In: Litwin, W., Morzy, T., Vossen, G. (eds) Advances in Databases and Information Systems. ADBIS 1998. Lecture Notes in Computer Science, vol 1475. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0057741

DOI: https://doi.org/10.1007/BFb0057741
Published: 29 June 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-64924-3
Online ISBN: 978-3-540-68309-4
eBook Packages: Springer Book Archive
