Considering Main Memory in Mining Association Rules

Xiao, Yongqiao; Dunham, Margaret H.

doi:10.1007/3-540-48298-9_23

Yongqiao Xiao⁶ &
Margaret H. Dunham⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1676))

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

836 Accesses
9 Citations

Abstract

We propose a family of large itemset counting algorithms which adapt to the amount of main memory available. By using historical or sampling data, the potential large itemsets (candidates) and the false candidates are identified earlier. Redundant computation is reduced(thus overall CPU time reduced) by counting different sizes of candidates together and the use of a dynamic trie. By counting candidates earlier and counting more candidates in each scan, the algorithms reduce the overall number of scans required.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Rakesh Agrawal, Tomasz Imielinski, and Arun N. Swami. Mining association rules between sets of items in large databases. In Peter Buneman and Sushil Jajodia, editors, Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data, pages 207–216, Washington, D.C., 26–28 May 1993.
Google Scholar
Rakesh Agrawal, Heikki Mannila, Ramakrishnan Srikant, Hannu Toivonen, and A. Inkeri Verkamo. Fast discovery of association rules. In Usama M. Fayyad, Gregory Piatetsky-Shapiro, Padhraic Smyth, and Ramasamy Uthurusamy, editors, Advances in Knowledge Discovery and Data Mining, pages 307–328, Menlo Park, CA, 1996. AAAI Press.
Google Scholar
Rakesh Agrawal and John C. Shafer. Parallel mining of association rules. IEEE Transactions on Knowledge and Data Engineering, 8(6):962–969, December 1996.
Google Scholar
Rakesh Agrawal and Ramakrishnan Srikant. Fast Algorithms for Mining Association Rules in Large Databases. In Proceedings of the Twentieth International Conference on Very Large Databases, pages 487–499, Santiago, Chile, 1994.
Google Scholar
Sergey Brin, Rajeev Motwanik, Jeffrey D. Ullman, and Shalom Tsur. Dynamic itemset counting and implication rules for market basket data. In Proceedings of the ACM SIGMOD Conference, pages 255–264, 1997.
Google Scholar
Eui-Hong Han, George Karypis, and Vipin Kumar. Scalable parallel data mining for association rules. In Proceedings of the ACM SIGMOD Conference, pages 277–288, 1997.
Google Scholar
Jun-Lin Lin and M. H. Dunham. Mining association rules: Anti-skew algorithms. In Proceedings of the 14th International Conference on Data Engineering, Olando, Florida, February 1998. IEEE Computer Society Press.
Google Scholar
Heikki Mannila, Hannu Toivonen, and A. Inkeri Verkamo. Efficient algorithms for discovering association rules. In Usama M. Fayyad and Ramasamy Uthurusamy,editors, AAAI Workshop on Knowledge Discovery in Databases (KDD-94), pages 181–192, July 1994.
Google Scholar
Ashoka Savasere, Edward Omiecinski, and Shamkant B. Navathe. An efficient algorithm for mining association rules in large databases. In Proceedings of the 21nd International Conference on Very Large Databases, pages 432–444, Zurich, Swizerland, 1995.
Google Scholar
Takahiko Shintani and Masaru Kitsuregawa. Hash based parallel algorithms for mining association rules. In Proceedings of PDIS, 1996.
Google Scholar
Hannu Toivonen. Sampling large databases for association rules. In Proceedings of the 22nd International Conference on Very Large Databases, pages 134–145, Mumbai, India, 1996.
Google Scholar
Yongqiao Xiao and Margaret H Dunham. Considering main memory in mining association rules. In Technical Report-99-CSE-4, Department of Computer Science, Southern Methodist University, 1999.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Southern Methodist University, Dallas, Texas, 75275-0122
Yongqiao Xiao & Margaret H. Dunham

Authors

Yongqiao Xiao
View author publications
You can also search for this author in PubMed Google Scholar
Margaret H. Dunham
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer and Information Science, University of South Australia, The Levels, Adelaide, Australia, 05
Mukesh Mohania
IFS, Technical University of Vienna, Resselgasse 3, A-1040, Vienna, Austria
A Min Tjoa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xiao, Y., Dunham, M.H. (1999). Considering Main Memory in Mining Association Rules. In: Mohania, M., Tjoa, A.M. (eds) DataWarehousing and Knowledge Discovery. DaWaK 1999. Lecture Notes in Computer Science, vol 1676. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48298-9_23

Download citation

DOI: https://doi.org/10.1007/3-540-48298-9_23
Published: 01 March 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66458-1
Online ISBN: 978-3-540-48298-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics