Association Rule Mining on Streams

Yu, Philip S.; Chi, Yun

doi:10.1007/978-1-4614-8265-9_25

Reference work entry
First Online: 01 January 2018

Definition

Let I = {i₁, …, i_m} be a set of items, and let S be a stream of transactions arriving in sequential order, where each transaction is a subset of I. For an itemset X ⊆ I, a transaction T in S is said to contain X if X ⊆ T. The support of X is the fraction of transactions in S that contain X. For a given support threshold s%, X is frequent if its support is greater than or equal to s%, i.e., if at least s% of the transactions in S contain X. For a given confidence threshold c%, an association rule X ⇒ Y holds if X ∪ Y is frequent and at least c% of the transactions in S that contain X also contain Y. The problem of association rule mining on streams is to discover all association rules that hold in a stream of transactions.
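
The definitions above can be illustrated with a minimal Python sketch. It treats the part of the stream seen so far as a finite list and recomputes support and confidence by brute force, so it does not attempt the single-pass, bounded-memory processing that an actual stream would require. The function names (`support`, `confidence`, `rule_holds`), the sample transactions, and the use of fractions in [0, 1] instead of percentages are assumptions made for illustration; they are not taken from the entry.

```python
from typing import FrozenSet, List

Itemset = FrozenSet[str]


def support(x: Itemset, transactions: List[Itemset]) -> float:
    """Fraction of transactions T with X ⊆ T (0.0 for an empty list)."""
    if not transactions:
        return 0.0
    return sum(1 for t in transactions if x <= t) / len(transactions)


def confidence(x: Itemset, y: Itemset, transactions: List[Itemset]) -> float:
    """Among transactions containing X, the fraction that also contain Y."""
    containing_x = [t for t in transactions if x <= t]
    if not containing_x:
        return 0.0
    return sum(1 for t in containing_x if y <= t) / len(containing_x)


def rule_holds(x: Itemset, y: Itemset, transactions: List[Itemset],
               s: float, c: float) -> bool:
    """X ⇒ Y holds if X ∪ Y is frequent (support >= s) and confidence >= c."""
    return (support(x | y, transactions) >= s
            and confidence(x, y, transactions) >= c)


# Hypothetical prefix of a transaction stream (illustrative data only).
stream_prefix = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "butter"}),
    frozenset({"bread", "milk", "butter"}),
    frozenset({"milk"}),
]

bread = frozenset({"bread"})
milk = frozenset({"milk"})

# {bread, milk} appears in 2 of 4 transactions; 2 of the 3 transactions
# containing bread also contain milk.
print(support(bread | milk, stream_prefix))               # 0.5
print(rule_holds(bread, milk, stream_prefix, 0.5, 0.6))   # True
print(rule_holds(bread, milk, stream_prefix, 0.5, 0.7))   # False
```

With s = 50% and c = 60%, the rule {bread} ⇒ {milk} holds on this prefix; raising c to 70% rejects it because the confidence is 2/3.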

Historical Background

In 1993, Rakesh Agrawal et al. [1] proposed the framework for association rule mining. Since this seminal work, much research has been done to improve the efficiency of association rule mining...

Recommended Reading

1. Agrawal R, Imielinski T, Swami A. Mining association rules between sets of items in large databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 1993. p. 207–16.
2. Chang JH, Lee WS. Finding recent frequent itemsets adaptively over online data streams. In: Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; 2003. p. 487–92.
3. Charikar M, Chen K, Farach-Colton M. Finding frequent items in data streams. In: Proceedings of the 29th International Colloquium on Automata, Languages and Programming; 2002. p. 693–703.
4. Cheng J, Ke Y, Ng W. A survey on algorithms for mining frequent itemsets over data streams. Knowl Inf Syst. 2008;16(1):1–27.
5. Chi Y, Wang H, Yu PS, Muntz RR. Catch the moment: maintaining closed frequent itemsets in a data stream sliding window. Knowl Inf Syst. 2006;10(3):265–94.
6. Cheung DW, Han J, Ng V, Wong CY. Maintenance of discovered association rules in large databases: an incremental updating technique. In: Proceedings of the 12th International Conference on Data Engineering; 1996. p. 106–14.
7. Cheung DW, Lee SD, Kao B. A general incremental technique for maintaining discovered association rules. In: Proceedings of the 5th International Conference on Database Systems for Advanced Applications; 1997. p. 185–94.
8. Giannella C, Han J, Pei J, Yan X, Yu PS. Mining frequent patterns in data streams at multiple time granularities. In: Kargupta H, Joshi A, Sivakumar K, Yesha Y, editors. Data mining: next generation challenges and future directions. AAAI; 2004.
9. Gouda K, Zaki MJ. Efficiently mining maximal frequent itemsets. In: Proceedings of the 1st IEEE International Conference on Data Mining; 2001. p. 163–70.
10. Manku G, Motwani R. Approximate frequency counts over data streams. In: Proceedings of the 28th International Conference on Very Large Data Bases; 2002. p. 346–57.
11. Otey ME, Parthasarathy S, Wang C, Veloso A, Meira W Jr. Parallel and distributed methods for incremental frequent itemset mining. IEEE Trans Syst Man Cybern B. 2004;34(6):2439–50.
12. Teng W-G, Chen M-S, Yu PS. A regression-based temporal pattern mining scheme for data streams. In: Proceedings of the 29th International Conference on Very Large Data Bases; 2003. p. 98–104.
13. Thomas S, Bodagala S, Alsabti K, Ranka S. An efficient algorithm for the incremental updation of association rules in large databases. In: Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining; 1997. p. 263–6.
14. Veloso A, Meira W Jr, de Carvalho M, Pôssas B, Parthasarathy S, Zaki MJ. Mining frequent itemsets in evolving databases. In: Proceedings of the SIAM International Conference on Data Mining; 2002.

Author information

Authors and Affiliations

Computer Science Department, University of Illinois at Chicago, Chicago, IL, USA
Philip S. Yu
NEC Laboratories America, Cupertino, CA, USA
Yun Chi

Corresponding author

Correspondence to Philip S. Yu.

Editor information

Editors and Affiliations

Georgia Institute of Technology College of Computing, Atlanta, GA, USA
Ling Liu
University of Waterloo School of Computer Science, Waterloo, ON, Canada
M. Tamer Özsu

Section Editor information

AT&T Labs-Research, Bedminster, NJ, USA
Divesh Srivastava

Cite this entry

Yu, P.S., Chi, Y. (2018). Association Rule Mining on Streams. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_25

DOI: https://doi.org/10.1007/978-1-4614-8265-9_25
Published: 07 December 2018
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer Science; Reference Module Computer Science and Engineering
