A Comparative Analysis of Algorithms for Mining Frequent Itemsets

Busarov, Vyacheslav; Grafeeva, Natalia; Mikhailova, Elena

doi:10.1007/978-3-319-40180-5_10

Vyacheslav Busarov¹⁴,
Natalia Grafeeva¹⁴ &
Elena Mikhailova¹⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 615))

Included in the following conference series:

International Baltic Conference on Databases and Information Systems

700 Accesses

Abstract

Finding frequent sets of items was first considered critical to mining association rules in the early 1990s. In the subsequent two decades, there have appeared numerous new methods of finding frequent itemsets, which underlines the importance of this problem. The number of algorithms has increased, thus making it more difficult to select proper one for a particular task and/or a particular type of data. This article analyses and compares the twelve most widely used algorithms for mining association rules. The choice of the most efficient of the twelve algorithms is made not only on the basis of available research data, but also based on empirical evidence. In addition, the article gives a detailed description of some approaches and contains an overview and classification of algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agrawal, R., Imielinski, T., Swami, A.: Mining associations between sets of items in large databases. In: ACM SIGMOD International Conference on Management of Data, SIGMOD 1993, Washington, DC, pp. 207–216 (1993)
Google Scholar
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: 20th International Conference on Very Large Databases, VLDB 1994, Santiago, Chile, pp. 487–499 (1994)
Google Scholar
Agrawal, R., Mannila, H., Srikant, R., Toivonen, H.: Fast discovery of association rules. Adv. Knowl. Discov. Data Min. 12(1), 307–328 (1996). AAAI MIT Press
Google Scholar
Han, J., Pei, H., Yin, Y.: Mining frequent patterns without candidate generation. In: ACM SIGMOD International Conference on Management of Data, SIGMOD 2000, Dallas, TX, pp. 1–12 (2000)
Google Scholar
Zaki, M.J.: Scalable algorithms for association mining. IEEE Trans. Knowl. Data Eng. 12(3), 372–390 (2000)
Article MathSciNet Google Scholar
Borgelt, C.: Keeping things simple: finding frequent item sets by recursive elimination. In: Open Source Data Mining Workshop, OSDM 2005, Chicago, IL, pp. 66–70. ACM Press, New York (2005)
Google Scholar
Pei, J., Han, J., Lu, H., Nishio, S., Tang, S., Yang, D.: H-mine: fast and space-preserving frequent pattern mining in large databases. IIE Trans. 39(6), 593–605 (2007)
Article Google Scholar
Uno, T., Kiyomi, M., Arimura, H.: LCM ver. 2: efficient mining algorithms for frequent/closed/maximal itemsets. In: Workshop on Frequent Itemset Mining Implementations, FIMI 2004, Brighton, UK (2004)
Google Scholar
Uno, T., Kiyomi, M., Arimura, H.: LCM ver.3: collaboration of array, bitmap and prefix tree for frequent itemset mining. In: Open Source Data Mining Workshop, OSDM 2005, Chicago, IL, pp. 77–86. ACM Press, New York (2005)
Google Scholar
Deng, Z.H., Wang, Z., Jiang, J.: A new algorithm for fast mining frequent itemsets using N-lists. Sci. China Inf. Sci. 55(9), 2008–2030 (2012)
Article MathSciNet MATH Google Scholar
Zaki, M.J., Gouda, K.: Fast vertical mining using diffsets. In: 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2003, Washington, DC, pp. 326–335. ACM Press, New York (2003)
Google Scholar
Deng, Z.H., Wang, Z.: A new fast vertical method for mining frequent patterns. Int. J. Comput. Intell. Syst. 3(6), 733–744 (2010)
Article MathSciNet Google Scholar
Deng, Z.H., Lv, S.L.: Fast mining frequent itemsets using Nodesets. Expert Syst. Appl. 41(10), 4505–4512 (2014)
Article Google Scholar
Deng, Z.H., Lv, S.L.: PrePost+: an efficient N-lists-based algorithm for mining frequent itemsets via Children-Parent Equivalence pruning. Expert Syst. Appl. 42(10), 5424–5432 (2015)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Saint Petersburg State University, St. Petersburg, Russia
Vyacheslav Busarov, Natalia Grafeeva & Elena Mikhailova

Authors

Vyacheslav Busarov
View author publications
You can also search for this author in PubMed Google Scholar
Natalia Grafeeva
View author publications
You can also search for this author in PubMed Google Scholar
Elena Mikhailova
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Vyacheslav Busarov .

Editor information

Editors and Affiliations

University of Latvia, Riga, Latvia
Guntis Arnicans
University of Latvia, Riga, Latvia
Vineta Arnicane
University of Latvia, Riga, Latvia
Juris Borzovs
University of Latvia, Riga, Latvia
Laila Niedrite

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Busarov, V., Grafeeva, N., Mikhailova, E. (2016). A Comparative Analysis of Algorithms for Mining Frequent Itemsets. In: Arnicans, G., Arnicane, V., Borzovs, J., Niedrite, L. (eds) Databases and Information Systems. DB&IS 2016. Communications in Computer and Information Science, vol 615. Springer, Cham. https://doi.org/10.1007/978-3-319-40180-5_10

Download citation

DOI: https://doi.org/10.1007/978-3-319-40180-5_10
Published: 21 June 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40179-9
Online ISBN: 978-3-319-40180-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics