Abstract
Finding frequent sets of items was first considered critical to mining association rules in the early 1990s. In the subsequent two decades, there have appeared numerous new methods of finding frequent itemsets, which underlines the importance of this problem. The number of algorithms has increased, thus making it more difficult to select proper one for a particular task and/or a particular type of data. This article analyses and compares the twelve most widely used algorithms for mining association rules. The choice of the most efficient of the twelve algorithms is made not only on the basis of available research data, but also based on empirical evidence. In addition, the article gives a detailed description of some approaches and contains an overview and classification of algorithms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal, R., Imielinski, T., Swami, A.: Mining associations between sets of items in large databases. In: ACM SIGMOD International Conference on Management of Data, SIGMOD 1993, Washington, DC, pp. 207–216 (1993)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: 20th International Conference on Very Large Databases, VLDB 1994, Santiago, Chile, pp. 487–499 (1994)
Agrawal, R., Mannila, H., Srikant, R., Toivonen, H.: Fast discovery of association rules. Adv. Knowl. Discov. Data Min. 12(1), 307–328 (1996). AAAI MIT Press
Han, J., Pei, H., Yin, Y.: Mining frequent patterns without candidate generation. In: ACM SIGMOD International Conference on Management of Data, SIGMOD 2000, Dallas, TX, pp. 1–12 (2000)
Zaki, M.J.: Scalable algorithms for association mining. IEEE Trans. Knowl. Data Eng. 12(3), 372–390 (2000)
Borgelt, C.: Keeping things simple: finding frequent item sets by recursive elimination. In: Open Source Data Mining Workshop, OSDM 2005, Chicago, IL, pp. 66–70. ACM Press, New York (2005)
Pei, J., Han, J., Lu, H., Nishio, S., Tang, S., Yang, D.: H-mine: fast and space-preserving frequent pattern mining in large databases. IIE Trans. 39(6), 593–605 (2007)
Uno, T., Kiyomi, M., Arimura, H.: LCM ver. 2: efficient mining algorithms for frequent/closed/maximal itemsets. In: Workshop on Frequent Itemset Mining Implementations, FIMI 2004, Brighton, UK (2004)
Uno, T., Kiyomi, M., Arimura, H.: LCM ver.3: collaboration of array, bitmap and prefix tree for frequent itemset mining. In: Open Source Data Mining Workshop, OSDM 2005, Chicago, IL, pp. 77–86. ACM Press, New York (2005)
Deng, Z.H., Wang, Z., Jiang, J.: A new algorithm for fast mining frequent itemsets using N-lists. Sci. China Inf. Sci. 55(9), 2008–2030 (2012)
Zaki, M.J., Gouda, K.: Fast vertical mining using diffsets. In: 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2003, Washington, DC, pp. 326–335. ACM Press, New York (2003)
Deng, Z.H., Wang, Z.: A new fast vertical method for mining frequent patterns. Int. J. Comput. Intell. Syst. 3(6), 733–744 (2010)
Deng, Z.H., Lv, S.L.: Fast mining frequent itemsets using Nodesets. Expert Syst. Appl. 41(10), 4505–4512 (2014)
Deng, Z.H., Lv, S.L.: PrePost+: an efficient N-lists-based algorithm for mining frequent itemsets via Children-Parent Equivalence pruning. Expert Syst. Appl. 42(10), 5424–5432 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Busarov, V., Grafeeva, N., Mikhailova, E. (2016). A Comparative Analysis of Algorithms for Mining Frequent Itemsets. In: Arnicans, G., Arnicane, V., Borzovs, J., Niedrite, L. (eds) Databases and Information Systems. DB&IS 2016. Communications in Computer and Information Science, vol 615. Springer, Cham. https://doi.org/10.1007/978-3-319-40180-5_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-40180-5_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40179-9
Online ISBN: 978-3-319-40180-5
eBook Packages: Computer ScienceComputer Science (R0)