Abstract
Mining frequent itemsets is one of the main challenges in association rule mining. The efficiency of frequent itemset mining depends on the computation time and on the data structure used to store the itemsets, which largely determines the space requirement. Most algorithms work well on sparse datasets. However, when the dataset is large, computation becomes difficult and execution time increases, which limits the scalability of the algorithm. With a compact and concise representation, the itemsets can fit in memory and therefore require no I/O operations. The most commonly used data structures are the array, the tree, and the trie. In this paper, we present a comparison of the different data structures used by the mining algorithms.
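To make the idea of a compact itemset representation concrete, the following is a minimal sketch of a trie that stores itemsets together with their support counts. It is an illustration only, not a structure taken from any of the surveyed algorithms; the names `TrieNode`, `ItemsetTrie`, and `count_itemsets`, and the toy transactions, are all assumptions introduced here.

```python
from itertools import combinations


class TrieNode:
    """One node per item; the path from the root spells out an itemset."""

    def __init__(self):
        self.children = {}  # item -> TrieNode
        self.count = 0      # support of the itemset ending at this node


class ItemsetTrie:
    """Hypothetical trie for itemsets; items are sorted so prefixes are shared."""

    def __init__(self):
        self.root = TrieNode()

    def add(self, itemset, count=1):
        """Insert an itemset, or increase its count if it is already stored."""
        node = self.root
        for item in sorted(itemset):
            node = node.children.setdefault(item, TrieNode())
        node.count += count

    def support(self, itemset):
        """Return the stored count of an itemset, or 0 if it was never added."""
        node = self.root
        for item in sorted(itemset):
            node = node.children.get(item)
            if node is None:
                return 0
        return node.count

    def node_count(self):
        """Number of nodes excluding the root, a rough proxy for memory use."""
        stack, total = [self.root], 0
        while stack:
            node = stack.pop()
            total += len(node.children)
            stack.extend(node.children.values())
        return total


def count_itemsets(transactions, k):
    """Count every k-itemset that occurs in the transactions (brute force)."""
    trie = ItemsetTrie()
    for transaction in transactions:
        for subset in combinations(sorted(set(transaction)), k):
            trie.add(subset)
    return trie


# Toy data, assumed for illustration only.
transactions = [
    ["a", "b", "c"],
    ["a", "b"],
    ["a", "c"],
    ["b", "c"],
]
trie = count_itemsets(transactions, 2)
print(trie.support(["a", "b"]), trie.node_count())
```

Because itemsets that share a prefix share nodes, the three 2-itemsets above occupy five nodes rather than six separate item slots in a flat array of itemsets. The saving grows as more itemsets share longer prefixes, which is the kind of space trade-off the abstract refers to.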
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kharkongor, C., Nath, B. (2020). A Survey on Representation for Itemsets in Association Rule Mining. In: Pati, B., Panigrahi, C., Buyya, R., Li, KC. (eds) Advanced Computing and Intelligent Engineering. Advances in Intelligent Systems and Computing, vol 1082. Springer, Singapore. https://doi.org/10.1007/978-981-15-1081-6_14
DOI: https://doi.org/10.1007/978-981-15-1081-6_14
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1080-9
Online ISBN: 978-981-15-1081-6
eBook Packages: Engineering, Engineering (R0)