Abstract
Maximal frequent itemsets mining is one of the most fundamental problems in data mining. In this paper, we present CfpMfi, a new depth-first search algorithm based on CFP-tree for mining MFI. Based on the new data structure CFP-tree, which is a combination of FP-tree and MFI-tree, CfpMfi takes a variety pruning techniques and a novel item ordering policy to reduce the search space efficiently. Experimental comparison with previous work reveals that, on dense datasets, CfpMfi prunes the search space efficiently and is better than other MFI Mining algorithms on dense datasets, and uses less main memory than similar algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of the 20th VLDB Conference, Santiago, Chile (1994)
Han, J., Pei, J., Yin, Y.: Mining Frequent Patterns without Candidate Generation. In: Proc. 2000 ACM-SIGMOD Int. Conf. on Management of Data (SIGMOD 2000), Dallas, TX (May 2000)
Rigoutsos, L., Floratos, A.: Combinatorial pattern discovery in biological sequences: The Teiresias algorithm. Bioinformatics 14(1), 55–67 (1998)
Liu, G., Lu, H., Yu, J.X., Wang, W., Xiao, X.: AFOPT: An Efficient Implementation of Pattern Growth Approach. In: Proceedings of the IEEE ICDM Workshop on Frequent Itemset Mining Implementations, Melbourne, Florida, USA, November 19 (2003)
Bayardo, R.: Efficiently mining long patterns from databases. In: ACM SIGMOD Conference (1998)
Agarwal, R., Aggarwal, C., Prasad, V.: A tree projection algorithm for generation of frequent itemsets. Journal of Parallel and Distributed Computing (2001)
Burdick, D., Calimlim, M., Gehrke, J.: MAFIA: A Performance Study of Mining Maximal Frequent Itemsets. In: Proceedings of the IEEE ICDM Workshop on Frequent Itemset Mining Implementations Melbourne, Florida, USA, November 19 (2003)
Zaki, M.J., Hsiao, C.-J.: CHARM: An efficient algorithm for closed association rule mining. TR 99-10, CS Dept., RPI (October 1999)
Gouda, K., Zaki, M.J.: Efficiently Mining Maximal Frequent Itemsets. In: Proc. of the IEEE Int. Conference on Data Mining, San Jose (2001)
Grahne, G., Zhu, J.: Efficiently Using Prefix-trees in Mining Frequent Item-sets. In: Proceedings of the IEEE ICDM Workshop on Frequent Itemset Mining Imple-mentations, Melbourne, Florida, USA, November 19 (2003)
Goethals, B., Zaki, M.J.: FIMI 2003: Workshop on Frequent Itemset Mining Imple-mentations. In: Proceedings of the IEEE ICDM Workshop on Frequent Itemset Mining Implementations, Melbourne, Florida, USA, (November 19 2003)
Codes and datasets available at: http://fimi.cs.helsinki.fi/
Yan, Y., Li, Z.-J., Chen, H.-w.: Fast mining maximal frequent itemSets based on FP-tree. In: Atzeni, P., et al. (eds.) ER 2004. LNCS, vol. 3288, pp. 348–361. Springer, Heidelberg (2004)
ER (2004): http://www.cs.fudan.edu.cn/er2004/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yan, Y., Li, Z., Wang, T., Chen, Y., Chen, H. (2004). Mining Maximal Frequent ItemSets Using Combined FP-Tree. In: Webb, G.I., Yu, X. (eds) AI 2004: Advances in Artificial Intelligence. AI 2004. Lecture Notes in Computer Science(), vol 3339. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30549-1_42
Download citation
DOI: https://doi.org/10.1007/978-3-540-30549-1_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24059-4
Online ISBN: 978-3-540-30549-1
eBook Packages: Computer ScienceComputer Science (R0)