IFIN+: A Parallel Incremental Frequent Itemsets Mining in Shared-Memory Environment

Huynh, Van Quoc Phuong; Küng, Josef; Jäger, Markus; Dang, Tran Khanh

doi:10.1007/978-3-319-70004-5_9

Van Quoc Phuong Huynh¹⁹,
Josef Küng¹⁹,
Markus Jäger¹⁹ &
…
Tran Khanh Dang²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10646))

Included in the following conference series:

International Conference on Future Data and Security Engineering

1893 Accesses
3 Citations

Abstract

In an effort to increase throughput for IFIN, a frequent itemsets mining algorithm, in this paper we introduce a solution, called IFIN⁺, for parallelizing the algorithm IFIN with shared-memory multithreads. The inspiration for our motivation is that today commodity processors’ computational power is enhanced with multi physical computational units; and therefore, exploiting full advantage of this is a potential solution for improving performance in single-machine environments. Some portions in the serial version are changed in means which increase efficiency and computational independence for convenience in designing parallel computation with Work-Pool model, be known as a good model for load balance. We conducted experiments to evaluate IFIN⁺ against its serial version IFIN, the well-known algorithm FP-Growth and other two state-of-the-art ones FIN and PrePost⁺. The experimental results show that the running time of IFIN⁺ is the most efficient, especially in the case of mining at different support thresholds in the same running session. Compare to its serial version, IFIN⁺ performance is improved significantly.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Swapping two nodes is simply exchanging one’s item name to that of the other.

References

Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proceedings of 20th International Conference on VLDB, pp. 487–499 (1994)
Google Scholar
Han, J., Pei, J., Yin, Y.: Mining frequent itemsets without candidate generation. ACM Sigmod Rec. 29(2), 1–12 (2000)
Article Google Scholar
Cheung, W., Zaïane O.R.: Incremental mining of frequent patterns without candidate generation or support constraint. In: Proceedings of the 7th International Database Engineering and Applications Symposium, pp. 111–116. IEEE (2003)
Google Scholar
Deng, Z.-H., Lv, S.-L.: Fast mining frequent itemsets using nodesets. Expert Syst. Appl. 41(10), 4505–4512 (2014)
Article Google Scholar
Deng, Z.-H., Lv, S.-L.: PrePost⁺: an efficient N-lists-based algorithm for mining frequent itemsets via children-parent equivalence pruning. Expert Syst. Appl. 42(13), 5424–5432 (2015)
Article Google Scholar
Rymon, R.: Search through systematic set enumeration. In: Proceedings of the 1st International Conference Principles of Knowledge Representation and Reasoning, pp. 539–550 (1992)
Google Scholar
Market-Basket Synthetic Data Generator. https://synthdatagen.codeplex.com/
Savasere, A., Omiecinski, E., Navathe, S.: An efficient algorithm for mining association rules in large databases. In: VLDB, pp. 432–443 (1995)
Google Scholar
Perego, R., Orlando, S., Palmerini, P.: Enhancing the Apriori algorithm for frequent set counting. In: Kambayashi, Y., Winiwarter, W., Arikawa, M. (eds.) DaWaK 2001. LNCS, vol. 2114, pp. 71–82. Springer, Heidelberg (2001). doi:10.1007/3-540-44801-2_8
Chapter Google Scholar
Park, J.S., Chen, M.S., Yu, P.S.: Using a hash-based method with transaction trimming and database scan reduction for mining association rules. IEEE Trans. Knowl. Data Eng. 9(5), 813–825 (1997)
Article Google Scholar
Zaki, M.J.: Scalable algorithms for association mining. IEEE Trans. Knowl. Data Eng. 12(3), 372–390 (2000)
Article Google Scholar
Grahne, G., Zhu, J.: Fast algorithms for frequent itemset mining using FP-trees. Trans. Knowl. Data Eng. 17(10), 1347–1362 (2005)
Article Google Scholar
Liu, G., Lu, H., Lou, W., Xu, Y., Yu, J.X.: Efficient mining of frequent itemsets using ascending frequency ordered prefix-tree. DMKD J. 9(3), 249–274 (2004)
Google Scholar
Shenoy, P., Haritsa, J.R., Sudarshan, S.: Turbo-charging vertical mining of large databases. In: SIGMOD 2000, pp. 22–33 (2000)
Google Scholar
Zaki, M.J., Gouda, K.: Fast vertical mining using diffsets. In: 9th SIGKDD, pp. 326–335 (2003)
Google Scholar
Liu, J., Wu, Y., Zhou, Q., Fung, B.C.M., Chen, F., Yu, B.: Parallel eclat for opportunistic mining of frequent itemsets. In: Chen, Q., Hameurlain, A., Toumani, F., Wagner, R., Decker, H. (eds.) DEXA 2015. LNCS, vol. 9261, pp. 401–415. Springer, Cham (2015). doi:10.1007/978-3-319-22849-5_27
Chapter Google Scholar
Yun, U., Lee, G.: Incremental mining of weighted maximal frequent itemsets from dynamic databases. Expert Syst. Appl. 54, 304–327 (2016)
Article Google Scholar
Huynh, V.Q.P., Küng, J., Dang, T.K.: Incremental frequent itemsets mining with IPPC tree. In: Benslimane, D., Damiani, E., Grosky, W.I., Hameurlain, A., Sheth, A., Wagner, R.R. (eds.) DEXA 2017. LNCS, vol. 10438, pp. 463–477. Springer, Cham (2017). doi:10.1007/978-3-319-64468-4_35
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Engineering and Natural Sciences (TNF), Institute for Application Oriented Knowledge Processing (FAW), Johannes Kepler University (JKU), Linz, Austria
Van Quoc Phuong Huynh, Josef Küng & Markus Jäger
Faculty of Computer Science and Engineering, HCMC University of Technology, HCM City, Vietnam
Tran Khanh Dang

Authors

Van Quoc Phuong Huynh
View author publications
You can also search for this author in PubMed Google Scholar
Josef Küng
View author publications
You can also search for this author in PubMed Google Scholar
Markus Jäger
View author publications
You can also search for this author in PubMed Google Scholar
Tran Khanh Dang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Van Quoc Phuong Huynh .

Editor information

Editors and Affiliations

HCMC University of Technology, Ho Chi Minh City, Vietnam
Tran Khanh Dang
Johannes Kepler University Linz, Linz, Austria
Roland Wagner
Johannes Kepler University Linz, Linz, Austria
Josef Küng
Ho Chi Minh City University of Technolog , Ho Chi Minh City, Vietnam
Nam Thoai
Hosei University, Koganei, Tokyo, Japan
Makoto Takizawa
University of Vienna, Vienna, Austria
Erich J. Neuhold

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Huynh, V.Q.P., Küng, J., Jäger, M., Dang, T.K. (2017). IFIN⁺: A Parallel Incremental Frequent Itemsets Mining in Shared-Memory Environment. In: Dang, T., Wagner, R., Küng, J., Thoai, N., Takizawa, M., Neuhold, E. (eds) Future Data and Security Engineering. FDSE 2017. Lecture Notes in Computer Science(), vol 10646. Springer, Cham. https://doi.org/10.1007/978-3-319-70004-5_9

Download citation

DOI: https://doi.org/10.1007/978-3-319-70004-5_9
Published: 01 November 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70003-8
Online ISBN: 978-3-319-70004-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics