Skip to main content

Metaheuristics for Frequent and High-Utility Itemset Mining

  • Chapter
  • First Online:
Book cover High-Utility Pattern Mining

Part of the book series: Studies in Big Data ((SBD,volume 51))

Abstract

Metaheuristics are often used to solve combinatorial problems. They can be viewed as general purpose problem-solving approaches based on stochastic methods, which explore very large search spaces to find near-optimal solutions in a reasonable time. Some metaheuristics are inspired by biological and physical phenomenons. During the last two decades, two population-based methods named evolutionary algorithms and swarm intelligence have shown high efficiency compared to many other metaheuristics. Frequent Itemset Mining (FIM) and High Utility Itemset Mining (HUIM) are the process of extracting useful frequent and high utility itemsets from a given transactional database. Solving FIM and HUIM problems can be very time consuming, especially when dealing with large-scale data. To deal with this issue, different metaheuristic-based methods were developed. In this chapter, we study the application of metaheuristics to FIM and HUIM. Several metaheuristics have been presented, based on evolutionary or swarm intelligence algorithms, such as genetic algorithms, particle swarm optimization, ant colony optimization and bee swarm optimization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Djenouri, Y., Belhadi, A., Belkebir, R.: Bees swarm optimization guided by data mining techniques for document information retrieval. Expert. Syst. Appl. 94, 126–136 (2018)

    Article  Google Scholar 

  2. Djenouri, Y., Belhadi, A., Fournier-Viger, P.: Extracting useful knowledge from event logs: a frequent itemset mining approach. Knowl.-Based Syst. 139, 132–148 (2018)

    Article  Google Scholar 

  3. Djenouri, Y., Habbas, Z., Djenouri, D., Fournier-Viger, P.: Bee swarm optimization for solving the MAXSAT problem using prior knowledge. Soft Comput. 1–18 (2017)

    Google Scholar 

  4. Djenouri, Y., Habbas, Z., Djenouri, D.: Data mining-based decomposition for solving the MAXSAT problem: toward a new approach. IEEE Intell. Syst. 32(4), 48–58 (2017)

    Article  Google Scholar 

  5. Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: ACM SIGMOD Record, vol. 22, No. 2, pp. 207–216. ACM (1993)

    Google Scholar 

  6. Djenouri, Y., Comuzzi, M., Djenouri, D.: SS-FIM: single scan for frequent itemsets mining in transactional databases. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 644–654. Springer, Cham (2017)

    Chapter  Google Scholar 

  7. Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: ACM SIGMOD Record, vol. 29, No. 2, pp. 1–12. ACM (2000)

    Google Scholar 

  8. Djenouri, Y., Belhadi, A., Fournier-Viger, P., Lin, J. C. W.: An hybrid multi-core/gpu-based mimetic algorithm for big association rule mining. In: International Conference on Genetic and Evolutionary Computing, pp. 59–65. Springer, Singapore (2017)

    Google Scholar 

  9. Djenouri, Y., Habbas, Z., Djenouri, D., Comuzzi, M.: Diversification heuristics in bees swarm optimization for association rules mining. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 68–78. Springer, Cham (2017)

    Google Scholar 

  10. Djenouri, Y., Comuzzi, M.: Combining Apriori euristic and bio-inspired algorithms for solving the frequent itemsets mining problem. Inf. Sci. 420, 1–15 (2017)

    Article  Google Scholar 

  11. Gheraibia, Y., Moussaoui, A., Djenouri, Y., Kabir, S., Yin, P.Y.: Penguins search optimisation algorithm for association rules mining. J. Comput. Inf. Technol. 24(2), 165–179 (2016)

    Article  Google Scholar 

  12. Brin, S., Motwani, R., Ullman, J.D., Tsur, S.: Dynamic itemset counting and implication rules for market basket data. In: ACM SIGMOD Record, vol. 26, No. 2, pp. 255–264. ACM (1997)

    Google Scholar 

  13. Mueller, A.: Fast sequential and parallel algorithms for association rule mining: A comparison (1998)

    Google Scholar 

  14. Zaki, M.J., Parthasarathy, S., Ogihara, M., Li, W.: New algorithms for fast discovery of association rules. In: International Conference on Knowledge Discovery and Data Mining, vol. 97, pp. 283–286. ACM (1997)

    Google Scholar 

  15. Amphawan, K., Lenca, P., Surarerks, A.: Efficient mining top-k regular-frequent itemset using compressed tidsets. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 124–135. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  16. Cerf, L., Besson, J., Robardet, C., Boulicaut, J. F.: Closed patterns meet n-ary relations. ACM Trans. Knowl. Discov. Data 3(1) (2009). Article 3

    Article  Google Scholar 

  17. Leung, C.K.S., Mateo, M.A.F., Brajczuk, D.A.: A tree-based approach for frequent pattern mining from uncertain data. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 653–661. Springer, Heidelberg (2008)

    Google Scholar 

  18. Grahne, G., Zhu, J.: Fast algorithms for frequent itemset mining using fp-trees. IEEE Trans. Knowl. Data Eng. 17(10), 1347–1362 (2005)

    Article  Google Scholar 

  19. Fournier-Viger, P., Lin, J. C.-W., Vo, B, Chi, T.T., Zhang, J., Le, H.B.: A survey of itemset mining. WIREs Data Min. Knowl. Discov. e1207 (2017). https://doi.org/10.1002/widm.1207.. Wiley

    Google Scholar 

  20. Hong, T.P., Lin, C.W., Wu, Y.L.: Incrementally fast updated frequent pattern trees. Expert. Syst. Appl. 34(4), 2424–2435 (2008)

    Article  Google Scholar 

  21. Hong, T.P., Lin, C.W., Wu, Y.L.: Maintenance of fast updated frequent pattern trees for record deletion. Comput. Stat. Data Anal. 53(7), 2485–2499 (2009)

    Article  MathSciNet  MATH  Google Scholar 

  22. Hong, T.P., Lin, C.W., Wu, Y.L.: An efficient FUFP-tree maintenance algorithm for record modification. Int. J. Innov. Comput., Inf. Control. 4(11), 2875–2887 (2008)

    Google Scholar 

  23. Lin, C.W., Hong, T.P., Lu, W.H.: The Pre-FUFP algorithm for incremental mining. Expert. Syst. Appl. 36(5), 9498–9505 (2009)

    Article  Google Scholar 

  24. Lin, C.W., Hong, T.P., Lu, W.H.: Using the structure of prelarge trees to incrementally mine frequent itemsets. New Gener. Comput. 28(1), 5–20 (2010)

    Article  MATH  Google Scholar 

  25. Lin, C.W., Hong, T. P., Lu, W.H.: Maintenance of fast updated frequent trees for record deletion based on prelarge concepts. In: The International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems, pp. 675–684. Springer, Heidelberg (2007)

    Google Scholar 

  26. Lin, C.W., Hong, T.P., Lu, W.H.: Efficient modification of fast updated FP-trees based on pre-large concepts. Int. J. Innov. Comput., Inf. Control. 6(12), 5163–5177 (2010)

    Google Scholar 

  27. Lin, C.W., Gan, W.S., Hong, T.P.: Efficiently maintaining the fast updated sequential pattern trees with sequence deletion. IEEE Access 2, 1374–1383 (2014)

    Article  Google Scholar 

  28. Lin, C.W., Gan, W.S., Hong, T.P., Zhang, J.: Updating the built prelarge fast updated sequential pattern trees with sequence modification. Int. J. Data Warehous. Min. 1(1), 1–21 (2015)

    Article  Google Scholar 

  29. Zhang, B., Lin, C.W., Gan, W.S., Hong, T.P.: Maintaining the discovered sequential patterns for sequence insertion in dynamic databases. Eng. Appl. Artif. Intell. 35, 131–142 (2014)

    Article  Google Scholar 

  30. Djenouri, Y., Drias, H., Habbas, Z.: Bees swarm optimisation using multiple strategies for association rule mining. Int. J. Bio-Inspired Comput. 6(4), 239–249 (2014)

    Article  Google Scholar 

  31. Mata J., Alvarez J., Riquelme J.: An evolutionary algorithm to discover numeric association rules. In: Proceedings of the ACM symposium on Applied computing SAC, pp. 590–594 (2002)

    Google Scholar 

  32. Romero, C., Zafra, A., Luna, J.M., Ventura, S.: Association rule mining using genetic programming to provide feedback to instructors from multiple-choice quiz data. Expert. Syst. 30(2), 162–172 (2013)

    Article  Google Scholar 

  33. Djenouri, Y., Bendjoudi, A., Nouali-Taboudjemat, N.: Association rules mining using evolutionary algorithms. In: The 9th International Conference on Bio-inspired Computing: Theories and Applications (BIC-TA 2014). LNCS (2014)

    Google Scholar 

  34. Martinez-Ballesteros, M., Bacardit, J., Troncoso, A., Riquelme, J.C.: Enhancing the scalability of a genetic algorithm to discover quantitative association rules in large-scale datasets. Integr. Comput.-Aided Eng. 22(1), 21–39 (2015)

    Article  Google Scholar 

  35. Martin, D., AlcaliFdez, J., Rosete, A., Herrera, F.: NICGAR: a niching genetic algorithm to mine a diverse set of interesting quantitative association rules. Inf. Sci. 355, 208–228 (2016)

    Article  Google Scholar 

  36. Wang, B., Merrick, K.E., Abbass, H.A.: Co-operative coevolutionary neural networks for mining functional association rules. IEEE Trans. Neural Netw. Learn. Syst. 28(6), 1331–1344 (2017)

    Article  Google Scholar 

  37. Ting, C.K., Liaw, R.T., Wang, T.C., Hong, T.P.: Mining fuzzy association rules using a mimetic algorithm based on structure representation. Memetic Comput. 1–14 (2017)

    Google Scholar 

  38. Mukhopadhyay, A., Maulik, U., Bandyopadhyay, S., Coello, C.A.C.: A survey of multiobjective evolutionary algorithms for data mining: Part I. IEEE Trans. Evol. Comput. 18(1), 4–19 (2014)

    Article  Google Scholar 

  39. Kuo, R.J., Chao, C.M., Chiu, Y.T.: Application of particle swarm optimization to association rule mining. Appl. Soft Comput. 11(1), 326–336 (2011)

    Article  Google Scholar 

  40. Sarath, K.N.V.D., Ravi, V.: Association rule mining using binary particle swarm optimization. Eng. Appl. Artif. Intell. 26(8), 1832–1840 (2013)

    Article  Google Scholar 

  41. Beiranvand, V., Mobasher-Kashani, M., Bakar, A.A.: Multi-objective PSO algorithm for mining numerical association rules without a priori discretization. Expert. Syst. Appl. 41(9), 4259–4273 (2014)

    Article  Google Scholar 

  42. Agrawal, J., Agrawal, S., Singhai, A., Sharma, S.: SET-PSO-based approach for mining positive and negative association rules. Knowl. Inf. Syst. 45(2), 453–471 (2015)

    Article  Google Scholar 

  43. Djenouri, Y., Drias, H., Habbas, Z., Mosteghanemi, H.: Bees swarm optimization for web association rule mining. In: IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology, vol. 3, pp. 142–146. IEEE (2012)

    Google Scholar 

  44. Djenouri, Y., Drias, H., Chemchem, A.: A hybrid bees swarm optimization and tabu search algorithm for association rule mining. In: World Congress on Nature and Biologically Inspired Computing, pp. 120–125. IEEE (2013)

    Google Scholar 

  45. Djenouri, Y., Drias, H., Habbas, Z.: Hybrid intelligent method for association rules mining using multiple strategies. Int. J. Appl. Metaheuristic Comput. 5(1), 46–64 (2014)

    Article  Google Scholar 

  46. Heraguemi, K.E., Kamel, N., Drias, H.: Multi-swarm bat algorithm for association rule mining using multiple cooperative strategies. Appl. Intell. 45(4), 1021–1033 (2016)

    Article  Google Scholar 

  47. Song, A., Ding, X., Chen, J., Li, M., Cao, W., Pu, K.: Multi-objective association rule mining with binary bat algorithm. Intell. Data Anal. 20(1), 105–128 (2016)

    Article  Google Scholar 

  48. Sheikhan, M., Rad, M.S.: Gravitational search algorithm optimized neural misuse detector with selected features by fuzzy grids based association rules mining. Neural Comput. Appl. 23(7–8), 2451–2463 (2013)

    Article  Google Scholar 

  49. Mlakar, U., Zorman, M., Fister Jr., I., Fister, I.: Modified binary cuckoo search for association rule mining. J. Intell. Fuzzy Syst. 32(6), 4319–4330 (2017)

    Article  Google Scholar 

  50. Ahmed, C.F., Tanbeer, S.K., Jeong, B.S., Lee, Y.K.: Efficient tree structures for high utility pattern mining in incremental databases. IEEE Trans. Knowl. Data Eng. 21(12), 1708–1721 (2009)

    Article  Google Scholar 

  51. Liu, Y., Liao, W. K., Choudhary, A. N.: A two-phase algorithm for fast discovery of high utility itemsets. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, vol. 3518, pp. 689–695 (2005)

    Chapter  Google Scholar 

  52. Lin, C.W., Hong, T.P., Lu, W.H.: An effective tree structure for mining high utility itemsets. Expert. Syst. Appl. 38(6), 7419–7424 (2011)

    Article  Google Scholar 

  53. Tseng, V.S., Wu, C.W., Shie, B.E., Yu, P.S.: UP-Growth: an efficient algorithm for high utility itemset mining. In: International Conference on Knowledge Discovery and Data Mining, pp. 253–262. ACM (2010)

    Google Scholar 

  54. Yun, U., Ryang, H., Ryu, K.H.: High utility itemset mining with techniques for reducing overestimated utilities and pruning candidates. Expert. Syst. Appl. 41(8), 3861–3878 (2014)

    Article  Google Scholar 

  55. Yun, U., Ryang, H.: Incremental high utility pattern mining with static and dynamic databases. Appl. Intell. 42(2), 323–352 (2015)

    Article  Google Scholar 

  56. Liu, M., Qu, J.: Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 55–64. ACM (2012)

    Google Scholar 

  57. Liu, J., Wang, K., Fung, B.C.: Direct discovery of high utility itemsets without candidate generation. In: IEEE 12th International Conference on Data Mining, pp. 984–989. IEEE (2012)

    Google Scholar 

  58. Fournier-Viger, P., Wu, C.W., Zida, S., Tseng, V.S.: FHM: faster high-utility itemset mining using estimated utility co-occurrence pruning. In: International Symposium on Methodologies for Intelligent Systems, pp. 83–92. Springer, Cham (2014)

    Google Scholar 

  59. Zida, S., Fournier-Viger, P., Lin, J.C.W., Wu, C.W., Tseng, V.S.: EFIM: a fast and memory efficient algorithm for high-utility itemset mining. Knowl. Inf. Syst. 51(2), 595–625 (2017)

    Article  Google Scholar 

  60. Lin, C.W., Hong, T.P., Lan, G.C., Wong, J.W., Lin, W.Y.: Incrementally mining high utility patterns based on pre-large concept. Appl. Intell. 40(2), 343–357 (2014)

    Article  Google Scholar 

  61. Lin, J. C.W., Gan, W.S., Hong, T.P.: A fast maintenance algorithm of the discovered high-utility itemsets with transaction deletion. Intell. Data Anal. 20(4), 891–913 (2016)

    Article  Google Scholar 

  62. Lin, J.C.W., Gan, W., Hong, T.P.: A fast updated algorithm to maintain the discovered high-utility itemsets for transaction modification. Adv. Eng. Inf. 29(3), 562–574 (2015)

    Article  Google Scholar 

  63. Lin, C.W., Gan, W., Hong, T.P.: Maintaining the discovered high-utility itemsets with transaction modification. Appl. Intell. 44(1), 166–178 (2016)

    Article  Google Scholar 

  64. Zihayat, M., Hut, Z.Z., An, A., Hut, Y.: Distributed and parallel high utility sequential pattern mining. In: 2016 IEEE International Conference on Big Data (Big Data), pp. 853–862. IEEE (2016)

    Google Scholar 

  65. Lin, Y.C., Wu, C.W., Tseng, V.S.: Mining high utility itemsets in big data. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 649–661. Springer, Berlin (2015)

    Chapter  Google Scholar 

  66. Chen, Y., An, A.: Approximate parallel high utility itemset mining. Big Data Res. 6, 26–42 (2016)

    Article  Google Scholar 

  67. Zhang, L., Fu, G., Cheng, F., Qiu, J., Su, Y.: A multi-objective evolutionary approach for mining frequent and high utility itemsets. Appl. Soft Comput. 62, 974–986 (2017)

    Article  Google Scholar 

  68. Zhang, Q., Li, H.: MOEA/D: a multiobjective evolutionary algorithm based on decomposition. IEEE Trans. Evol. Comput. 11(6), 712–731 (2007)

    Article  Google Scholar 

  69. Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.A.M.T.: A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6(2), 182–197 (2002)

    Article  Google Scholar 

  70. Zitzler, E.: SPEA2: improving the strength pareto evolutionary algorithm for multiobjective optimization. In: EUROGEN 2001, Evolutionary Methods for Design, Optimization and Control with Applications to Industrial Problems, Athens, Greece (2001)

    Google Scholar 

  71. Cai, X., Li, Y., Fan, Z., Zhang, Q.: An external archive guided multiobjective evolutionary algorithm based on decomposition for combinatorial optimization. IEEE Trans. Evol. Comput. 19(4), 508–523 (2015)

    Article  Google Scholar 

  72. Miettinen, K.: Nonlinear Multiobjective Optimization, vol. 12. Springer Science & Business Media (2012)

    Google Scholar 

  73. Zhang, X., Tian, Y., Cheng, R., Jin, Y.: An efficient approach to nondominated sorting for evolutionary multiobjective optimization. IEEE Trans. Evol. Comput. 19(2), 201–213 (2015)

    Article  Google Scholar 

  74. Lin, J.C.W., Yang, L., Fournier-Viger, P., Wu, M.T., Hong, T.P., Wang, S.L., Zhan, J.: Mining high-utility itemsets based on particle swarm optimization. Eng. Appl. Artif. Intell. 55, 320–330 (2016)

    Article  Google Scholar 

  75. Lin, J.C.W., Yang, L., Fournier-Viger, P., Hong, T.P., Voznak, M.: A binary PSO approach to mine high-utility itemsets. Soft Comput. 21(17), 5103–5121 (2017)

    Article  Google Scholar 

  76. Wu, J.M.T., Zhan, J., Lin, J.C.W.: An ACO-based approach to mine high-utility itemsets. Knowl.-Based Syst. 116, 102–113 (2017)

    Article  Google Scholar 

  77. Fournier-Viger, P., Gomariz, A., Gueniche, T., Soltani, A., Wu, C.W., Tseng, V.S.: SPMF: a java open-source pattern mining library. J. Mach. Learn. Res. 15, 3389–3393 (2014)

    MATH  Google Scholar 

  78. Fournier-Viger, P, Lin, J.C.W., Dinh, T, Le, H.B.: Mining correlated high-utility itemsets using the bond measure. In: Proceedings of International Conference Hybrid Artificial Intelligence Systems, pp. 53–65. Seville, Spain, 18–20 April 2016

    Google Scholar 

  79. Fournier-Viger, P, Lin, C.W, Duong, Q.H., Dam, T.L.: PHM: mining periodic high-utility itemsets. In: Proceedings of 16th Industrial Conference on Data Mining, pp. 64–79. New York, USA, 13–17 July 2016

    Google Scholar 

  80. Lin, C.-W., Ren, S., Fournier-Viger, P., Hong, T.-P.: EHAUPM: efficient high average-utility pattern mining with tighter upper-bounds . IEEE Access 14(8), 13 (2016). IEEE

    Google Scholar 

  81. Truong, T., Duong, H., Le, B., Fournier-Viger, P.: Efficient vertical mining of high average-utility itemsets based on novel upper-bounds. In: IEEE Transactions on Knowledge and Data Engineering (TKDE) (2018). https://doi.org/10.1109/TKDE.2018.2833478.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Youcef Djenouri .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Djenouri, Y., Fournier-Viger, P., Belhadi, A., Chun-Wei Lin, J. (2019). Metaheuristics for Frequent and High-Utility Itemset Mining. In: Fournier-Viger, P., Lin, JW., Nkambou, R., Vo, B., Tseng, V. (eds) High-Utility Pattern Mining. Studies in Big Data, vol 51. Springer, Cham. https://doi.org/10.1007/978-3-030-04921-8_10

Download citation

Publish with us

Policies and ethics