Abstract
Finding frequent itemsets is a popular data mining problem that aims to extract hidden patterns from a transactional database. Several bio-inspired approaches have been proposed to overcome the poor performance of exact algorithms such as Apriori and FPGrowth. Approaches based on genetic algorithms are among the most efficient in terms of runtime, but they remain weak in terms of solution quality, i.e., the number of frequent itemsets discovered. To address this issue, this paper proposes GA-Apriori, a new genetic algorithm for finding frequent itemsets in which the crossover and mutation operators are defined by taking the Apriori heuristic principle into account. The results of our evaluation show that GA-Apriori outperforms other genetic-algorithm-based approaches to frequent itemset mining, especially on large instances. The experiments also show that GA-Apriori is competitive with exact approaches in terms of the number of frequent itemsets discovered.
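As a rough illustration of the two notions the abstract relies on, the following Python sketch computes the support of an itemset and applies the Apriori property (an itemset can only be frequent if all of its subsets one item smaller are frequent). It is not the GA-Apriori algorithm and does not reproduce the paper's crossover or mutation operators, which the abstract does not specify. The toy database and the names `TRANSACTIONS`, `support`, and `passes_apriori_check` are assumptions introduced only for this example.

```python
from itertools import combinations

# Hypothetical toy database: each transaction is a set of items.
TRANSACTIONS = [
    {"bread", "milk"},
    {"bread", "butter", "milk"},
    {"butter", "milk"},
    {"bread", "butter"},
]


def support(itemset, transactions):
    """Return the fraction of transactions containing every item of `itemset`."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def passes_apriori_check(candidate, frequent_smaller):
    """Necessary condition from the Apriori property.

    A k-itemset can only be frequent if each of its (k-1)-item subsets
    is in `frequent_smaller`, the set of known frequent (k-1)-itemsets.
    Passing the check does not guarantee that the candidate is frequent.
    """
    candidate = frozenset(candidate)
    return all(
        frozenset(subset) in frequent_smaller
        for subset in combinations(candidate, len(candidate) - 1)
    )
```

A search procedure, genetic or otherwise, could use a check of this kind to discard candidates before the more expensive support count over the whole database. Whether GA-Apriori applies it in exactly this way is not stated in the abstract.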
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Djenouri, Y., Comuzzi, M. (2017). GA-Apriori: Combining Apriori Heuristic and Genetic Algorithms for Solving the Frequent Itemsets Mining Problem. In: Kang, U., Lim, EP., Yu, J., Moon, YS. (eds) Trends and Applications in Knowledge Discovery and Data Mining. PAKDD 2017. Lecture Notes in Computer Science, vol 10526. Springer, Cham. https://doi.org/10.1007/978-3-319-67274-8_13
DOI: https://doi.org/10.1007/978-3-319-67274-8_13
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67273-1
Online ISBN: 978-3-319-67274-8
eBook Packages: Computer Science, Computer Science (R0)