Search Techniques for Automated Proposal of Data Mining Schemes

  • Roman Neruda
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 657)


Data mining schemes, or workflows, are collections of interconnected machine learning models, including preprocessing procedures and ensemble method combinations. Proposing a data mining scheme for a given task has traditionally required an experienced data scientist. We study the generation and testing of workflows by automated procedures. Two representations of data mining schemes are used in this paper: a linear one and one based on directed acyclic graphs. Efficient procedures for generating schemes are presented and evaluated by testing the generated schemes on real data.
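To make the two representations concrete, the following is a minimal sketch, not the paper's implementation. It assumes a linear scheme can be written as an ordered list of step names and a graph-based scheme as a mapping from each node to the nodes whose output it consumes. All step names (scaler, pca, decision_tree, naive_bayes, voting) and both helper functions are hypothetical and chosen only for illustration.

```python
from graphlib import TopologicalSorter, CycleError  # standard library, Python 3.9+

# Linear scheme: steps run one after another (hypothetical step names).
linear_scheme = ["scaler", "pca", "decision_tree"]

# Graph-based scheme: each node maps to the set of nodes it takes input from.
# Two models share one preprocessing step and feed an ensemble node.
dag_scheme = {
    "scaler": set(),
    "pca": {"scaler"},
    "decision_tree": {"pca"},
    "naive_bayes": {"scaler"},
    "voting": {"decision_tree", "naive_bayes"},
}


def linear_to_dag(steps):
    """Embed a linear scheme in the graph form: each step depends on the previous one."""
    return {
        step: ({steps[i - 1]} if i > 0 else set())
        for i, step in enumerate(steps)
    }


def execution_order(dag):
    """Return one valid execution order; raise ValueError if the graph has a cycle."""
    try:
        return list(TopologicalSorter(dag).static_order())
    except CycleError as exc:
        raise ValueError("scheme is not acyclic") from exc


if __name__ == "__main__":
    print(execution_order(linear_to_dag(linear_scheme)))
    print(execution_order(dag_scheme))
```

Under these assumptions every linear scheme is a special case of the graph form (a simple chain), while the graph form can also express branching and ensembles. The acyclicity check matters because a generated scheme with a cycle has no valid execution order.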


Direct Acyclic Graph; Description Logic; Machine Learning Model; Data Mining Process; Computational Intelligence Method
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



This work was supported by the Czech Science Foundation project no. P103-15-19877S and by the institutional support of the Institute of Computer Science, Czech Academy of Sciences, RVO 67985807.



Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  1. Institute of Computer Science, Academy of Sciences of the Czech Republic, Prague, Czech Republic
