Skip to main content

Search Techniques for Automated Proposal of Data Mining Schemes

  • Conference paper
  • First Online:
Applied Computer Sciences in Engineering (WEA 2016)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 657))

Included in the following conference series:

  • 550 Accesses

Abstract

Data mining schemes, or workflows, are collections of interconnected machine learning models, including preprocessing procedures, and ensembles methods combinations. The proposal of data mining schemes for a task at hand has always been a task for experienced data scientists. We will study generating and testing workflows by automated procedures. Two representations of data mining schemes are used in this paper – a linear one, and a one based on direct acyclic graphs. Efficient procedures for generating schemes are presented and evaluated by testing the generated schemes on real data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Wolpert, D.H., Macready, W.G.: No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1(1), 67–82 (1997)

    Article  Google Scholar 

  2. Brazdil, P., Giraud-Carrier, C.G., Soares, C., Vilalta, R.: Metalearning – Applications to Data Mining. Cognitive Technologies. Springer, Heidelberg (2009)

    Google Scholar 

  3. Clarke, B., Fokoue, E., Zhang, H.H.: Principles and Theory for Data Mining and Machine Learning. Springer Series in Statistics. Springer, Heidelberg (2009)

    Book  MATH  Google Scholar 

  4. Neruda, R., Beuster, G.: Toward dynamic generation of computational agents by means of logical descriptions. Int. Trans. Syst. Sci. Appl., 139–144 (2008)

    Google Scholar 

  5. Bache, K., Lichman, M.: UCI machine learning repository (2013)

    Google Scholar 

  6. Kazík, O., Neruda, R.: Data mining process optimization in computational multi-agent systems. In: Cao, L., Zeng, Y., An, B., Symeonidis, A.L., Gorodetsky, V., Coenen, F., Yu, P.S. (eds.) ADMI 2014. LNCS (LNAI), vol. 9145, pp. 93–103. Springer International Publishing, Cham (2015). doi:10.1007/978-3-319-20230-3_8

    Chapter  Google Scholar 

  7. Pešková, K., Šmíd, J., Pilát, M., Kazík, O., Neruda, R.: Hybrid multi-agent system for metalearning in data mining. In: Vanschoren, J., Brazdil, P., Soares, C., Kotthoff, L. (eds.) Proceedings of the MetaSel@ECAI 2014. CEUR Workshop Proceedings, vol. 1201, pp. 53–54. CEUR-WS.org (2014)

    Google Scholar 

Download references

Acknowledgment

This work was supported by the Czech Science Foundation project no. P103-15-19877S. and the institutional support of the Institute of Computer Science, Czech Academy of Sciences RVO 67985807.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Roman Neruda .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Neruda, R. (2016). Search Techniques for Automated Proposal of Data Mining Schemes. In: Figueroa-García, J., López-Santana, E., Ferro-Escobar, R. (eds) Applied Computer Sciences in Engineering. WEA 2016. Communications in Computer and Information Science, vol 657. Springer, Cham. https://doi.org/10.1007/978-3-319-50880-1_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-50880-1_8

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-50879-5

  • Online ISBN: 978-3-319-50880-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics