Skip to main content

Preprocessing for Data Mining and Decision Support

  • Chapter
Data Mining and Decision Support

Abstract

The goal of this chapter is to identify data preprocessing tasks that can benefit from the existence of software support, and to describe the basic requirements on the tool, which can serve this purpose. These requirements are implemented in the data transformation tool, SumatraTT. The design principles and basic functionality of SumatraTT are explained. The chapter concludes by a brief evaluation of experience gained using SumatraTT was in different tasks, and with a summary of plans for its further development.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Aubrecht, P. and Kouba, Z. (2001). Meta-Data Driven Data Transformation. Proc. 5th World Muti-conference on Systemics, Cybernetics and Informatics.

    Google Scholar 

  • Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C. and Wirth, R. (2000). CRISP-DM 1.0: Step-by-step data mining guide, CRISP-DM consortium, http://www.crisp-dm.org

    Google Scholar 

  • CRISP-DM (1999). Crosss Industry Standard Process for Data-Mining, http://www.cnspdm.org/

    Google Scholar 

  • Kietz, J. U., Vaduva, A. and Zücker, R. (2001). MiningMart: Metadata-Driven Preprocessing. Proc. ECMUPKDD Workshop on Database Support for KDD.

    Google Scholar 

  • Mikšovský, P. and Kouba, Z. (2001). GOLAP — Geographical Online Analytical Processing, Database and Expert Systems Applications, Vol. 1, 442–449.

    Article  Google Scholar 

  • MiningMart (2003). MiningMart project:, http://www-ai.cs.uni-dortund.de/MMWEB/research/index.html

  • Morik, K. and Scholz, M. (2003). The MiningMart Approach to Knowledge Discovery in Databases, In (eds. Zhong, N. and Liu, J.), Handbook of Intelligent IT, IOS Press.

    Google Scholar 

  • PMML (2001). Predictive Model Markup Language specification, http://www.dmg.org/pmml-v2-0.htm/pmml-v2-0.htm

    Google Scholar 

  • Pyle, D. (1999), Data Preparation for Data Mining, Morgan Kaufmann.

    Google Scholar 

  • SumatraTT (2003). available at, http://krizik.felk.cvut.cz/Sumatra/Sumatra

  • Zücker, R. and Kietz, J.-U. (2000). How to preprocess large databases, In Proc. PKDD 2000 Workshop on Data Mining, Decision Support, Meta-learning and ILP: Forum for Practical Problem Presentation and Prospective Solutions, (eds. Brazdil P., Jorge A.), University of Porto.

    Google Scholar 

Download references

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer Science+Business Media New York

About this chapter

Cite this chapter

Štěpánková, O., Aubrecht, P., Kouba, Z., Mikšovský, P. (2003). Preprocessing for Data Mining and Decision Support. In: Mladenić, D., Lavrač, N., Bohanec, M., Moyle, S. (eds) Data Mining and Decision Support. The Springer International Series in Engineering and Computer Science, vol 745. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-0286-9_9

Download citation

  • DOI: https://doi.org/10.1007/978-1-4615-0286-9_9

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-5004-0

  • Online ISBN: 978-1-4615-0286-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics