Abstract
The goal of this chapter is to identify data preprocessing tasks that can benefit from the existence of software support, and to describe the basic requirements on the tool, which can serve this purpose. These requirements are implemented in the data transformation tool, SumatraTT. The design principles and basic functionality of SumatraTT are explained. The chapter concludes by a brief evaluation of experience gained using SumatraTT was in different tasks, and with a summary of plans for its further development.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aubrecht, P. and Kouba, Z. (2001). Meta-Data Driven Data Transformation. Proc. 5th World Muti-conference on Systemics, Cybernetics and Informatics.
Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C. and Wirth, R. (2000). CRISP-DM 1.0: Step-by-step data mining guide, CRISP-DM consortium, http://www.crisp-dm.org
CRISP-DM (1999). Crosss Industry Standard Process for Data-Mining, http://www.cnspdm.org/
Kietz, J. U., Vaduva, A. and Zücker, R. (2001). MiningMart: Metadata-Driven Preprocessing. Proc. ECMUPKDD Workshop on Database Support for KDD.
Mikšovský, P. and Kouba, Z. (2001). GOLAP — Geographical Online Analytical Processing, Database and Expert Systems Applications, Vol. 1, 442–449.
MiningMart (2003). MiningMart project:, http://www-ai.cs.uni-dortund.de/MMWEB/research/index.html
Morik, K. and Scholz, M. (2003). The MiningMart Approach to Knowledge Discovery in Databases, In (eds. Zhong, N. and Liu, J.), Handbook of Intelligent IT, IOS Press.
PMML (2001). Predictive Model Markup Language specification, http://www.dmg.org/pmml-v2-0.htm/pmml-v2-0.htm
Pyle, D. (1999), Data Preparation for Data Mining, Morgan Kaufmann.
SumatraTT (2003). available at, http://krizik.felk.cvut.cz/Sumatra/Sumatra
Zücker, R. and Kietz, J.-U. (2000). How to preprocess large databases, In Proc. PKDD 2000 Workshop on Data Mining, Decision Support, Meta-learning and ILP: Forum for Practical Problem Presentation and Prospective Solutions, (eds. Brazdil P., Jorge A.), University of Porto.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer Science+Business Media New York
About this chapter
Cite this chapter
Štěpánková, O., Aubrecht, P., Kouba, Z., Mikšovský, P. (2003). Preprocessing for Data Mining and Decision Support. In: Mladenić, D., Lavrač, N., Bohanec, M., Moyle, S. (eds) Data Mining and Decision Support. The Springer International Series in Engineering and Computer Science, vol 745. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-0286-9_9
Download citation
DOI: https://doi.org/10.1007/978-1-4615-0286-9_9
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5004-0
Online ISBN: 978-1-4615-0286-9
eBook Packages: Springer Book Archive