Abstract
Data Mining, as defined in 1996 by Piatetsky-Shapiro ([1]) is a step (crucial, but a step nevertheless) in a KDD (Knowledge Discovery in Data Bases) process. The Piatetsky-Shapiro’s definition states that the KDD process consists of the following steps: developing an understanding of the application domain, creating a target data set, choosing the data mining task i.e. deciding whether the goal of the KDD process is classification, regression, clustering, etc..., choosing the data mining algorithm(s), data preprocessing, data mining (DM), interpreting mined patterns, deciding if a re-iteration is needed, and consolidating discovered knowledge.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: From Data Mining to Knowledge Discovery: An Overview. In: Fayyad, U.M., et al. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 1–34. AAAI Press, Menlo Park (1996)
Inuiguchi, M., Tanino, T.: Classification versus Approximation oriented Generalization of Rough Sets. Bulletin of International Rough Set Society 7(1/2) (2003)
Pawlak, Z.: Rough Sets- theoretical Aspects Reasoning About Data. Kluwer Academic Publishers, Dordrecht (1991)
Lech, P.: Rough Sets- Mathematical Foundations. Physica- Verlag, Heidelberg (2002)
Shearer, C.: The CRISP-DM Model: The New Blueprint for Data Mining. Journal of Data Warehousing 5(4), 13–22 (2000)
Wasilewska, A., Menasalvas, E.: Data Preprocessing and Data Mining as Generalization Process. In: Proceedings of ICDM’04, The Fourth IEEE International Conference on Data Mining, Brighton, UK, Nov 1-4, 2004, pp. 133–137. IEEE Computer Society Press, Los Alamitos (2004)
Wasilewska, A., Menasalvas, E.: Data Mining Operators. In: Proceedings of ICDM’04, The Fourth IEEE International Conference on Data Mining, Brighton, UK, Nov 1-4, 2004, pp. 209–214. IEEE Computer Society Press, Los Alamitos (2004)
Wasilewska, A., Menasalvas, E., Scharff, C.: Uniform Model for Data Mining. In: Proceedings of FDM05 (Foundations of Data Mining), in ICDM2005, Fifth IEEE International Conference on Data Mining, Austin, Texas, Nov 27-29, 2005, pp. 19–27. IEEE Computer Society Press, Los Alamitos (2005)
Wasilewska, A., Menasalvas Ruiz, E.: Data Mining as Generalization: A Formal Model. In: Lin, T.Y., et al. (eds.) Foundations and Novel Approaches in Data Mining. Studies in Computational intelligence, vol. 9, pp. 99–126. Springer, Heidelberg (2006)
Ziarko, W.: Variable Precision Rough Set Model. Journal of Computer and Systen Sciences 46(1), 39–59 (1993)
Yao, J.T., Yao, Y.Y.: Induction of Classification Rules by Granular Computing. In: Alpigini, J.J., et al. (eds.) RSCTC 2002. LNCS (LNAI), vol. 2475, pp. 331–338. Springer, Heidelberg (2002)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer Berlin Heidelberg
About this chapter
Cite this chapter
Wasilewska, A., Menasalvas, E., Scharff, C. (2007). A Model PM for Preprocessing and Data Mining Proper Process. In: Peters, J.F., Skowron, A., Düntsch, I., Grzymała-Busse, J., Orłowska, E., Polkowski, L. (eds) Transactions on Rough Sets VI. Lecture Notes in Computer Science, vol 4374. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71200-8_21
Download citation
DOI: https://doi.org/10.1007/978-3-540-71200-8_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71198-8
Online ISBN: 978-3-540-71200-8
eBook Packages: Computer ScienceComputer Science (R0)