A Model PM for Preprocessing and Data Mining Proper Process

Wasilewska, Anita; Menasalvas, Ernestina; Scharff, Christelle

doi:10.1007/978-3-540-71200-8_21

Part of the book series: Lecture Notes in Computer Science (TRS, volume 4374)

Abstract

Data Mining, as defined in 1996 by Piatetsky-Shapiro [1], is a step (crucial, but a step nevertheless) in the KDD (Knowledge Discovery in Databases) process. According to this definition, the KDD process consists of the following steps: developing an understanding of the application domain; creating a target data set; choosing the data mining task, i.e., deciding whether the goal of the KDD process is classification, regression, clustering, etc.; choosing the data mining algorithm(s); data preprocessing; data mining (DM); interpreting mined patterns; deciding whether a re-iteration is needed; and consolidating discovered knowledge.
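
As a rough illustration only, the ordered steps listed in the abstract can be written down as a small Python sketch. The names `KDDStep` and `run_kdd`, the `max_passes` cap, and the choice to restart a re-iteration from the first step are all assumptions made for this example; the abstract does not specify where a re-iteration resumes, and this is not the model PM proposed in the chapter.

```python
from enum import Enum


class KDDStep(Enum):
    """The nine KDD steps, in the order the abstract lists them."""
    UNDERSTAND_DOMAIN = 1
    CREATE_TARGET_DATA = 2
    CHOOSE_TASK = 3            # classification, regression, clustering, ...
    CHOOSE_ALGORITHM = 4
    PREPROCESS = 5
    DATA_MINING = 6
    INTERPRET_PATTERNS = 7
    DECIDE_REITERATION = 8
    CONSOLIDATE_KNOWLEDGE = 9


def run_kdd(needs_another_pass, max_passes=3):
    """Return the sequence of steps visited by a hypothetical KDD run.

    needs_another_pass(pass_number) is a caller-supplied predicate standing in
    for the "deciding whether a re-iteration is needed" step.
    Assumption: a re-iteration restarts from the first step.
    """
    steps = list(KDDStep)
    trace = []
    for pass_number in range(1, max_passes + 1):
        # Every pass walks the steps up to and including DECIDE_REITERATION.
        trace.extend(steps[:-1])
        if not needs_another_pass(pass_number):
            break
    # Knowledge is consolidated once, after the final pass.
    trace.append(KDDStep.CONSOLIDATE_KNOWLEDGE)
    return trace


if __name__ == "__main__":
    # One re-iteration: the first pass asks for another, the second does not.
    for step in run_kdd(lambda n: n < 2):
        print(step.name)
```

In this sketch data mining proper (`DATA_MINING`) is one entry among nine, which mirrors the abstract's point that it is a single step of the wider KDD process.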

References

1. Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: From Data Mining to Knowledge Discovery: An Overview. In: Fayyad, U.M., et al. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 1–34. AAAI Press, Menlo Park (1996)
2. Inuiguchi, M., Tanino, T.: Classification versus Approximation Oriented Generalization of Rough Sets. Bulletin of International Rough Set Society 7(1/2) (2003)
3. Pawlak, Z.: Rough Sets: Theoretical Aspects of Reasoning About Data. Kluwer Academic Publishers, Dordrecht (1991)
4. Polkowski, L.: Rough Sets: Mathematical Foundations. Physica-Verlag, Heidelberg (2002)
5. Shearer, C.: The CRISP-DM Model: The New Blueprint for Data Mining. Journal of Data Warehousing 5(4), 13–22 (2000)
6. Wasilewska, A., Menasalvas, E.: Data Preprocessing and Data Mining as Generalization Process. In: Proceedings of ICDM’04, The Fourth IEEE International Conference on Data Mining, Brighton, UK, Nov 1-4, 2004, pp. 133–137. IEEE Computer Society Press, Los Alamitos (2004)
7. Wasilewska, A., Menasalvas, E.: Data Mining Operators. In: Proceedings of ICDM’04, The Fourth IEEE International Conference on Data Mining, Brighton, UK, Nov 1-4, 2004, pp. 209–214. IEEE Computer Society Press, Los Alamitos (2004)
8. Wasilewska, A., Menasalvas, E., Scharff, C.: Uniform Model for Data Mining. In: Proceedings of FDM05 (Foundations of Data Mining), in ICDM2005, Fifth IEEE International Conference on Data Mining, Austin, Texas, Nov 27-29, 2005, pp. 19–27. IEEE Computer Society Press, Los Alamitos (2005)
9. Wasilewska, A., Menasalvas Ruiz, E.: Data Mining as Generalization: A Formal Model. In: Lin, T.Y., et al. (eds.) Foundations and Novel Approaches in Data Mining. Studies in Computational Intelligence, vol. 9, pp. 99–126. Springer, Heidelberg (2006)
10. Ziarko, W.: Variable Precision Rough Set Model. Journal of Computer and System Sciences 46(1), 39–59 (1993)
11. Yao, J.T., Yao, Y.Y.: Induction of Classification Rules by Granular Computing. In: Alpigini, J.J., et al. (eds.) RSCTC 2002. LNCS (LNAI), vol. 2475, pp. 331–338. Springer, Heidelberg (2002)

Author information

Authors and Affiliations

Department of Computer Science, Stony Brook University, NY, USA
Anita Wasilewska
Departamento de Lenguajes y Sistemas Informaticos Facultad de Informatica, U.P.M, Madrid, Spain
Ernestina Menasalvas
Computer Science Department, Pace University, New York, NY, USA
Christelle Scharff

Editor information

James F. Peters, Andrzej Skowron, Ivo Düntsch, Jerzy Grzymała-Busse, Ewa Orłowska, Lech Polkowski

About this chapter

Cite this chapter

Wasilewska, A., Menasalvas, E., Scharff, C. (2007). A Model PM for Preprocessing and Data Mining Proper Process. In: Peters, J.F., Skowron, A., Düntsch, I., Grzymała-Busse, J., Orłowska, E., Polkowski, L. (eds) Transactions on Rough Sets VI. Lecture Notes in Computer Science, vol 4374. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71200-8_21

DOI: https://doi.org/10.1007/978-3-540-71200-8_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-71198-8
Online ISBN: 978-3-540-71200-8
eBook Packages: Computer Science, Computer Science (R0)
