Skip to main content

A Model PM for Preprocessing and Data Mining Proper Process

  • Chapter
Transactions on Rough Sets VI

Part of the book series: Lecture Notes in Computer Science ((TRS,volume 4374))

Abstract

Data Mining, as defined in 1996 by Piatetsky-Shapiro ([1]) is a step (crucial, but a step nevertheless) in a KDD (Knowledge Discovery in Data Bases) process. The Piatetsky-Shapiro’s definition states that the KDD process consists of the following steps: developing an understanding of the application domain, creating a target data set, choosing the data mining task i.e. deciding whether the goal of the KDD process is classification, regression, clustering, etc..., choosing the data mining algorithm(s), data preprocessing, data mining (DM), interpreting mined patterns, deciding if a re-iteration is needed, and consolidating discovered knowledge.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P.: From Data Mining to Knowledge Discovery: An Overview. In: Fayyad, U.M., et al. (eds.) Advances in Knowledge Discovery and Data Mining, pp. 1–34. AAAI Press, Menlo Park (1996)

    Google Scholar 

  2. Inuiguchi, M., Tanino, T.: Classification versus Approximation oriented Generalization of Rough Sets. Bulletin of International Rough Set Society 7(1/2) (2003)

    Google Scholar 

  3. Pawlak, Z.: Rough Sets- theoretical Aspects Reasoning About Data. Kluwer Academic Publishers, Dordrecht (1991)

    MATH  Google Scholar 

  4. Lech, P.: Rough Sets- Mathematical Foundations. Physica- Verlag, Heidelberg (2002)

    MATH  Google Scholar 

  5. Shearer, C.: The CRISP-DM Model: The New Blueprint for Data Mining. Journal of Data Warehousing 5(4), 13–22 (2000)

    Google Scholar 

  6. Wasilewska, A., Menasalvas, E.: Data Preprocessing and Data Mining as Generalization Process. In: Proceedings of ICDM’04, The Fourth IEEE International Conference on Data Mining, Brighton, UK, Nov 1-4, 2004, pp. 133–137. IEEE Computer Society Press, Los Alamitos (2004)

    Google Scholar 

  7. Wasilewska, A., Menasalvas, E.: Data Mining Operators. In: Proceedings of ICDM’04, The Fourth IEEE International Conference on Data Mining, Brighton, UK, Nov 1-4, 2004, pp. 209–214. IEEE Computer Society Press, Los Alamitos (2004)

    Google Scholar 

  8. Wasilewska, A., Menasalvas, E., Scharff, C.: Uniform Model for Data Mining. In: Proceedings of FDM05 (Foundations of Data Mining), in ICDM2005, Fifth IEEE International Conference on Data Mining, Austin, Texas, Nov 27-29, 2005, pp. 19–27. IEEE Computer Society Press, Los Alamitos (2005)

    Google Scholar 

  9. Wasilewska, A., Menasalvas Ruiz, E.: Data Mining as Generalization: A Formal Model. In: Lin, T.Y., et al. (eds.) Foundations and Novel Approaches in Data Mining. Studies in Computational intelligence, vol. 9, pp. 99–126. Springer, Heidelberg (2006)

    Google Scholar 

  10. Ziarko, W.: Variable Precision Rough Set Model. Journal of Computer and Systen Sciences 46(1), 39–59 (1993)

    Article  MATH  MathSciNet  Google Scholar 

  11. Yao, J.T., Yao, Y.Y.: Induction of Classification Rules by Granular Computing. In: Alpigini, J.J., et al. (eds.) RSCTC 2002. LNCS (LNAI), vol. 2475, pp. 331–338. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

James F. Peters Andrzej Skowron Ivo Düntsch Jerzy Grzymała-Busse Ewa Orłowska Lech Polkowski

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer Berlin Heidelberg

About this chapter

Cite this chapter

Wasilewska, A., Menasalvas, E., Scharff, C. (2007). A Model PM for Preprocessing and Data Mining Proper Process. In: Peters, J.F., Skowron, A., Düntsch, I., Grzymała-Busse, J., Orłowska, E., Polkowski, L. (eds) Transactions on Rough Sets VI. Lecture Notes in Computer Science, vol 4374. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-71200-8_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-71200-8_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-71198-8

  • Online ISBN: 978-3-540-71200-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics