Abstract
The recent explosive growth of our ability to generate and store data has created a need for new, scalable and efficient, tools for data analysis. The main focus of the discipline of knowledge discovery in databases is to address this need. Knowledge discovery in databases is the fusion of many areas that are concerned with different aspects of data handling and data analysis, including databases, machine learning, statistics, and algorithms. Each of these areas addresses a different part of the problem, and places different emphasis on different requirements. For example, database techniques are designed to efficiently handle relatively simple queries on large amounts of data stored in external (disk) storage. Machine learning techniques typically consider smaller data sets, and the emphasis is on the accuracy of a relatively complicated analysis task such as classification. The analysis of large data sets requires the design of new tools that not only combine and generalize techniques from different areas, but also require the design and development of altogether new scalable techniques.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
White Paper: The wide scale deployment of Active Data Mining Solutions, at: http://www.attar.com/tutor/deploy.htm.
Altman D, 1994, Fuzzy set theoretic approaches for handling imprecision in spatial analysis. International Journal of Geographical Information Systems, 8, pp. 271–289.
Coppock David S, “Data Mining and Modeling: Model Validity”, published in DM Review Online, March 2002 (http://www.dmreview.com/).
Glymour C, Madigan D, Pregibon D, Smyth P, “Statistical Inference and Data Mining”, in CACM v39 (11), 1996, pp. 35–42.
Halkidi M, Vazirgiannis M, “Clustering Validity Assessment: Finding the optimal partitioning of a data set”, in the Proceedings of IEEE International Conference on Data Mining (ICDM), California, USA, (2001).
Jiawei Han, Micheline Kamber. Data Mining: Concepts and Techniques. Morgan Kaufmann Publishers, 2001.
Halkidi M, Vazirgiannis M. “Managing Uncertainty and Quality in the Classification Process”, in the Proceeding of SETN 2002, Thessaloniki, Greece, 2002.
Kloegen W, “Explora: Al Mukipattem and Multistrategy Discovery Assistant”, in the book Advances in Knowledge Discovery and Data Mining (eds: U. Fayyad, et al.) AAAI Press, 1996.
STATISTICA, Text book manual at: http://www.statsoftinc.com/textbook/glosc.html#Cross-Validation.
Vazirgiannis M, Halkidi M. “Uncertainty handling in the datamining process with fuzzy logic”, in the Proceedings of the IEEE-FUZZY Conference, San Antonio, Texas, May, 2000.
Vatopoulos AC, Kalapothaki V, Legakis NJ & the Greek Network for the Surveillance of Antimicrobial Resistance: An Electronic Network for the Surveillance of Antimicrobial Resistance in Bacterial Nosocomial Isolates in Greece. WHO Bulletin, 1999; 77:595–601.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag London
About this chapter
Cite this chapter
Vazirgiannis, M., Halkidi, M., Gunopulos, D. (2003). Introduction. In: Uncertainty Handling and Quality Assessment in Data Mining. Advanced Information and Knowledge Processing. Springer, London. https://doi.org/10.1007/978-1-4471-0031-7_1
Download citation
DOI: https://doi.org/10.1007/978-1-4471-0031-7_1
Publisher Name: Springer, London
Print ISBN: 978-1-4471-1119-1
Online ISBN: 978-1-4471-0031-7
eBook Packages: Springer Book Archive