Abstract
Data mining, the central activity in the process of knowledge discovery in databases, is concerned with finding patterns in data. This chapter introduces and illustrates the most common types of patterns considered by data mining approaches and gives rough outlines of the data mining algorithms that are most frequently used to look for such patterns. It also briefly introduces relational data mining, starting with patterns that involve multiple relations and laying down the basic principles common to relational data mining algorithms. An overview of the contents of this book is given, as well as pointers to literature and Internet resources on data mining.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
P. Adriaans and D. Zantinge. Data Mining. Addison-Wesley, Reading, 1996.
R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In Proceedings of the ACM SIGMOD Conference on Management of Data, pages 207–216. ACM Press, New York, 1993.
M.J.A. Berry and G. Linoff. Data Mining Techniques for Marketing, Sales and Customer Support. John Wiley and Sons, New York, 1997.
M.J.A. Berry and G. Linoff. Mastering Data Mining: The Art and Science of Customer Relationship Management. John Wiley and Sons, New York, 1999.
A. Berson and S.J. Smith. Data Warehousing, Data Mining and OLAP. McGraw-Hill, New York, 1997.
M. Berthold and D.J. Hand, editors. Intelligent Data Analysis: An Introduction. Springer, Berlin, 1999.
L. Breiman, J. Friedman, R. Olshen, and C. Stone. Classification and Regression Trees. Wadsworth, Belmont, CA, 1984.
B. Cestnik. Estimating probabilities: A crucial task in machine learning. In Proceedings of the Ninth European Conference on Artificial Intelligence, pages 147–149. Pitman, London.
P. Clark and R. Boswell. Rule induction with CN2: Some recent improvements. In Proceedings of the Fifth European Working Session on Learning, pages 151–163. Springer, Berlin, 1991.
B.V. Dasarathy, editor. Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques. IEEE Computer Society Press, Los Alamitos, CA, 1990.
S. Džeroski, L. Todorovski, I. Bratko, B. Kompare, and V. Križman. Equation discovery with ecological applications. In A.H. Fielding, editor, Machine Learning Methods for Ecological Applications, pages 185–207. Kluwer, Boston, 1999.
U. Fayyad, G. Piatetsky-Shapiro, and P. Smyth. From data mining to knowledge discovery: An overview. In U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, editors, Advances in Knowledge Discovery and Data Mining, pages 1–34. MIT Press, Cambridge, MA, 1996.
U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, editors. Advances in Knowledge Discovery and Data Mining. MIT Press, Cambridge, MA, 1996.
W. Frawley, G. Piatetsky-Shapiro, and C. Matheus Knowledge discovery in databases: An overview. In G. Piatetsky-Shapiro and W. Frawley, editors, Knowledge Discovery in Databases, pages 1–27. MIT Press, Cambridge, MA, 1991.
R. Groth. Data Mining: A Hands-On Approach for Business Professionals Prentice Hall, Upper Saddle River, NJ, 1997.
J. Han and M. Kamber. Data Mining: Concepts and Techniques. Morgan Kaufmann, San Francisco, CA, 2001.
R.V. Hogg and A.T. Craig. Introduction to Mathematical Statistics, 5th edition. Prentice Hall, Englewood Cliffs, NJ, 1995.
L. Kaufman and P. J. Rousseeuw. Finding Groups in Data: An Introduction to Cluster Analysis. Wiley & Sons, New York, 1990.
N. Lavrac and S. Džeroski. Inductive Logic Programming: Techniques and Applications. Ellis Horwood, Chichester, 1994. Freely available at http://www-ai.ijs.si/SasoDzeroski/ILPBook/.
R.S. Michalski, I. Bratko, and M. Kubat, editors, Machine Learning, Data Mining and Knowledge Discovery: Methods and Applications. John Wiley and Sons, Chichester, 1997.
S. Muggleton. Inductive logic programming. New Generation Computing, 8(4): 295–318, 1991.
J. Pearl. Probabilistic Reasoning in Intelligent Systems. Morgan Kaufmann, San Mateo, 1988.
G. Piatetsky-Shapiro and W. Frawley, editors. Knowledge Discovery in Databases. MIT Press, Cambridge, MA, 1991.
D. Pyle. Data Preparation for Data Mining. Morgan Kaufmann, San Francisco, CA, 1999.
J. R. Quinlan. Induction of decision trees. Machine Learning, 1: 81–106, 1986.
P. Taylor. Statistical methods. In M. Berthold and D.J. Hand, editors, Intelligent Data Analysis: An Introduction, pages 67–127. Springer, Berlin, 1999.
S. Weiss and N. Indurkhya. Predictive Data Mining: A Practical Guide. Morgan Kaufmann, San Francisco, CA, 1997.
LH. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, San Francisco, CA, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Džeroski, S. (2001). Data Mining in a Nutshell. In: Džeroski, S., Lavrač, N. (eds) Relational Data Mining. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-04599-2_1
Download citation
DOI: https://doi.org/10.1007/978-3-662-04599-2_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-07604-6
Online ISBN: 978-3-662-04599-2
eBook Packages: Springer Book Archive