Skip to main content

Data Mining

  • Living reference work entry
  • First Online:
Encyclopedia of Database Systems

Synonyms

Data analysis; Knowledge discovery from data; Pattern discovery

Definition

Data miningis the process of discovering knowledge or patterns from massive amounts of data. As a young research field, data mining represents the confluence of a number of research fields, including database systems, machine learning, statistics, pattern recognition, high-performance computing, and specific application fields, such as WWW, multimedia, and bioinformatics, with broad applications. As an interdisciplinary field, data mining has several major research themes based on its mining tasks, including pattern-mining and analysis, classification and predictive modeling, cluster and outlier analysis, and multidimensional (OLAP) analysis. Data mining can also be categorized based on the kinds of data to be analyzed, such as multi-relational data mining, text mining, stream mining, web mining, multimedia (or image, video) mining, spatiotemporal data mining, information network analysis, biological...

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Recommended Reading

  1. Duda RO, Hart PE, Stork DG. Pattern classification. 2nd ed. New York: Wiley; 2001.

    MATH  Google Scholar 

  2. Han J, Kamber M. Data mining: concepts and techniques. 2nd ed. Amsterdam: Morgan Kaufmann; 2006.

    MATH  Google Scholar 

  3. Hastie T, Tibshirani R, Friedman J. The elements of statistical learning: data mining, inference, and prediction. New York: Springer; 2001.

    Book  MATH  Google Scholar 

  4. Tan P, Steinbach M, Kumar V. Introduction to data mining. Boston: Addison Wesley; 2005.

    Google Scholar 

  5. Witten IH, Frank E. Data mining: practical machine learning tools and techniques. 2nd ed. Amsterdam: Morgan Kaufmann; 2005.

    MATH  Google Scholar 

  6. Dasu T, Johnson T. Exploratory data mining and data cleaning. New York: Wiley; 2003.

    Book  MATH  Google Scholar 

  7. Chakrabarti S. Mining the web: statistical analysis of hypertex and semi-structured data. Morgan Kaufmann; 2002.

    Google Scholar 

  8. Liu B. Web data mining: exploring hyperlinks, contents, and usage data. New York: Springer; 2006.

    MATH  Google Scholar 

  9. Agrawal R, Srikant R. Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Data Bases; 1994. p. 487–99.

    Google Scholar 

  10. Han J, Cheng H, Xin D, Yan X. Frequent pattern mining: current status and future directions. Data Min Knowl Disc. 2007;15:55–86.

    Article  MathSciNet  Google Scholar 

  11. Mitchell TM. Machine learning. New York: McGraw-Hill; 1997.

    MATH  Google Scholar 

  12. Cheng H, Yan X, Han J, Yu PS. Direct discriminative pattern mining for effective classification. In: Proceedings of the 24th International Conference on Data Engineering; 2008. p. 169–78.

    Google Scholar 

  13. Zhang T, Ramakrishnan R, Livny M. BIRCH: an efficient data clustering method for very large databases. In: Proceedings of the ACM-SIGMOD International Conference on Management of Data; 1996.p. 103–14.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jiawei Han .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Science+Business Media LLC

About this entry

Cite this entry

Han, J. (2016). Data Mining. In: Liu, L., Özsu, M. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-7993-3_104-2

Download citation

  • DOI: https://doi.org/10.1007/978-1-4899-7993-3_104-2

  • Received:

  • Accepted:

  • Published:

  • Publisher Name: Springer, New York, NY

  • Online ISBN: 978-1-4899-7993-3

  • eBook Packages: Springer Reference Computer SciencesReference Module Computer Science and Engineering

Publish with us

Policies and ethics