Skip to main content

A Careful Look at the Use of Statistical Methodology in Data Mining

  • Chapter
  • First Online:
Foundations of Data Mining and knowledge Discovery

Part of the book series: Studies in Computational Intelligence ((SCI,volume 6))

  • 302 Accesses

Abstract

Knowledge discovery in databases (KDD) is an inherently statistical activity, with a considerable literature drawing upon statistical science. However, the usage has typically been vague and informal at best, and at worst of a seriously misleading nature. In addition, much of the classical statistical methodology was designed for goals which can be very different from those of KDD. The present paper seeks to take a first step in remedying this problem by pairing precise mathematical descriptions of some of the concepts in KDD with practical interpretations and implications for specific KDD issues.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Author information

Authors and Affiliations

Authors

Editor information

Tsau Young Lin Setsuo Ohsuga Churn-Jung Liau Xiaohua Hu Shusaku Tsumoto

Rights and permissions

Reprints and permissions

About this chapter

Cite this chapter

Matloff, N. A Careful Look at the Use of Statistical Methodology in Data Mining. In: Young Lin, T., Ohsuga, S., Liau, CJ., Hu, X., Tsumoto, S. (eds) Foundations of Data Mining and knowledge Discovery. Studies in Computational Intelligence, vol 6. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11498186_6

Download citation

  • DOI: https://doi.org/10.1007/11498186_6

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26257-2

  • Online ISBN: 978-3-540-32408-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics