Skip to main content

Basic Statistical Principles and Diagnostic Tree

  • Chapter
Data Mining and Statistical Analysis Using SQL

Abstract

No one is born a data miner. In order to grow expertise as a data miner or as an information analyst, you need to obtain certain basic knowledge. Then you need data to mine, as well as a way to measure the important characteristics of a process or phenomenon, so you can employ the appropriate statistical tools. Measuring doesn’t necessarily mean using a ruler, calipers, or a scale. It can also be simply a “yes” or “no” decision. Statisticians and data miners typically categorize data as follows:

Variables data represent actual measured quantities, such as weights, dimensions, temperatures, proportions, and the like. The measurements have units associated with them (for example, inches, pounds, degrees Fahrenheit, and centimeters). Because variables data may take on any value within a certain range (subject to the precision of the measuring instrument), these observations are sometimes said to be continuous.

Attributes data on the other hand, represent the classification of measurements into one of two categories (such as “defective” or “nondefective”) or the number of occurrences of some phenomenon (such as the number of airplanes that arrive at an airport each hour). In most cases, these types of observations can only assume integer values, so attributes data are said to be discrete.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Robert P. Trueblood and John N. Lovett, Jr.

About this chapter

Cite this chapter

Trueblood, R.P., Lovett, J.N. (2001). Basic Statistical Principles and Diagnostic Tree. In: Data Mining and Statistical Analysis Using SQL. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4302-0855-6_1

Download citation

  • DOI: https://doi.org/10.1007/978-1-4302-0855-6_1

  • Publisher Name: Apress, Berkeley, CA

  • Print ISBN: 978-1-893115-54-5

  • Online ISBN: 978-1-4302-0855-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics