Abstract
No one is born a data miner. In order to grow expertise as a data miner or as an information analyst, you need to obtain certain basic knowledge. Then you need data to mine, as well as a way to measure the important characteristics of a process or phenomenon, so you can employ the appropriate statistical tools. Measuring doesn’t necessarily mean using a ruler, calipers, or a scale. It can also be simply a “yes” or “no” decision. Statisticians and data miners typically categorize data as follows:
Variables data represent actual measured quantities, such as weights, dimensions, temperatures, proportions, and the like. The measurements have units associated with them (for example, inches, pounds, degrees Fahrenheit, and centimeters). Because variables data may take on any value within a certain range (subject to the precision of the measuring instrument), these observations are sometimes said to be continuous.
Attributes data on the other hand, represent the classification of measurements into one of two categories (such as “defective” or “nondefective”) or the number of occurrences of some phenomenon (such as the number of airplanes that arrive at an airport each hour). In most cases, these types of observations can only assume integer values, so attributes data are said to be discrete.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Rights and permissions
Copyright information
© 2001 Robert P. Trueblood and John N. Lovett, Jr.
About this chapter
Cite this chapter
Trueblood, R.P., Lovett, J.N. (2001). Basic Statistical Principles and Diagnostic Tree. In: Data Mining and Statistical Analysis Using SQL. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4302-0855-6_1
Download citation
DOI: https://doi.org/10.1007/978-1-4302-0855-6_1
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-893115-54-5
Online ISBN: 978-1-4302-0855-6
eBook Packages: Springer Book Archive