Basic Statistical Principles and Diagnostic Tree

Trueblood, Robert P.; Lovett, John N.

doi:10.1007/978-1-4302-0855-6_1

Robert P. Trueblood &
John N. Lovett Jr.

Abstract

No one is born a data miner. To develop expertise as a data miner or as an information analyst, you need to obtain certain basic knowledge. Then you need data to mine, as well as a way to measure the important characteristics of a process or phenomenon, so you can employ the appropriate statistical tools. Measuring doesn’t necessarily mean using a ruler, calipers, or a scale. It can also be as simple as a “yes” or “no” decision. Statisticians and data miners typically categorize data as follows:

Variables data represent actual measured quantities, such as weights, dimensions, temperatures, proportions, and the like. The measurements have units associated with them (for example, inches, pounds, degrees Fahrenheit, and centimeters). Because variables data may take on any value within a certain range (subject to the precision of the measuring instrument), these observations are sometimes said to be continuous.

Attributes data, on the other hand, represent the classification of measurements into one of two categories (such as “defective” or “nondefective”) or the number of occurrences of some phenomenon (such as the number of airplanes that arrive at an airport each hour). In most cases, these types of observations can only assume integer values, so attributes data are said to be discrete.
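
The distinction between the two categories can be illustrated with a minimal Python sketch. It is not taken from the chapter: the function name `classify_observations` and the sample values are hypothetical, and the rule it applies (yes/no or integer values suggest attributes data, other real values suggest variables data) is only a rough heuristic. In practice the category depends on how the observation was made, not only on how it is stored; a weight rounded to whole pounds is still variables data.

```python
from numbers import Integral, Real


def classify_observations(values):
    """Rough heuristic (an assumption, not the chapter's method):
    label a sample as 'attributes' (discrete) or 'variables' (continuous)."""
    if not values:
        raise ValueError("need at least one observation")
    # bool is a subclass of int, so yes/no decisions count as Integral here.
    if all(isinstance(v, Integral) for v in values):
        return "attributes"
    # Any other real-valued sample is treated as a measured quantity.
    if all(isinstance(v, Real) for v in values):
        return "variables"
    raise TypeError("observations must be numeric or boolean")


# Hypothetical samples mirroring the examples in the text.
weights_lb = [12.4, 11.9, 12.1]      # measured quantities with units
defective = [True, False, False]     # classification into two categories
arrivals_per_hour = [14, 9, 17]      # number of occurrences per hour

for name, sample in [
    ("weights_lb", weights_lb),
    ("defective", defective),
    ("arrivals_per_hour", arrivals_per_hour),
]:
    print(name, "->", classify_observations(sample))
```

Under these assumptions, the weights are labeled as variables data, while the defect flags and the hourly arrival counts are labeled as attributes data.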

About this chapter

Cite this chapter

Trueblood, R.P., Lovett, J.N. (2001). Basic Statistical Principles and Diagnostic Tree. In: Data Mining and Statistical Analysis Using SQL. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4302-0855-6_1

DOI: https://doi.org/10.1007/978-1-4302-0855-6_1
Publisher Name: Apress, Berkeley, CA
Print ISBN: 978-1-893115-54-5
Online ISBN: 978-1-4302-0855-6
eBook Packages: Springer Book Archive