Unsupervised Learning: Clustering

Clarke, Bertrand; Fokoué, Ernest; Zhang, Hao Helen

doi:10.1007/978-0-387-98135-2_8

Part of the book series: Springer Series in Statistics (SSS)

In contrast to supervised learning, unsupervised learning fits a model to observations assuming there is no dependent random variable, output, or response. That is, a set of input observations is gathered, treated as a set of random variables, and analyzed as is. None of the observations is treated differently from the others. An informal way to say this is that there is no Y. For this reason, classification data, which includes Y as the class, is sometimes called labeled data, whereas clustering data is called unlabeled. It is then as if the task of clustering is to surmise what variable Y should have been measured (but wasn't). Another way to think of this is to assume that there are n independent data vectors (X_1, ..., X_p, Y) but that all the Y_i are missing, and in fact someone has even hidden the definition of Y.
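
The "hidden Y" view can be made concrete with a small sketch. The example below is only an illustration under stated assumptions: the synthetic two-group data, the choice of k-means (Lloyd's algorithm) as the clustering method, and all function names are hypothetical and are not taken from the chapter. It generates labeled data, discards the labels, clusters the inputs alone, and then checks how well the clusters line up with the labels that were withheld.

```python
import random


def make_labeled_data(n_per_class=50, seed=0):
    """Draw 2-D points from two well-separated Gaussian groups.

    Returns the inputs x and the labels y. The labels play the role of
    the variable Y that the clustering step never gets to see.
    """
    rng = random.Random(seed)
    group_means = [(0.0, 0.0), (10.0, 10.0)]  # assumed, for illustration only
    xs, ys = [], []
    for label, (mx, my) in enumerate(group_means):
        for _ in range(n_per_class):
            xs.append((rng.gauss(mx, 1.0), rng.gauss(my, 1.0)))
            ys.append(label)
    return xs, ys


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def k_means(points, k=2, n_iter=20, seed=0):
    """Plain Lloyd iterations: assign each point to its nearest center,
    then move each center to the mean of its assigned points."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # start from k distinct data points
    assignments = [0] * len(points)
    for _ in range(n_iter):
        assignments = [
            min(range(k), key=lambda j: sq_dist(p, centers[j])) for p in points
        ]
        for j in range(k):
            members = [p for p, a in zip(points, assignments) if a == j]
            if members:  # keep the old center if a cluster is empty
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return assignments


def agreement(found, hidden):
    """Fraction of points on which two 2-group labelings agree.

    Cluster numbers are arbitrary, so the labeling and its flip are both
    tried and the better match is reported.
    """
    match = sum(f == h for f, h in zip(found, hidden)) / len(hidden)
    return max(match, 1.0 - match)


x, hidden_y = make_labeled_data()
clusters = k_means(x)  # only x is passed in; hidden_y is never used here
score = agreement(clusters, hidden_y)
print(f"agreement with the hidden Y: {score:.2f}")
```

The comparison against `hidden_y` is possible only because the data are simulated. With real unlabeled data there is no Y to check against, which is what makes evaluating a clustering harder than evaluating a classifier.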

Author information

Authors and Affiliations

University of Miami, 120 NW 14th Street CRB 1055 (C-213), Miami, FL 33136, USA
Bertrand Clarke
Department of Science & Mathematics, Kettering University, 1700 W. Third Ave, Flint, MI 48504-4898, USA
Ernest Fokoué
Department of Statistics, North Carolina State University Program in Statistical Genetics, P.O. Box 8203, Raleigh, NC 27695-8203, USA
Hao Helen Zhang

Corresponding author

Correspondence to Bertrand Clarke.

About this chapter

Cite this chapter

Clarke, B., Fokoué, E., Zhang, H.H. (2009). Unsupervised Learning: Clustering. In: Principles and Theory for Data Mining and Machine Learning. Springer Series in Statistics. Springer, New York, NY. https://doi.org/10.1007/978-0-387-98135-2_8

DOI: https://doi.org/10.1007/978-0-387-98135-2_8
Published: 15 June 2009
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-98134-5
Online ISBN: 978-0-387-98135-2
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)
