Abstract
Many real data sets are very high dimensional. In some scenarios, real data sets may contain hundreds or thousands of dimensions. With increasing dimensionality, many of the conventional outlier detection methods do not work very effectively. This is an artifact of the well-known curse of dimensionality. In high-dimensional space, the data becomes sparse, and the true outliers become masked by the noise effects of multiple irrelevant dimensions, when analyzed in full dimensionality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this chapter
Cite this chapter
Aggarwal, C.C. (2017). High-Dimensional Outlier Detection: The Subspace Method. In: Outlier Analysis. Springer, Cham. https://doi.org/10.1007/978-3-319-47578-3_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-47578-3_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47577-6
Online ISBN: 978-3-319-47578-3
eBook Packages: Computer ScienceComputer Science (R0)