Abstract
The Support Vector Data Description (SVDD) has been introduced to address the problem of anomaly (or outlier) detection. It essentially fits the smallest possible sphere around the given data points, allowing some points to be excluded as outliers. Whether or not a point is excluded, is governed by a slack variable. Mathematically, the values for the slack variables are obtained by minimizing a cost function that balances the size of the sphere against the penalty associated with outliers. In this paper we argue that the SVDD slack variables lack a clear geometric meaning, and we therefore re-analyze the cost function to get a better insight into the characteristics of the solution. We also introduce and analyze two new definitions of slack variables and show that one of the proposed methods behaves more robustly with respect to outliers, thus providing tighter bounds compared to SVDD.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Ben-Hur, A., Horn, D., Siegelmann, H.T., Vapnik, V.: Support vector clustering. Journal of Machine Learning Research 2, 125–137 (2001)
Lee, J., Lee, D.: An improved cluster labeling method for support vector clustering. IEEE Transactions on Pattern Analysis and Machine Intelligence 27, 461–464 (2005)
Lee, S., Daniels, K.: Cone cluster labeling for support vector clustering. In: Proceedings of SIAM Conference on Data Mining 2006, pp. 484–488 (2006)
Schölkopf, B., Williamson, R.C.: Shrinking the tube: A new support vector regression algorithm. Advances in Neural Information Processing Systems (1999)
Schölkopf, B., Williamson, R., Smola, A., Shawe-Taylor, J., Platt, J.: Support vector method for novelty detection. Advances in Neural Information Processing Systems 12, 582–588 (2000)
Shioda, R., Tuncel, L.: Clustering via Minimum Volume Ellipsoids. Journal of Comp. Optimization and App. 37(3) (2007)
Small, C.G.: A survey of multidimensional medians. International Statistical Review 58(3), 263–277 (1990)
Tax, D.M.J., Duin, R.P.W.: Support vector domain description. Pattern Recognition Letters 20(11-13), 1191–1199 (1999)
Tax, D.M.J.L.: One-class classification: concept learning in the absence of counter example. PhD Thesis, TU Delft (2001)
Ypma, A., Duin, R.: Support objects for domain approximation. In: ICANN, Skovde, Sweden (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pauwels, E.J., Ambekar, O. (2011). One Class Classification for Anomaly Detection: Support Vector Data Description Revisited. In: Perner, P. (eds) Advances in Data Mining. Applications and Theoretical Aspects. ICDM 2011. Lecture Notes in Computer Science(), vol 6870. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23184-1_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-23184-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23183-4
Online ISBN: 978-3-642-23184-1
eBook Packages: Computer ScienceComputer Science (R0)