Integrating outlier filtering in large margin training

Zhou, Xi-chuan; Shen, Hai-bin; Ye, Jie-ping

doi:10.1631/jzus.C1000361

Published: 04 May 2011

Volume 12, pages 362–370, (2011)

Journal of Zhejiang University SCIENCE C

Abstract

Large margin classifiers such as support vector machines (SVMs) have been applied successfully in various classification tasks. However, their performance may degrade significantly in the presence of outliers. In this paper, we propose a robust SVM formulation that is shown to be less sensitive to outliers. The key idea is to employ an adaptively weighted hinge loss that explicitly incorporates outlier filtering into SVM training, so that outlier filtering and classification are performed simultaneously. The resulting robust SVM formulation is non-convex. We first relax it into a semi-definite program, which admits a global solution. To improve efficiency, we then develop an iterative approach. We have performed experiments on both synthetic and real-world data. The results show that the performance of the standard SVM degrades rapidly as more outliers are included, while the proposed robust SVM training is more stable in the presence of outliers.
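
The abstract does not give the exact formulation, so the following is only a minimal sketch of the general idea of a weighted hinge loss with alternating updates, not the authors' algorithm. It assumes a linear SVM, per-sample weights `eta` in {0, 1}, a fixed outlier budget `n_outliers`, and plain subgradient descent; all function names and parameters are hypothetical and chosen for illustration.

```python
import numpy as np

def weighted_hinge_loss(w, b, X, y, eta, C=1.0):
    """0.5*||w||^2 + C * sum_i eta_i * max(0, 1 - y_i * (w.x_i + b))."""
    margins = 1.0 - y * (X @ w + b)
    return 0.5 * (w @ w) + C * np.sum(eta * np.maximum(0.0, margins))

def fit_weighted_svm(X, y, eta, C=1.0, lr=0.01, epochs=500):
    """Subgradient descent on the weighted hinge objective (assumed solver)."""
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(epochs):
        active = (1.0 - y * (X @ w + b)) > 0   # samples violating the margin
        coef = C * eta * active * y            # weighted subgradient factors
        w -= lr * (w - X.T @ coef)
        b -= lr * (-np.sum(coef))
    return w, b

def robust_svm(X, y, n_outliers, C=1.0, rounds=5):
    """Alternate between fitting the classifier and re-selecting outliers.

    Assumption: the n_outliers samples with the largest hinge loss get
    weight 0 (filtered out); every other sample gets weight 1.
    """
    eta = np.ones(len(y))
    for _ in range(rounds):
        w, b = fit_weighted_svm(X, y, eta, C)
        losses = np.maximum(0.0, 1.0 - y * (X @ w + b))
        eta = np.ones(len(y))
        if n_outliers > 0:
            eta[np.argsort(losses)[-n_outliers:]] = 0.0
    # Final fit with the last set of weights.
    w, b = fit_weighted_svm(X, y, eta, C)
    return w, b, eta
```

Calling `robust_svm(X, y, n_outliers=k)` with labels in {-1, +1} returns the weight vector, the bias, and the 0/1 sample weights; samples with weight 0 are the ones the sketch treats as outliers. The hard budget `k` is a simplification made here and may differ from how the paper adapts its weights.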

Author information

Authors and Affiliations

College of Communication Engineering, Chongqing University, Chongqing, 400044, China
Xi-chuan Zhou
School of Electrical Engineering, Zhejiang University, Hangzhou, 310027, China
Hai-bin Shen
Department of Computer Science and Engineering, Arizona State University, Tempe, 85281, USA
Jie-ping Ye

Corresponding author

Correspondence to Xi-chuan Zhou.

About this article

Cite this article

Zhou, Xc., Shen, Hb. & Ye, Jp. Integrating outlier filtering in large margin training. J. Zhejiang Univ. - Sci. C 12, 362–370 (2011). https://doi.org/10.1631/jzus.C1000361

Received: 15 October 2010
Accepted: 23 February 2011
Published: 04 May 2011
Issue Date: May 2011
DOI: https://doi.org/10.1631/jzus.C1000361

Key words

CLC number

TP301
