Multiple Kernel Learning Improved by MMD

  • Conference paper
Advanced Data Mining and Applications (ADMA 2010)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6441)

Abstract

When training and testing data are drawn from different distributions, the performance of a classification model typically degrades. Such a mismatch usually arises from sample selection bias or transfer learning scenarios. In this paper, we propose a novel multiple kernel learning framework improved by Maximum Mean Discrepancy (MMD) to address this problem. The new model not only exploits the capacity of kernel learning to construct a nonlinear hyperplane that maximizes the separation margin, but simultaneously reduces the distribution discrepancy between training and testing data, as measured by MMD. The approach is formulated as a bi-objective optimization problem, which is then solved by an efficient algorithm combining gradient descent with quadratic programming [13]. Extensive experiments on UCI and text datasets show that the proposed model outperforms the traditional multiple kernel learning model in sample selection bias and transfer learning scenarios.
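
The paper's exact formulation is behind the paywall, so the following is only a minimal sketch of the alternating scheme the abstract describes: solve an SVM by quadratic programming on a fixed combination of base kernels, then take a gradient step on the kernel weights that trades the margin objective against the MMD between training and testing samples. The RBF bandwidths, the trade-off weight lam, the learning rate, and the helper name mkl_mmd_fit are all illustrative assumptions, not the authors' implementation; scikit-learn's SVC stands in for the QP solver.

    import numpy as np
    from sklearn.metrics.pairwise import rbf_kernel
    from sklearn.svm import SVC

    def mkl_mmd_fit(X_train, y_train, X_test, gammas=(0.1, 1.0, 10.0),
                    lam=1.0, lr=0.05, n_iter=20):
        """Sketch of MKL improved by MMD: alternate a QP-solved SVM with a
        projected gradient step on the kernel weights.
        Assumes binary labels in {-1, +1}; all hyperparameters are guesses."""
        n, m = len(X_train), len(X_test)
        X_all = np.vstack([X_train, X_test])
        # Base RBF kernels over train + test points (bandwidths are assumed).
        Ks = [rbf_kernel(X_all, X_all, gamma=g) for g in gammas]
        # MMD^2 of the combined kernel is s^T K s, hence *linear* in the
        # weights: s_i = 1/n on training points, -1/m on testing points.
        s = np.concatenate([np.full(n, 1.0 / n), np.full(m, -1.0 / m)])
        d = np.full(len(Ks), 1.0 / len(Ks))  # uniform initial kernel weights
        tr = slice(0, n)
        Y = np.outer(y_train, y_train)
        for _ in range(n_iter):
            K = sum(w * Kk for w, Kk in zip(d, Ks))
            # Step 1 (QP): sklearn's SVC solves the SVM dual on the fixed kernel.
            svm = SVC(C=1.0, kernel="precomputed").fit(K[tr, tr], y_train)
            alpha = np.zeros(n)
            alpha[svm.support_] = np.abs(svm.dual_coef_[0])
            # Step 2 (gradient): the margin term contributes
            # -1/2 a^T (Y * K_k) a per weight (SimpleMKL-style), the MMD
            # term contributes lam * s^T K_k s.
            grad = np.array([-0.5 * alpha @ (Y * Kk[tr, tr]) @ alpha
                             + lam * (s @ Kk @ s) for Kk in Ks])
            d = np.clip(d - lr * grad, 1e-8, None)
            d /= d.sum()  # crude projection back onto the simplex
        # Refit on the final combined kernel and predict the test points.
        K = sum(w * Kk for w, Kk in zip(d, Ks))
        svm = SVC(C=1.0, kernel="precomputed").fit(K[tr, tr], y_train)
        return d, svm.predict(K[n:, tr])

    # Toy usage on a covariate-shifted split (illustrative only).
    rng = np.random.default_rng(0)
    X_tr = rng.normal(0.0, 1.0, (80, 2))
    y_tr = np.where(X_tr[:, 0] > 0, 1, -1)
    X_te = rng.normal(0.8, 1.0, (40, 2))  # shifted test distribution
    weights, preds = mkl_mmd_fit(X_tr, y_tr, X_te)

Because MMD^2 is linear in the kernel weights, the discrepancy term only adds a constant vector to each gradient step, which is why this kind of alternating gradient-descent/QP scheme stays cheap relative to plain multiple kernel learning.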

References

  1. Bach, F.R., Lanckriet, G.R.G., Jordan, M.I.: Multiple kernel learning, conic duality, and the SMO algorithm. In: ICML (2004)

  2. Chapelle, O., Vapnik, V., Bousquet, O., Mukherjee, S.: Choosing multiple parameters for support vector machines. Machine Learning 46(1-3), 131–159 (2002)

  3. Cortes, C., Mohri, M., Riley, M., Rostamizadeh, A.: Sample selection bias correction theory. In: Freund, Y., Györfi, L., Turán, G., Zeugmann, T. (eds.) ALT 2008. LNCS (LNAI), vol. 5254, pp. 38–53. Springer, Heidelberg (2008)

  4. Fan, W., Davidson, I.: On sample selection bias and its efficient correction via model averaging and unlabeled examples. In: SDM (2007)

  5. Huang, J., Smola, A.J., Gretton, A., Borgwardt, K.M., Schölkopf, B.: Correcting sample selection bias by unlabeled data. In: NIPS, pp. 601–608 (2006)

  6. Lanckriet, G.R.G., Cristianini, N., Bartlett, P.L., Ghaoui, L.E., Jordan, M.I.: Learning the kernel matrix with semi-definite programming. In: ICML, pp. 323–330 (2002)

  7. Pan, S.J., Kwok, J.T., Yang, Q.: Transfer learning via dimensionality reduction. In: AAAI, pp. 677–682 (2008)

  8. Pan, S.J., Yang, Q.: A survey on transfer learning. Tech. Rep. HKUST-CS08-08, Department of Computer Science and Engineering, Hong Kong University of Science and Technology, Hong Kong, China (November 2008), http://www.cse.ust.hk/~sinnopan/publications/TLsurvey_0822.pdf

  9. Quanz, B., Huan, J.: Large margin transductive transfer learning. In: CIKM, pp. 1327–1336 (2009)

  10. Rakotomamonjy, A., Bach, F., Canu, S., Grandvalet, Y.: More efficiency in multiple kernel learning. In: ICML, pp. 775–782 (2007)

  11. Sonnenburg, S., Rätsch, G., Schäfer, C., Schölkopf, B.: Large scale multiple kernel learning. Journal of Machine Learning Research 7, 1531–1565 (2006)

  12. Szafranski, M., Grandvalet, Y., Rakotomamonjy, A.: Composite kernel learning. In: ICML, pp. 1040–1047 (2008)

  13. Varma, M., Babu, B.R.: More generality in efficient multiple kernel learning. In: ICML, p. 134 (2009)

  14. Zhong, E., Fan, W., Peng, J., Zhang, K., Ren, J., Turaga, D.S., Verscheure, O.: Cross domain distribution adaptation via kernel mapping. In: KDD, pp. 1027–1036 (2009)

  15. Zhu, X., Khoshgoftaar, T.M., Davidson, I., Zhang, S.: Editorial: Special issue on mining low-quality data. Knowl. Inf. Syst. 11(2), 131–136 (2007)

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ren, J., Liang, Z., Hu, S. (2010). Multiple Kernel Learning Improved by MMD. In: Cao, L., Zhong, J., Feng, Y. (eds) Advanced Data Mining and Applications. ADMA 2010. Lecture Notes in Computer Science (LNAI), vol. 6441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17313-4_7

  • DOI: https://doi.org/10.1007/978-3-642-17313-4_7

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-17312-7

  • Online ISBN: 978-3-642-17313-4

  • eBook Packages: Computer Science (R0)
