Kernel Mean Matching with a Large Margin

Tan, Qi; Deng, Huifang; Yang, Pei

doi:10.1007/978-3-642-35527-1_19

Qi Tan^22,23,
Huifang Deng²² &
Pei Yang²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7713))

Included in the following conference series:

International Conference on Advanced Data Mining and Applications

3509 Accesses
1 Citations

Abstract

Various instance weighting methods have been proposed for instance-based transfer learning. Kernel Mean Matching (KMM) is one of the typical instance weighting approaches which estimates the instance importance by matching the two distributions in the universal reproducing kernel Hilbert space (RKHS). However, KMM is an unsupervised learning approach which does not utilize the class label knowledge of the source data. In this paper, we extended KMM by leveraging the class label knowledge and integrated KMM and SVM into an unified optimization framework called KMM-LM (Large Margin). The objective of KMM-LM is to maximize the geometric soft margin, and minimize the empirical classification error together with the domain discrepancy based on KMM simultaneously. KMM-LM utilizes an iterative minimization algorithm to find the optimal weight vector of the classification decision hyperplane and the importance weight vector of the instances in the source domain. The experiments show that KMM-LM outperforms the state-of-the-art baselines.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Sarinnapakorn, K., Kubat, M.: Combining Sub-classifiers in Text Categorization: A DST-Based Solution and a Case Study. IEEE Transactions on Knowledge and Data Engineering 19(12), 1638–1651 (2007)
Article Google Scholar
Blitzer, J., Dredze, M., Pereira, F.: Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification. In: ACL 2007, pp. 440–447 (2007)
Google Scholar
Blitzer, J., Kakade, S., Foster, D.P.: Domain Adaptation with Coupled Subspaces. In: AISTATS 2011, pp. 173–181 (2011)
Google Scholar
Pan, W., Xiang, E.W., Liu, N.N., Yang, Q.: Transfer Learning in Collaborative Filtering for Sparsity Reduction. In: AAAI 2010, pp. 230–235 (2010)
Google Scholar
Ma, H., Zhou, D., Liu, C., Lyu, M.R., King, I.: Recommender Systems with Social Regularization. In: WSDM 2011, pp. 287–296 (2011)
Google Scholar
Gao, W., Cai, P., Wong, K.-F., Zhou, A.: Learning to Rank only using Training Data from related Domain. In: SIGIR 2010, pp. 162–169 (2010)
Google Scholar
Huang, J., Smola, A.J., Gretton, A., Borgwardt, K.M., Schölkopf, B.: Correcting Sample Selection Bias by Unlabeled Data. In: NIPS 2006, pp. 601–608 (2006)
Google Scholar
Joachims, T.: Transductive Inference for Text Classification using Support Vector Machines. In: IMCL 1999, pp. 200–209 (1999)
Google Scholar
Pan, S.J., Yang, Q.: A Survey on Transfer Learning. IEEE Transactions on Knowledge and Data Engineering 22(10), 1345–1359 (2010)
Article Google Scholar
Jiang, J., Zhai, C.: Instance Weighting for Domain Adaptation in NLP. In: ACL 2007, pp. 264–271 (2007)
Google Scholar
Dai, W., Yang, Q., Xue, G., Yu, Y.: Boosting for Transfer Learning. In: ICML 2007, pp. 193–200 (2007)
Google Scholar
Dai, W., Xue, G., Yang, Q., Yu, Y.: Co-clustering Based Classification for Out-of-domain Documents. In: KDD 2007, pp. 210–219 (2007)
Google Scholar
Dayanik, A.A., Lewis, D.D., Madigan, D., Menkov, V., Genkin, A.: Constructing Informative Prior Distributions from Domain Knowledge in Text Classification. In: SIGIR 2006, pp. 493–500 (2006)
Google Scholar
Kanamori, T., Suzuki, T., Sugiyama, M.: Statistical Analysis of Kernel-based Least-squares Density-ratio Estimation. Machine Learning (ML) 86(3), 335–367 (2012)
Article MathSciNet MATH Google Scholar
Sugiyama, M., Nakajima, S., Kashima, H., von Bnau, P., Kawanabe, M.: Direct Importance Estimation with Model Selection and Its Application to Covariate Shift Adaptation. In: NIPS 2007 (2007)
Google Scholar
Tsuboi, Y., Kashima, H., Hido, S., Bickel, S., Sugiyama, M.: Direct Density Ratio Estimation for Large-scale Covariate Shift Adaptation. In: SDM 2008, pp. 443–454 (2008)
Google Scholar
Kanamori, T., Hido, S., Sugiyama, M.: A Least-squares Approach to Direct Importance Estimation. Journal of Machine Learning Research (JMLR) 10, 1391–1445 (2009)
MathSciNet MATH Google Scholar
Bickel, S., Brckner, M., Scheffer, T.: Discriminative Learning for Differing Training and Test Distributions. In: ICML 2007, pp. 81–88 (2007)
Google Scholar
Sun, Q., Chattopadhyay, R., Panchanathan, S., Ye, J.: A Two-Stage Weighting Framework for Multi-Source Domain Adaptation. In: NIPS 2011, pp. 505–513 (2011)
Google Scholar
McCallum, A.K., Nigam, K., Rennie, J., Seymore, K.: Automating the Construction of Internet Portals with Machine Learning. Information Retrieval 3(2), 127–163 (2000)
Article Google Scholar
Lewis, D.D.: Reuters-21578 Test Collection, http://www.daviddlewis.com/
Salton, G., Buckley, C.: Term-weighting Approaches in Automatic Text Retrieval. Information Processing & Management 24(5), 513–523 (1988)
Article Google Scholar

Download references

Author information

Authors and Affiliations

South China University of Technology, Guangzhou, China
Qi Tan, Huifang Deng & Pei Yang
South China Normal University, Guangzhou, China
Qi Tan

Authors

Qi Tan
View author publications
You can also search for this author in PubMed Google Scholar
Huifang Deng
View author publications
You can also search for this author in PubMed Google Scholar
Pei Yang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computer Science, Fudan University, Handan Road 220, 200433, Shanghai, China
Shuigeng Zhou
Chinese Academy of Sciences, Academy of Mathematics and Systems Science, Dongguancun East Road 55, 100190, Beijing, China
Songmao Zhang
Department of Computer Science and Engineering, University of Minnesota, Union Street SE 200, 55455, Minneapolis, MN, USA
George Karypis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tan, Q., Deng, H., Yang, P. (2012). Kernel Mean Matching with a Large Margin. In: Zhou, S., Zhang, S., Karypis, G. (eds) Advanced Data Mining and Applications. ADMA 2012. Lecture Notes in Computer Science(), vol 7713. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35527-1_19

Download citation

DOI: https://doi.org/10.1007/978-3-642-35527-1_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35526-4
Online ISBN: 978-3-642-35527-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics