A Coupled Similarity Kernel for Pairwise Support Vector Machine

Li, Mu; Li, Jinjiu; Ou, Yuming; Cao, Longbing

doi:10.1007/978-3-319-20230-3_10

Mu Li¹¹,
Jinjiu Li¹¹,
Yuming Ou¹¹ &
…
Longbing Cao¹¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9145))

Included in the following conference series:

International Workshop on Agents and Data Mining Interaction

466 Accesses

Abstract

Support vector machine is a supervised learning model with associated learning algorithms that analyzes data and recognizes patterns. In various applications, the SVM shows its advantage of the classification performance, however, the original SVM was designed for the numerical data. For using the SVM on the nominal data, most previous research used a certain number to replace each nominal value or transformed the nominal value into the one hot vector. Both methods could not present the original nominal data’s structure and the similarity between them, which leads to information loss from the data and reduce the classification performance. In this work, we design a novel coupled similarity metric between nominally attributed data. This metric is pairwise, we also propose an adapted SVM which can handle this. The experiment result shows the proposed method outperforms the traditional SVM and other popular classification methods on various public data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 34.99; Price excludes VAT (USA)

Softcover Book: USD 44.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Abernethy, J., Bach, F., Evgeniou, T., Vert, J.-P.: A new approach to collaborative filtering: operator estimation with spectral regularization. J. Mach. Learn. Res. 10, 803–826 (2009)
MATH Google Scholar
Ahmad, A., Dey, L.: A k-mean clustering algorithm for mixed numeric and categorical data. Data Knowl. Eng. 63, 503–527 (2007)
Article Google Scholar
Bar-Hillel, A., Hertz, T., Shental, N., Weinshall, D.: Learning a mahalanobis metric from equivalence constraints. J. Mach. Learn. Res. 6(6), 937–965 (2005)
MATH MathSciNet Google Scholar
Ben-Hur, A., Noble, W.S.: Kernel methods for predicting protein-protein interactions. Bioinformatics 21(suppl 1), i38–i46 (2005)
Article Google Scholar
Boriah, S., Chandola, V., Kumar, V.: Similarity measures for categorical data: a comparative evaluation. In: Proceedings of the 8th SIAM International Conference on Data Mining, pp. 243–254 (2008)
Google Scholar
Cao, L.: In-depth behavior understanding and use: the behavior informatics approach. Inf. Sci. 180(17), 3067–3085 (2010)
Article Google Scholar
Brunner, C., Fischer, A., Luig, K., Thies, T.: Pairwise support vector machines and their application to large scale problems. J. Mach. Learn. Res. 13, 2279–2292 (2012)
MATH MathSciNet Google Scholar
Cao, L., Philip, S.Y.: Behavior Computing. Modeling, Analysis, Mining and Decision. Springer, London (2012)
Book Google Scholar
Cost, S., Salzberg, S.: A weighted nearest neighbor algorithm for learning with symbolic features. Mach. Learn. 10(1), 57–78 (1993)
Google Scholar
Cao, L., Gorodetsky, V., Mitkas, P.: Agent mining: the synergy of agents and data mining. IEEE Intell. Syst. 24(3), 64–72 (2009)
Article Google Scholar
Cao, L.: Non-IIDness learning in behavioral and social data. Comput. J. 57(9), 1358–1370 (2014)
Article Google Scholar
Cao, L., Ou, Y., Yu, P.S.: Coupled behavior analysis with applications. IEEE Trans. Knowl. Data Eng. 24(8), 1378–1392 (2012)
Article Google Scholar
Cao, L., Ou, Y., Yu, P.S., Wei, G.: Detecting abnormal coupled sequences and sequence changes in group-based manipulative trading behaviors. In: KDD2010, pp. 85–94 (2010)
Google Scholar
Cao, L., Zhang, H., Zhao, Y., Luo, D., Zhang, C.: Combined mining: discovering informative knowledge in complex data. IEEE Trans. SMC Part B 41(3), 699–712 (2011)
Google Scholar
Cao, L., Zhao, Y., Zhang, C.: Mining impact-targeted activity patterns in imbalanced data. IEEE Trans. Knowl. Data Eng. 20(8), 1053–1066 (2008)
Article Google Scholar
Das, G., Mannila, H.: Context-based similarity measures for categorical databases. In: Zighed, D.A., Komorowski, J., Żytkow, J.M. (eds.) PKDD 2000. LNCS (LNAI), vol. 1910, pp. 201–210. Springer, Heidelberg (2000)
Chapter Google Scholar
Duan, K.-B., Keerthi, S.S.: Which is the best multiclass SVM method? an empirical study. In: Oza, N.C., Polikar, R., Kittler, J., Roli, F. (eds.) MCS 2005. LNCS, vol. 3541, pp. 278–285. Springer, Heidelberg (2005)
Chapter Google Scholar
Gan, G., Ma, C., Wu, J.: Data Clustering: Theory, Algorithms, and Applications. ASA-SIAM Series on Statistics and Applied Probability, Philadelphia (2007)
Book Google Scholar
Cao, L.: Coupling learning of complex interactions. Inf. Process. Manage. 51(2), 167–186 (2015)
Article Google Scholar
Hill, S.I., Doucet, A.: A framework for kernel-based multi-category classification. J. Artif. Intell. Res. (JAIR) 30, 525–564 (2007)
MATH MathSciNet Google Scholar
Hsu, C.-W., Lin, C.-J.: A comparison of methods for multiclass support vector machines. IEEE Trans. Neural Netw. 13(2), 415–425 (2002)
Article Google Scholar
Kulis, B., Basu, S., Dhillon, I., Mooney, R.: Semi-supervised graph clustering: a kernel approach. Mach. Learn. 74(1), 1–22 (2009)
Article Google Scholar
Phillips, P.J. et al.: Support vector machines applied to face recognition, vol. 285. Citeseer (1998)
Google Scholar
Cao, L.: Combined mining: analyzing object and pattern relations for discovering and constructing complex but actionable patterns. WIREs Data Min. Knowl. Discovery 3(2), 140–155 (2013)
Article Google Scholar
Rapaport, F., Barillot, E., Vert, J.-P.: Classification of arrayCGH data using fused SVM. Bioinformatics 24(13), i375–i382 (2008)
Article Google Scholar
Rifkin, R., Klautau, A.: In defense of one-vs-all classification. J. Mach. Learn. Res. 5, 101–141 (2004)
MATH MathSciNet Google Scholar
Wang, C., Cao, L. et al.: Coupled nominal similarity in unsupervised learning. In: Proceedings of CIKM2011, pp. 973–978 (2011)
Google Scholar
Wang, C., She, Z., Cao, L.: Coupled clustering ensemble: incorporating coupling relationships both between base clusterings and objects. In: Proceedings of ICDE2013 (2013)
Google Scholar
Wang, C., She, Z., Cao, L.: Coupled attribute analysis on numerical data. In: Proceedings of IJCAI2013 (2013)
Google Scholar
Wilson, D.R., Martinez, T.R.: Improved heterogeneous distance functions. J. Artif. Intell. Res. 6, 1–34 (1997)
MATH MathSciNet Google Scholar

Download references

Author information

Authors and Affiliations

University of Technology, Sydney, NSW, 2018, Australia
Mu Li, Jinjiu Li, Yuming Ou & Longbing Cao

Authors

Mu Li
View author publications
You can also search for this author in PubMed Google Scholar
Jinjiu Li
View author publications
You can also search for this author in PubMed Google Scholar
Yuming Ou
View author publications
You can also search for this author in PubMed Google Scholar
Longbing Cao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mu Li .

Editor information

Editors and Affiliations

University of Technology Sydney, Sydney, New South Wales, Australia
Longbing Cao
Teesside University, Middlesborough, United Kingdom
Yifeng Zeng
Nanyang Technological University, Singapore, Singapore
Bo An
Aristotle University of Thessaloniki, Thessaloniki, Greece
Andreas L. Symeonidis
Russian Academy of Sciences, St. Petersburg, Russia
Vladimir Gorodetsky
University of Liverpool, Liverpool, United Kingdom
Frans Coenen
University of Illinois at Chicago, Chicago, Illinois, USA
Philip S. Yu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, M., Li, J., Ou, Y., Cao, L. (2015). A Coupled Similarity Kernel for Pairwise Support Vector Machine. In: Cao, L., et al. Agents and Data Mining Interaction. ADMI 2014. Lecture Notes in Computer Science(), vol 9145. Springer, Cham. https://doi.org/10.1007/978-3-319-20230-3_10

Download citation

DOI: https://doi.org/10.1007/978-3-319-20230-3_10
Published: 05 June 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-20229-7
Online ISBN: 978-3-319-20230-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics