Clustering Classifiers Learnt from Local Datasets Based on Cosine Similarity

Zhao, Kaikai; Suzuki, Einoshin

doi:10.1007/978-3-319-25252-0_16

Kaikai Zhao¹⁸ &
Einoshin Suzuki¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9384))

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

738 Accesses
1 Citations

Abstract

In this paper we present a new method to measure the degree of dissimilarity of a pair of linear classifiers. This method is based on the cosine similarity between the normal vectors of the hyperplanes of the linear classifiers. A significant advantage of this method is that it has a good interpretation and requires very little information to exchange among datasets. Evaluations on a synthetic dataset, a dataset from the UCI Machine Learning Repository, and facial expression datasets show that our method outperforms previous methods in terms of the normalized mutual information.

E. Suzuki—A part of this research was supported by Grant-in-Aid for Scientific Research 25280085 and 15K12100 from the Japanese Ministry of Education, Culture, Sports, Science and Technology.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The same risk exists for relying on the density in the example space.
2.
http://www.microsoft.com/en-us/kinectforwindows/.
3.
http://msdn.microsoft.com/en-us/library/jj130970.aspx.

References

Ben-David, S., Schuller, R.: Exploiting task relatedness for multiple task learning. In: Schölkopf, B., Warmuth, M.K. (eds.) COLT/Kernel 2003. LNCS (LNAI), vol. 2777, pp. 567–580. Springer, Heidelberg (2003)
Chapter Google Scholar
Pedersen, T., Pakhomov, S.V.S., Patwardhan, S., Chute, C.G.: Measures of Semantic Similarity and Relatedness in the Biomedical Domain. J. Biomed. Inf. 40(3), 288–299 (2007)
Article Google Scholar
Li, Y., Tian, X., Song, M., Tao, D.: Multi-task proximal support vector machine. Pattern Recogn. 48(10), 3249–3257 (2015)
Article Google Scholar
Tsoumakas, G., Angelis, L., Vlahavas, I.P.: Clustering classifiers for knowledge discovery from physically distributed databases. Data Knowl. Eng. 49(3), 223–242 (2004)
Article Google Scholar
Jacob, L., Bach, F.R., Vert, J.P.: Clustered multi-task learning: a convex formulation. In: NIPS 2008, pp. 745–752 (2009)
Google Scholar
Parthasarathy, S., Ogihara, M.: Clustering distributed homogeneous datasets. In: Zighed, D.A., Komorowski, J., Żytkow, J.M. (eds.) PKDD 2000. LNCS (LNAI), vol. 1910, pp. 566–574. Springer, Heidelberg (2000)
Chapter Google Scholar
McClean, S.I., Scotney, B.W., Morrow, P.J., Greer, K.: Knowledge discovery by probabilistic clustering of distributed databases. Data Knowl. Eng. 54(2), 189–210 (2005)
Article Google Scholar
Chen, R., Sivakumar, K., Kargupta, H.: Collective mining of Bayesian networks from distributed heterogeneous data. Knowl. Inf. Syst. 6(2), 164–187 (2004)
Article Google Scholar
Flores, M.J., Gmez, J.A., Martnez, A.M.: Meta-prediction of semi-naive Bayesian network classifiers based on dataset complexity characterization. In: Proceedings of Sixth European Workshop on Probabilistic Graphical Models, pp. 107–114 (2012)
Google Scholar
Ho, T.K., Basu, M.: Complexity measures of supervised classification problems. IEEE Trans. Pattern Anal. Mach. Intell. 24(3), 289–300 (2002)
Article Google Scholar
Thrun, S., O’Sullivan, J.: Clustering learning tasks and the selective cross-task transfer of knowledge. In: Thrun, S., Pratt, L. (eds.) Learning To Learn, pp. 235–257. Kluwer, New York (1998)
Chapter Google Scholar
Evgeniou, T., Micchelli, C.A., Pontil, M.: Learning multiple tasks with kernel methods. J. Mach. Learn. Res. 6, 615–637 (2005)
MathSciNet MATH Google Scholar
Xue, Y., Liao, X., Carin, L., Krishnapuram, B.: Multi-task learning for classification with Dirichlet process priors. J. Mach. Learn. Res. 8, 35–63 (2007)
MathSciNet MATH Google Scholar
Singhal, A.: Modern information retrieval: a brief overview. IEEE Data Eng. Bull. 24(4), 35–43 (2001)
Google Scholar
Hosmer, D.W., Lemeshow, S.: Introduction to the logistic regression model. In: Applied Logistic Regression, 2 edn., pp. 1–30. Wiley (2005)
Google Scholar
Lichman, M.: UCI Machine Learning Repository. University of California, Irvine, School of Information and Computer Sciences (2013). http://archive.ics.uci.edu/ml
Erna, A., Yu, L., Zhao, K., Chen, W., Suzuki, E.: Facial expression data constructed with Kinect and their clustering stability. In: Ślȩzak, D., Schaefer, G., Vuong, S.T., Kim, Y.-S. (eds.) AMT 2014. LNCS, vol. 8610, pp. 421–431. Springer, Heidelberg (2014)
Google Scholar
Scott, P.D., Wilkins, E.: Evaluating data mining procedures: techniques for generating artificial data sets. Inf. Softw. Technol. 41(9), 579–587 (1999)
Article Google Scholar
Widmer, G., Kubat, M.: Learning in the presence of concept drift and hidden contexts. Mach. Learn. 23(1), 69–101 (1996)
Google Scholar
Sebe, N., Lew, M., Sun, Y., Cohen, I., Geners, T., Huang, T.: Authentic facial expression analysis. Image Vis. Comput. 25(12), 1856–1863 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Systems Life Sciences, Kyushu University, Fukuoka, Japan
Kaikai Zhao
Department of Informatics, ISEE, Kyushu University, Fukuoka, Japan
Einoshin Suzuki

Authors

Kaikai Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Einoshin Suzuki
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kaikai Zhao .

Editor information

Editors and Affiliations

Computer Science, University of Bari, Bari, Italy
Floriana Esposito
Enssat, Lannion, France
Olivier Pivert
LISI-UFR d'Informatique, Université Claude Bernard Lyon 1, Villeurbanne Cedex, France
Mohand-Said Hacid
University of North Carolina, CHARLOTTE, North Carolina, USA
Zbigniew W. Rás
Dipartimento di Informatica, Università degli Studi di Bari, Bari, Italy
Stefano Ferilli

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhao, K., Suzuki, E. (2015). Clustering Classifiers Learnt from Local Datasets Based on Cosine Similarity. In: Esposito, F., Pivert, O., Hacid, MS., Rás, Z., Ferilli, S. (eds) Foundations of Intelligent Systems. ISMIS 2015. Lecture Notes in Computer Science(), vol 9384. Springer, Cham. https://doi.org/10.1007/978-3-319-25252-0_16

Download citation

DOI: https://doi.org/10.1007/978-3-319-25252-0_16
Published: 30 December 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25251-3
Online ISBN: 978-3-319-25252-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics