Skip to main content

Clustering Classifiers Learnt from Local Datasets Based on Cosine Similarity

  • Conference paper
  • First Online:
Foundations of Intelligent Systems (ISMIS 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9384))

Included in the following conference series:

Abstract

In this paper we present a new method to measure the degree of dissimilarity of a pair of linear classifiers. This method is based on the cosine similarity between the normal vectors of the hyperplanes of the linear classifiers. A significant advantage of this method is that it has a good interpretation and requires very little information to exchange among datasets. Evaluations on a synthetic dataset, a dataset from the UCI Machine Learning Repository, and facial expression datasets show that our method outperforms previous methods in terms of the normalized mutual information.

E. Suzuki—A part of this research was supported by Grant-in-Aid for Scientific Research 25280085 and 15K12100 from the Japanese Ministry of Education, Culture, Sports, Science and Technology.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The same risk exists for relying on the density in the example space.

  2. 2.

    http://www.microsoft.com/en-us/kinectforwindows/.

  3. 3.

    http://msdn.microsoft.com/en-us/library/jj130970.aspx.

References

  1. Ben-David, S., Schuller, R.: Exploiting task relatedness for multiple task learning. In: Schölkopf, B., Warmuth, M.K. (eds.) COLT/Kernel 2003. LNCS (LNAI), vol. 2777, pp. 567–580. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  2. Pedersen, T., Pakhomov, S.V.S., Patwardhan, S., Chute, C.G.: Measures of Semantic Similarity and Relatedness in the Biomedical Domain. J. Biomed. Inf. 40(3), 288–299 (2007)

    Article  Google Scholar 

  3. Li, Y., Tian, X., Song, M., Tao, D.: Multi-task proximal support vector machine. Pattern Recogn. 48(10), 3249–3257 (2015)

    Article  Google Scholar 

  4. Tsoumakas, G., Angelis, L., Vlahavas, I.P.: Clustering classifiers for knowledge discovery from physically distributed databases. Data Knowl. Eng. 49(3), 223–242 (2004)

    Article  Google Scholar 

  5. Jacob, L., Bach, F.R., Vert, J.P.: Clustered multi-task learning: a convex formulation. In: NIPS 2008, pp. 745–752 (2009)

    Google Scholar 

  6. Parthasarathy, S., Ogihara, M.: Clustering distributed homogeneous datasets. In: Zighed, D.A., Komorowski, J., Żytkow, J.M. (eds.) PKDD 2000. LNCS (LNAI), vol. 1910, pp. 566–574. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  7. McClean, S.I., Scotney, B.W., Morrow, P.J., Greer, K.: Knowledge discovery by probabilistic clustering of distributed databases. Data Knowl. Eng. 54(2), 189–210 (2005)

    Article  Google Scholar 

  8. Chen, R., Sivakumar, K., Kargupta, H.: Collective mining of Bayesian networks from distributed heterogeneous data. Knowl. Inf. Syst. 6(2), 164–187 (2004)

    Article  Google Scholar 

  9. Flores, M.J., Gmez, J.A., Martnez, A.M.: Meta-prediction of semi-naive Bayesian network classifiers based on dataset complexity characterization. In: Proceedings of Sixth European Workshop on Probabilistic Graphical Models, pp. 107–114 (2012)

    Google Scholar 

  10. Ho, T.K., Basu, M.: Complexity measures of supervised classification problems. IEEE Trans. Pattern Anal. Mach. Intell. 24(3), 289–300 (2002)

    Article  Google Scholar 

  11. Thrun, S., O’Sullivan, J.: Clustering learning tasks and the selective cross-task transfer of knowledge. In: Thrun, S., Pratt, L. (eds.) Learning To Learn, pp. 235–257. Kluwer, New York (1998)

    Chapter  Google Scholar 

  12. Evgeniou, T., Micchelli, C.A., Pontil, M.: Learning multiple tasks with kernel methods. J. Mach. Learn. Res. 6, 615–637 (2005)

    MathSciNet  MATH  Google Scholar 

  13. Xue, Y., Liao, X., Carin, L., Krishnapuram, B.: Multi-task learning for classification with Dirichlet process priors. J. Mach. Learn. Res. 8, 35–63 (2007)

    MathSciNet  MATH  Google Scholar 

  14. Singhal, A.: Modern information retrieval: a brief overview. IEEE Data Eng. Bull. 24(4), 35–43 (2001)

    Google Scholar 

  15. Hosmer, D.W., Lemeshow, S.: Introduction to the logistic regression model. In: Applied Logistic Regression, 2 edn., pp. 1–30. Wiley (2005)

    Google Scholar 

  16. Lichman, M.: UCI Machine Learning Repository. University of California, Irvine, School of Information and Computer Sciences (2013). http://archive.ics.uci.edu/ml

  17. Erna, A., Yu, L., Zhao, K., Chen, W., Suzuki, E.: Facial expression data constructed with Kinect and their clustering stability. In: Ślȩzak, D., Schaefer, G., Vuong, S.T., Kim, Y.-S. (eds.) AMT 2014. LNCS, vol. 8610, pp. 421–431. Springer, Heidelberg (2014)

    Google Scholar 

  18. Scott, P.D., Wilkins, E.: Evaluating data mining procedures: techniques for generating artificial data sets. Inf. Softw. Technol. 41(9), 579–587 (1999)

    Article  Google Scholar 

  19. Widmer, G., Kubat, M.: Learning in the presence of concept drift and hidden contexts. Mach. Learn. 23(1), 69–101 (1996)

    Google Scholar 

  20. Sebe, N., Lew, M., Sun, Y., Cohen, I., Geners, T., Huang, T.: Authentic facial expression analysis. Image Vis. Comput. 25(12), 1856–1863 (2007)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kaikai Zhao .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhao, K., Suzuki, E. (2015). Clustering Classifiers Learnt from Local Datasets Based on Cosine Similarity. In: Esposito, F., Pivert, O., Hacid, MS., Rás, Z., Ferilli, S. (eds) Foundations of Intelligent Systems. ISMIS 2015. Lecture Notes in Computer Science(), vol 9384. Springer, Cham. https://doi.org/10.1007/978-3-319-25252-0_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25252-0_16

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25251-3

  • Online ISBN: 978-3-319-25252-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics