Cascade RSVM in Peer-to-Peer Networks

Ang, Hock Hee; Gopalkrishnan, Vivekanand; Hoi, Steven C. H.; Ng, Wee Keong

doi:10.1007/978-3-540-87479-9_22

Cascade RSVM in Peer-to-Peer Networks

Hock Hee Ang¹,
Vivekanand Gopalkrishnan¹,
Steven C. H. Hoi¹ &
…
Wee Keong Ng¹

Conference paper

5696 Accesses
14 Citations

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5211))

Abstract

The goal of distributed learning in P2P networks is to achieve results as close as possible to those from centralized approaches. Learning models of classification in a P2P network faces several challenges like scalability, peer dynamism, asynchronism and data privacy preservation. In this paper, we study the feasibility of building SVM classifiers in a P2P network. We show how cascading SVM can be mapped to a P2P network of data propagation. Our proposed P2P SVM provides a method for constructing classifiers in P2P networks with classification accuracy comparable to centralized classifiers and better than other distributed classifiers. The proposed algorithm also satisfies the characteristics of P2P computing and has an upper bound on the communication overhead. Extensive experimental results confirm the feasibility and attractiveness of this approach.

Download to read the full chapter text

Chapter PDF

References

Asuncion, A., Newman, D.: UCI machine learning repository (2007)
Google Scholar
Breiman, L.: Pasting small votes for classification in large databases and on-line. Machine Learning 36(1-2), 85–103 (1999)
Article Google Scholar
Chan, P., Stolfo, S.: Toward parallel and distributed learning by meta-learning. In: AAAI Workshop in Knowledge Discovery in Databases, pp. 227–240 (1993)
Google Scholar
Chang, C.-C., Lin, C.-J.: LIBSVM: a library for support vector machines (2001), http://www.csie.ntu.edu.tw/~cjlin/libsvm
Chawla, N.V., Hall, L.O., Bowyer, K.W., Moore, T.E., Kegelmeyer, W.P.: Distributed pasting of small votes. In: Multiple Classifier Systems, pp. 52–61 (2002)
Google Scholar
Datta, S., Bhaduri, K., Giannella, C., Wolff, R., Kargupta, H.: Distributed data mining in peer-to-peer networks. IEEE Internet Computing, Special issue on Distributed Data Mining 10(4), 18–26 (2006)
Google Scholar
Džeroski, S., Ženko, B.: Is combining classifiers with stacking better than selecting the best one? Machine Learning 54(3), 255–273 (2004)
Article MATH Google Scholar
Gorodetskiy, V., Karsaev, O., Samoilov, V., Serebryakov, S.: Agent-based service-oriented intelligent P2P networks for distributed classification. In: International Conference on Hybrid Information Technology, pp. 224–233 (2006)
Google Scholar
Graf, H.P., Cosatto, E., Bottou, L., Dourdanovic, I., Vapnik, V.: Parallel support vector machines: The cascade SVM. In: NIPS (2004)
Google Scholar
Hoi, S.C.H., Jin, R., Zhu, J., Lyu, M.R.: Batch mode active learning and its application to medical image classification. In: ICML, pp. 417–424 (2006)
Google Scholar
Hoi, S.C.H., Lyu, M.R., Chang, E.Y.: Learning the unified kernel machines for classification. In: SIGKDD, pp. 187–196 (2006)
Google Scholar
Lazarevic, A., Obradovic, Z.: Boosting algorithms for parallel and distributed learning. Distributed and Parallel Databases 11(2), 203–229 (2002)
Article MATH Google Scholar
Lee, Y., Mangasarian, O.: RSVM: Reduced support vector machines. In: SIAM International Conference on Data Mining, pp. 00–07 (2001)
Google Scholar
Lin, K., Lin, C.: A study on reduced support vector machines. IEEE Transactions on Neural Networks 14(6), 1449–1459 (2003)
Article Google Scholar
Lu, B., Wang, K., Wen, Y.: Comparison of parallel and cascade methods for training support vector machines on large-scale problems. In: International Conference on Machine Learning and Cybernetics, pp. 3056–3061 (2004)
Google Scholar
Luo, P., Xiong, H., Lü, K., Shi, Z.: Distributed classification in peer-to-peer networks. In: SIGKDD, pp. 968–976 (2007)
Google Scholar
Pfahringer, B., Bensusan, H., Giraud-Carrier, C.G.: Meta-learning by landmarking various learning algorithms. In: ICML, pp. 743–750 (2000)
Google Scholar
Siersdorfer, S., Sizov, S.: Automatic document organization in a P2P environment. In: European Conference on IR Research, pp. 265–276 (2006)
Google Scholar
Tveit, A., Engum, H.: Parallelization of the incremental proximal support vector machine classifier using a heap-based tree topology. Technical report, IDI, NTNU, Trondheim, Norway (2003)
Google Scholar
Vapnik, V.N.: The nature of statistical learning theory. Springer, New York (1995)
MATH Google Scholar
Wang, Z., Das, S.K., Kumar, M., Shen, H.: An efficient update propagation algorithm for P2P systems. Computer Communications 30(5), 1106–1115 (2007)
Article Google Scholar
Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
MATH Google Scholar
Zhang, J., Li, Z., Yang, J.: A parallel SVM training algorithm on large-scale classification problems. In: International Conference on Machine Learning and Cybernetics, pp. 1637–1641 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Nanyang Technological University, Singapore
Hock Hee Ang, Vivekanand Gopalkrishnan, Steven C. H. Hoi & Wee Keong Ng

Authors

Hock Hee Ang
View author publications
You can also search for this author in PubMed Google Scholar
Vivekanand Gopalkrishnan
View author publications
You can also search for this author in PubMed Google Scholar
Steven C. H. Hoi
View author publications
You can also search for this author in PubMed Google Scholar
Wee Keong Ng
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Walter Daelemans Bart Goethals Katharina Morik

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ang, H.H., Gopalkrishnan, V., Hoi, S.C.H., Ng, W.K. (2008). Cascade RSVM in Peer-to-Peer Networks. In: Daelemans, W., Goethals, B., Morik, K. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2008. Lecture Notes in Computer Science(), vol 5211. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87479-9_22

Download citation

DOI: https://doi.org/10.1007/978-3-540-87479-9_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87478-2
Online ISBN: 978-3-540-87479-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics