Semi-supervised Affinity Propagation Clustering Based on Subtractive Clustering for Large-Scale Data Sets

Zhu, Qi; Zhang, Huifu; Yang, Quanqin

doi:10.1007/978-3-662-46248-5_32

Qi Zhu¹⁸,
Huifu Zhang¹⁸ &
Quanqin Yang¹⁸

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 503))

Included in the following conference series:

International Conference of Young Computer Scientists, Engineers and Educators

2003 Accesses

Abstract

In the face of a growing number of large-scale data sets, affinity propagation clustering algorithm to calculate the process required to build the similarity matrix, will bring huge storage and computation. Therefore, this paper proposes an improved affinity propagation clustering algorithm. First, add the subtraction clustering, using the density value of the data points to obtain the point of initial clusters. Then, calculate the similarity distance between the initial cluster points, and reference the idea of semi-supervised clustering, adding pairs restriction information, structure sparse similarity matrix. Finally, the cluster representative points conduct AP clustering until a suitable cluster division. Experimental results show that the algorithm allows the calculation is greatly reduced, the similarity matrix storage capacity is also reduced, and better than the original algorithm on the clustering effect and processing speed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Demiriz, A., Benneit, K.P., Embrechts, M.J.: Semi-supervised clustering using genetic algorithm. In: Proc of Intelligent Engineering systems through Artificial Neural Networks, pp. 809–814 (1999)
Google Scholar
Liu, X., Yin, M., Luo, J.: An Improved Affinity Propagation Clustering Algorithm for Large-scale Data Sets. In: 2013 Ninth International Conference on Natural Computation, pp. 894–899 (2013)
Google Scholar
Zhang, X., Furtlehner, C., Germain-Renaud, C., Sebag, M.: Data Stream Clustering with Affinity Propagation. IEEE Transactions on Knowledge and Data Engineering 26(7), 1644–1656 (2014)
Article Google Scholar
Frey, B.J., Dueck, D.: clustering by passing messages between data points. Science 315(5814), 972–976 (2007)
Article MathSciNet MATH Google Scholar
Fu, Y.-D., Lan, J.-L.: Kernel-based adaptation for affinity propagation clustering algorithm. Application Research of Computers 29(5), 1644–1647 (2012)
Google Scholar
Li, X., Wang, L., Song, Y.: Parallel computation of semi-supervised clustering algorithm based on affinity propagation. Computer Engineering and Applications 47(7), 149–152 (2011)
Google Scholar
Feng, X.-L., Yu, H.-T.: Semi-supervised affinity propagation clustering based on manifold distance. Computer Engineering and Applications 28(10), 3656–3658 (2011)
Google Scholar
Jun, D., Suo-Ping, W., Fan-Lun, X.: Affinity Propagation Clustering Based on Variable-Similarity Measure. Journal of Electronics & Information Technology 32(3), 509–514 (2010)
Article Google Scholar
Chiu, S.L.: Fuzzy model identification based on cluster estimation. Journal of Intelligent and Fuzzy Systems 2(3), 267–278 (1994)
Google Scholar
Cai, W., Cheng, J.: Fuzzy Clustering Based on Subtractive Clustering. Lanzhou Jiao Tong University Learned Journal 30(6), 50–54 (2011)
Google Scholar
Nikhil, R.P., Chakraborty, D.: Mountain and subtractive clustering method: Improvements and generalizations [J]. International Journal of Intelligent Systems 15(4), 329–341 (2000)
Article MATH Google Scholar
Dash, M., Huan, L., Scheuermann, P., TanK, L.: Fast hierarchical clustering and its validation. Data & Knowledge Engineering 44, 109–138 (2003)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Engineering, Hunan University of Science and Technology, Xiangtan, 411201, China
Qi Zhu, Huifu Zhang & Quanqin Yang

Authors

Qi Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Huifu Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Quanqin Yang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Harbin Institute of Technology, Harbin, China
Hongzhi Wang & Wanxiang Che &
School of Computer Science and Technology, Heilongjiang Institute of Technology, Harbin, China
Haoliang Qi & Zhongyuan Han &
Northeast Forestry University, Harbin, China
Zhaowen Qiu
Heilongjiang Institute of Technology, Harbin, China
Leilei Kong
Harbin Engineering University, China
Junyu Lin
Zhongkeyunhai Company, Harbin, China
Zeguang Lu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhu, Q., Zhang, H., Yang, Q. (2015). Semi-supervised Affinity Propagation Clustering Based on Subtractive Clustering for Large-Scale Data Sets. In: Wang, H., et al. Intelligent Computation in Big Data Era. ICYCSEE 2015. Communications in Computer and Information Science, vol 503. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-46248-5_32

Download citation

DOI: https://doi.org/10.1007/978-3-662-46248-5_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-46247-8
Online ISBN: 978-3-662-46248-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics