Anomaly Detection Algorithm Based on Cluster of Entropy

Tan, Wenan; Fang, Xi; Zhao, Lu; Tang, Anqiong

doi:10.1007/978-981-13-3044-5_26

Wenan Tan^14,15,
Xi Fang¹⁴,
Lu Zhao¹⁴ &
…
Anqiong Tang¹⁵

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 917))

Included in the following conference series:

CCF Conference on Computer Supported Cooperative Work and Social Computing

822 Accesses

Abstract

To address the issue that the K-means algorithm chooses and determines the initial cluster center in a random way, which would fall into the local optimal clustering result, a way towards choosing the initial clustering center using information entropy is proposed. This proposed method divides the dataset evenly into data blocks with more than K, and then uses the entropy method to obtain the value of target function of each data block, as well as selects the centroid corresponding to the data block with the smallest value function of the first k target as the initial cluster center. By using entropy method to ensure the efficiency of the initial clustering center selection, an anomaly detection method is proposed. The result of the experiment show that this method performs better than the traditional K-means algorithm both in clustering effect and anomaly detection ability.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Hawkins, D.M.: Indentification of oufliers. Monogr. Appl. Probab. Stat. 80(2), 321–328 (1980)
Google Scholar
Agrawal, S., Agrawal, J.: Survey on anomaly detection using data mining techniques. Proc. Comput. Sci. 60(1), 708–713 (2015)
Article Google Scholar
Joseph, S.R., Hlomani, H., Letsholo, K.: Data mining algorithms: an overview. Neuroscience 12(3), 719–743 (2016)
Google Scholar
Lee, W.: Applying data mining to intrusion detection. ACM SIGKDD Explor. Newsl. 4(2), 35–42 (2002)
Article Google Scholar
Arora, P., Deepali, Varshney, S.: Analysis of k-means and k-medoids algorithm for big data. Proc. Comput. Sci. 78, 507–512 (2016)
Google Scholar
Celebi, M.E., Kingravi, H.A., Vela, P.A.: A Comparative Study of Efficient Initialization Methods for the K-Means Clustering Algorithm. Pergamon Press Inc., Oxford (2013)
Book Google Scholar
Han, Z.-J.: An adaptive k—means initialization method based on data density. Comput. Appl. Softw. 3t(2), 182–187 (2014). (in Chinese)
Google Scholar
Zuo, J., Chen, Z.: Anomaly detection algorithm based on improved k-means clustering. Comput. Sci. 43(8), 258–261 (2016). (in Chinese)
MathSciNet Google Scholar
Liang, J., Shi, Z., Li, D., et al.: Information entropy, rough entropy and knowledge granulation in incomplete information systems. Int. J. Gen Syst 35(6), 641–654 (2016)
Article MathSciNet Google Scholar
Qian, P., Jiang, Y., Deng, Z., et al.: Cluster prototypes and fuzzy memberships jointly leveraged cross-domain maximum entropy clustering. IEEE Trans. Cybern. 46(1), 181 (2016)
Article Google Scholar
Yang, Y.-M.: Improved k-means dynamic clustering algorithm based on information entropy. J. Chongqing Univ. Posts Telecommun. (Nat. Sci. Ed.) 28(2), 254–259 (2016). (in Chinese)
Google Scholar
Har-Peled, S., Mazumdar, S.: Coresets for k-means and k-median clustering and their applications. In: Annual ACM Symposium on Theory of Computing, pp. 291–300 (2004)
Google Scholar
Jia, G., Cheng, G., Gangahar, D.M., et al.: Traffic anomaly detection using k-means clustering 40(6), 403–410 (2012)
Google Scholar
Cohenaddad, V., Klein, P.N., Mathieu, C.: Local search yields approximation schemes for k-means and k-median in euclidean and minor-free metrics. In: Foundations of Computer Science, pp. 353–364. IEEE (2016)
Google Scholar
UCI Homepage. http://archive.ics.uci.edu/ml/datasets.html. Accessed 07 May 2018

Download references

Acknowledgements

This paper is supported in part by the National Natural Science Foundation of China under Grant No. 61672022, Key Disciplines of Computer Science and Technology of Shanghai Polytechnic University under Grant No. XXKZD1604, and the Graduate Innovation Program No. A01GY17F022.

Author information

Authors and Affiliations

School of Computer and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, 210016, China
Wenan Tan, Xi Fang & Lu Zhao
School of Computer and Information, Shanghai Polytechnic University, Shanghai, 2012209, China
Wenan Tan & Anqiong Tang

Authors

Wenan Tan
View author publications
You can also search for this author in PubMed Google Scholar
Xi Fang
View author publications
You can also search for this author in PubMed Google Scholar
Lu Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Anqiong Tang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Wenan Tan .

Editor information

Editors and Affiliations

Shandong University, Jinan, China
Yuqing Sun
Fudan University, Shanghai, China
Tun Lu
Guilin University of Technology, Guilin, China
Xiaolan Xie
University of Shanghai for Science and Technology, Shanghai , China
Liping Gao
Tongji University, Shanghai, China
Hongfei Fan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tan, W., Fang, X., Zhao, L., Tang, A. (2019). Anomaly Detection Algorithm Based on Cluster of Entropy. In: Sun, Y., Lu, T., Xie, X., Gao, L., Fan, H. (eds) Computer Supported Cooperative Work and Social Computing. ChineseCSCW 2018. Communications in Computer and Information Science, vol 917. Springer, Singapore. https://doi.org/10.1007/978-981-13-3044-5_26

Download citation

DOI: https://doi.org/10.1007/978-981-13-3044-5_26
Published: 11 December 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-3043-8
Online ISBN: 978-981-13-3044-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics