Clustering data stream with uncertainty using belief function theory and fading function
Data stream clustering faces major constraints on memory and processing time, so traditional clustering methods are not suitable for this kind of data. Moreover, most data stream clustering methods do not account for uncertainty and ambiguity in the data: when an object is close to several clusters, it cannot be assigned simply and correctly to one of them. This study proposes a new method for clustering data streams under uncertain and ambiguous data, called clustering data stream using belief function. The proposed method uses belief function theory to assign each object either to a single cluster or to a set of clusters, and thereby determines the structure of the data. In addition, it uses a window, weighted centers, and a fading function to overcome the restrictions of data streams. Experimental comparisons with state-of-the-art methods show the superiority of the proposed method in terms of purity, error rate, and ambiguity rate.
Keywords: Clustering, Data stream, Uncertainty, Belief function theory, Dempster–Shafer theory, Fading function
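As a rough illustration of two ideas named in the abstract, the sketch below shows how a fading function can down-weight older objects when computing a weighted cluster center, and how an object lying close to several centers might be flagged as belonging to a set of clusters. Everything here is an assumption made for illustration: the exponential form `2 ** (-decay * age)`, the `decay` and `ratio` parameters, and all function names are hypothetical, and the distance-ratio rule is a simple stand-in heuristic, not the belief function assignment used by the proposed method.

```python
import math

def fading_weight(age, decay=0.1):
    """Assumed exponential fading function: the weight halves every 1/decay time units."""
    return 2.0 ** (-decay * age)

def weighted_center(points, timestamps, now, decay=0.1):
    """Center of a cluster where each point is weighted by its faded age."""
    weights = [fading_weight(now - t, decay) for t in timestamps]
    total = sum(weights)
    dim = len(points[0])
    return [
        sum(w * p[d] for w, p in zip(weights, points)) / total
        for d in range(dim)
    ]

def candidate_clusters(x, centers, ratio=1.2):
    """Stand-in heuristic (not belief functions): return the indices of all
    centers whose distance to x is within `ratio` times the nearest distance.
    A result with more than one index marks x as ambiguous."""
    dists = [math.dist(x, c) for c in centers]
    nearest = min(dists)
    return {i for i, d in enumerate(dists) if d <= ratio * nearest}

if __name__ == "__main__":
    # The older point (timestamp 0) counts half as much as the newer one.
    print(weighted_center([[0.0, 0.0], [2.0, 2.0]], [0, 10], now=10))
    # The object is nearly equidistant from the first two centers.
    print(candidate_clusters([0.0, 0.0], [[1.0, 0.0], [-1.1, 0.0], [5.0, 5.0]]))
```

In this sketch a larger `decay` forgets old objects faster, and a larger `ratio` makes the assignment more willing to return a set of clusters rather than a single one.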
Compliance with ethical standards
Conflict of interest
The authors declare that they have no conflict of interest.
This article does not contain any studies with animals performed by any of the authors.