Abstract
In the data mining context, semi-supervised learning is applicable in circumstances where only a scarce amount of information on the intrinsic structure of a dataset is available. This information may be in the form a few labelled instances or a relatively small set of constraints on the pairwise memberships of particular instances. In this study we are providing a semi-supervised fuzzy clustering model which modifies versions of conventional DBSCAN algorithm in order to generate soft clusters which foreclose the noise points. The employed modifications are mostly related to the control parameters of the algorithm intending to utilize the additional information (which in our case is in the form of a few labelled instances) and adaptations towards the fuzzy clustering approach. Finally, several experimental procedures have been conducted on synthetic and real-world benchmark datasets in order to assess the accuracy of our employed model and to compare it to the conventional algorithms of the respective domain.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhu, X.: Semi-supervised learning literature survey. University of Wisconsin-Madison, Department of Computer Sciences (2005)
Bedalli, E., Mançellari, E., Asilkan, O.: A heterogeneous cluster ensemble model for improving the stability of fuzzy cluster analysis. Proc. Comput. Sci. 102, 129–136.4 (2016)
Berkhin, P.: A survey of clustering data mining techniques. In: Grouping Multidimensional Data, pp. 25–71. Springer, Heidelberg (2006)
Bedalli, E., Ninka, I.: Adapting the fuzzy c-means clustering algorithm for a semi-supervised learning approach. Sci. Innov. New Technol. 1, 61 (2014)
Grabocka, J., Bedalli, E., Schmidt-Thieme, L.: Supervised nonlinear factorizations excel in semi-supervised regression. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining, pp. 188–199. Springer, Cham (2014)
Schubert, E., Sander, J., Ester, M., Kriegel, H.-P., Xu, X.: DBSCAN revisited, revisited: why and how you should (still) use DBSCAN. ACM Trans. Database Syst. (TODS) 42(3), 19 (2017)
Wagstaff, K., Cardie, C., Rogers, S., Schrödl, S.: Constrained k-means clustering with background knowledge. In: ICML, vol. 1, pp. 577–584 (2001)
Grira, N., Crucianu, M., Boujemaa, N.: Semi-supervised fuzzy clustering with pairwise-constrained competitive agglomeration. In: The 14th IEEE International Conference on Fuzzy Systems. IEEE (2005)
Lelis, L., Sander, J.: Semi-supervised density-based clustering. In: Ninth IEEE International Conference on Data Mining. IEEE (2009)
Abir, S., Eloudi, Z.: Soft DBSCAN: improving DBSCAN clustering method using fuzzy set theory. In: 2013 6th International Conference on Human System Interactions (HSI), pp. 380–385. IEEE (2013)
Gan, J., Tao, Y.: DBSCAN revisited: mis-claim, un-fixability, and approximation. In: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, pp. 519–530 (2015)
Bedalli, E., Enea, M., Esteriana, H.: Exploring user feedback data via a hybrid fuzzy clustering model combining variations of FCM and density-based clustering. In: International Conference on Intelligent Networking and Collaborative Systems, pp. 71–81. Springer, Cham (2018)
Khan, K., Rehman, S.U., Aziz, K., Fong, S., Sarasvady, S.: DBSCAN: past, present and future. In: The Fifth International Conference on the Applications of Digital Information and Web Technologies (ICADIWT 2014), pp. 232–238 (2014)
Amin, K., Johansson, R.: Choosing DBSCAN parameters automatically using differential evolution. Int. J. Comput. Appl. 91(7), 1–11 (2014)
Carlos, R., Spiliopoulou, M., Menasalvas, E.: C-DBSCAN: density-based clustering with constraints. In: International Workshop on Rough Sets, Fuzzy Sets, Data Mining, and Granular-Soft Computing, pp. 216–223. Springer, Heidelberg, (2007)
Dino, I., Bordogna, G.: Fuzzy extensions of the DBSCAN clustering algorithm. Soft Comput. 22(5), 1719–1730 (2018)
Arthur, A., Newman, D.: UCI machine learning repository (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Bedalli, E., Mançellari, E., Rada, R. (2020). A Semi-supervised Fuzzy Clustering Approach via Modifications of the DBSCAN Algorithm. In: Aliev, R., Kacprzyk, J., Pedrycz, W., Jamshidi, M., Babanli, M., Sadikoglu, F. (eds) 10th International Conference on Theory and Application of Soft Computing, Computing with Words and Perceptions - ICSCCW-2019. ICSCCW 2019. Advances in Intelligent Systems and Computing, vol 1095. Springer, Cham. https://doi.org/10.1007/978-3-030-35249-3_29
Download citation
DOI: https://doi.org/10.1007/978-3-030-35249-3_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-35248-6
Online ISBN: 978-3-030-35249-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)