Abstract
This research is related to designing a new algorithm which is based on the existing DBSCAN algorithm to improve the quality of clustering. DBSCAN algorithm categorizes each data object as either a core point, a border point or a noise point. These points are classified based on the density determined by the input parameters. However, in DBSCAN algorithm, a border point is designated the same cluster as its core point. This leads to a disadvantage of DBSCAN algorithm which is popularly known as the problem of transitivity. The proposed algorithmーtwo DBSCAN with local outlier detection (2DBSCAN-LOD), tries to address this problem. Average silhouette width score is used here to compare the quality of clusters formed by both algorithms. By testing 2DBSCAN-LOD on different artificial datasets, it is found that the average silhouette width score of clusters formed by DBSCAN-LOD is higher than that of the clusters formed by DBSCAN.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ghuman, Sukhdev Singh. 2016. Clustering techniques—a review.
Khan, M.M.R., M.A.B. Siddique, R.B. Arif, and M.R. Oishe. 2018. ADBSCAN: adaptive density-based spatial clustering of applications with noise for ıdentifying clusters with varying densities.
Martin, E., Hans-Peter Kriegel, Jörg Sander, and X. Xiaowei.1996. A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise.
Campello, Ricardo J.G.B., Davoud Moulavi, Arthur Zimek, and Jörg Sander. 2015. Hierarchical density estimates for data clustering, visualization, and outlier detection.
Chepenko, D. 2018. A Density-based algorithm for outlier detection. Medium. https://towardsdatascience.com/density-based-algorithm-for-outlier-detection-8f278d2f7983.
Rousseeuw, P.J. 1987. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis.
Barton, T. 2015. Clustering-benchmark. Github. https://github.com/deric/clustering-benchmark.
Bandyopadhyay, S., and U. Maulik. 2002. Nonparametric genetic clustering: comparison validity indices.
Bhattacharyya, S. 2019. DBSCAN algorithm: complete guide and application with python scikit-learn, clustering spatial database. https://towardsdatascience.com/dbscan-algorithm-complete-guide-and-application-with-python-scikit-learn-d690cbae4c5d.
DBSCAN Wikipedia. https://en.wikipedia.org/wiki/DBSCAN.
Silhouette Width Wikipedia. https://en.wikipedia.org/wiki/Silhouette_(clustering).
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Pandya, U., Mistry, V., Rathwa, A., Kachroo, H., Jivani, A. (2020). 2DBSCAN with Local Outlier Detection. In: Hu, YC., Tiwari, S., Trivedi, M., Mishra, K. (eds) Ambient Communications and Computer Systems. Advances in Intelligent Systems and Computing, vol 1097. Springer, Singapore. https://doi.org/10.1007/978-981-15-1518-7_21
Download citation
DOI: https://doi.org/10.1007/978-981-15-1518-7_21
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1517-0
Online ISBN: 978-981-15-1518-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)