Fast Minimum Spanning Tree Based Clustering Algorithms on Local Neighborhood Graph

Jothi, R.; Mohanty, Sraban Kumar; Ojha, Aparajita

doi:10.1007/978-3-319-18224-7_29

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9069)

Included in the following conference series:

International Workshop on Graph-Based Representations in Pattern Recognition

Abstract

Minimum spanning tree (MST) based clustering algorithms have been employed successfully to detect clusters of heterogeneous nature. Given a dataset of \(n\) random points, most MST-based clustering algorithms first generate a complete graph \(G\) of the dataset and then construct an MST from \(G\). The first step is the major bottleneck, taking \(O(n^2)\) time. This paper proposes two algorithms, MST-based clustering on K-means Graph and MST-based clustering on Bi-means Graph, to reduce this computational overhead. The proposed algorithms use a centroid-based nearest neighbor rule to generate a partition-based Local Neighborhood Graph (LNG). We prove that both the size of the LNG and the time to construct it are \(O(n^{3/2})\), which is an \(O(\sqrt n)\) factor improvement over the traditional algorithms. The approximate MST is constructed from the LNG in \(O(n^{3/2} \lg n)\) time, which is asymptotically faster than \(O(n^2)\). The advantage of the proposed algorithms is that they do not require any parameter setting, which is a major issue in many nearest-neighbor-finding algorithms. Experimental results demonstrate that the computational time is reduced significantly while the quality of the clusters obtained from the MST is maintained.
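The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not spell out the centroid-based neighbor rule, so the sketch assumes (a) K-means with roughly \(\sqrt n\) partitions, (b) a complete subgraph inside each partition, and (c) one edge per pair of partitions, joining the point of each partition that lies nearest to the other partition's centroid. The function names `kmeans`, `local_neighborhood_graph`, and `kruskal` are hypothetical. With balanced partitions this yields about \(n^{3/2}\) edges, and Kruskal's algorithm on that sparse graph gives an approximate MST.

```python
import math
import random


def kmeans(points, k, iters=10, seed=0):
    """Plain Lloyd's K-means; returns (labels, centroids)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest centroid.
        labels = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                  for p in points]
        # Move each centroid to the mean of its members (skip empty clusters).
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(x) / len(members) for x in zip(*members))
    return labels, centroids


def local_neighborhood_graph(points, seed=0):
    """Sparse edge list (weight, u, v) built from ~sqrt(n) partitions.

    Assumed construction, not taken from the paper: all pairs inside a
    partition, plus one centroid-guided edge per pair of partitions.
    """
    n = len(points)
    k = max(1, math.isqrt(n))
    labels, centroids = kmeans(points, k, seed=seed)
    parts = []
    for c in range(k):
        group = [i for i in range(n) if labels[i] == c]
        if group:
            parts.append((group, centroids[c]))
    edges = []
    # Intra-partition edges: complete subgraph on each partition.
    for group, _ in parts:
        for a in range(len(group)):
            for b in range(a + 1, len(group)):
                u, v = group[a], group[b]
                edges.append((math.dist(points[u], points[v]), u, v))
    # Inter-partition edges: link the points closest to the other centroid.
    for a in range(len(parts)):
        for b in range(a + 1, len(parts)):
            ga, ca = parts[a]
            gb, cb = parts[b]
            u = min(ga, key=lambda i: math.dist(points[i], cb))
            v = min(gb, key=lambda i: math.dist(points[i], ca))
            edges.append((math.dist(points[u], points[v]), u, v))
    return edges


def kruskal(n, edges):
    """Kruskal's MST on an edge list, using union-find with path halving."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    tree = []
    for w, u, v in sorted(edges):
        ru, rv = find(u), find(v)
        if ru != rv:
            parent[ru] = rv
            tree.append((w, u, v))
    return tree


# Usage: approximate MST of 100 random 2-D points.
rng = random.Random(1)
points = [(rng.random(), rng.random()) for _ in range(100)]
lng_edges = local_neighborhood_graph(points)
approx_mst = kruskal(len(points), lng_edges)
print(len(lng_edges), "LNG edges vs", 100 * 99 // 2, "in the complete graph")
```

Because the LNG is a subgraph of the complete graph, the tree returned here is a valid spanning tree whose total weight is never below that of the exact MST. The edge-count bound depends on the partitions being reasonably balanced, which this sketch does not enforce.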

Author information

Authors and Affiliations

Indian Institute of Information Technology, Design and Manufacturing, Jabalpur, Madhya Pradesh, India
R. Jothi, Sraban Kumar Mohanty & Aparajita Ojha

Corresponding author

Correspondence to R. Jothi.

Editor information

Editors and Affiliations

Institute of Automation of CAS, Beijing, China
Cheng-Lin Liu
Anhui University, Anhui, China
Bin Luo
Vienna University of Technology, Vienna, Austria
Walter G. Kropatsch
Institute of Automation, Beijing, China
Jian Cheng

Cite this paper

Jothi, R., Mohanty, S.K., Ojha, A. (2015). Fast Minimum Spanning Tree Based Clustering Algorithms on Local Neighborhood Graph. In: Liu, CL., Luo, B., Kropatsch, W., Cheng, J. (eds) Graph-Based Representations in Pattern Recognition. GbRPR 2015. Lecture Notes in Computer Science, vol 9069. Springer, Cham. https://doi.org/10.1007/978-3-319-18224-7_29

DOI: https://doi.org/10.1007/978-3-319-18224-7_29
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18223-0
Online ISBN: 978-3-319-18224-7
eBook Packages: Computer Science, Computer Science (R0)
