Abstract
Most graph-theoretical clustering algorithms require the complete adjacency relation of the graph representing the examined data. This is infeasible for very large graphs currently emerging in many application areas. We propose a local approach that computes clusters in graphs, one at a time, relying only on the neighborhoods of the vertices included in the current cluster candidate. This enables implementing a local and parameter-free algorithm. Approximate clusters may be identified quickly by heuristic methods. We report experimental results on clustering graphs using simulated annealing.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brandes, U., Gaertler, M., Wagner, D.: Experiments on graph clustering algorithms. In: Di Battista, G., Zwick, U. (eds.) ESA 2003. LNCS, vol. 2832, pp. 568–579. Springer, Heidelberg (2003)
Guha, S., et al.: Clustering data streams. In: Proc. of FOCS, pp. 359–366. IEEE Comp. Soc. Press, Los Alamitos (2000)
Kannan, R., Vempala, S., Vetta, A.: On clusterings — good, bad and spectral. In: Proc. of FOCS, pp. 367–377. IEEE Comp. Soc. Press, Los Alamitos (2000)
Kirkpatrick, S., Gelatt Jr., C.D., Vecchi, M.P.: Optimization by simulated annealing. Science 220(4598), 671–680 (1983)
Kleinberg, J., Lawrence, S.: The structure of the web. Science 294(5548), 1849–1850 (2001)
Mihail, M., et al.: On the semantics of Internet topologies. Tech. report GIT-CC-02-07, Atlanta, USA (2002)
Newman, M.E.J.: The structure of scientific collaboration networks. Proc. Natl. Acad. Sci. USA 98(2), 404–409 (2001)
Newman, M.E.J.: Fast algorithm for detecting community structure in networks. Phys. Rev. EÂ 69(066133) (2004)
O’Callaghan, L., et al.: Streaming-data algorithms for high-quality clustering. In: Proc. of ICDE, pp. 685–694. IEEE Comp. Soc. Press, Los Alamitos (2002)
Orponen, P., Schaeffer, S.E.: Local clustering of large graphs by approximate Fiedler vectors. In: Nikoletseas, S.E. (ed.) WEA 2005. LNCS, vol. 3503, pp. 524–533. Springer, Heidelberg (2005)
Å Ãma, J., Schaeffer, S.E.: On the NP-completeness of some cluster fitness measures (2005) (In preparation)
Virtanen, S.E.: Clustering the Chilean web. In: Proc. of LA-Web, pp. 229–231. IEEE Comp. Soc. Press, Los Alamitos (2003)
Virtanen, S.E.: Properties of nonuniform random graph models. Research Report A77, TCS, TKK, Espoo, Finland (2003)
Virtanen, S.E., Nikander, P.: Local clustering for hierarchical ad hoc networks. In: Proc. of WiOpt 2004, pp. 404–405 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schaeffer, S.E. (2005). Stochastic Local Clustering for Massive Graphs. In: Ho, T.B., Cheung, D., Liu, H. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2005. Lecture Notes in Computer Science(), vol 3518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11430919_42
Download citation
DOI: https://doi.org/10.1007/11430919_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26076-9
Online ISBN: 978-3-540-31935-1
eBook Packages: Computer ScienceComputer Science (R0)