Graph Clustering Based on Structural Similarity of Fragments

Yoshida, Tetsuya; Shoda, Ryosuke; Motoda, Hiroshi

doi:10.1007/11605126_6

Tetsuya Yoshida²²,
Ryosuke Shoda²³ &
Hiroshi Motoda²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3847))

294 Accesses
5 Citations

Abstract

Resources available over the Web are often used in combination to meet a specific need of a user. Since resource combinations can be represented as graphs in terms of the relations among the resources, locating desirable resource combinations can be formulated as locating the corresponding graph. This paper describes a graph clustering method based on structural similarity of fragments (currently, connected subgraphs are considered) in graph-structured data. A fragment is characterized based on the connectivity (degree) of a node in the fragment. A fragment spectrum of a graph is created based on the frequency distribution of fragments. Thus, the representation of a graph is transformed into a fragment spectrum in terms of the properties of fragments in the graph. Graphs are then clustered with respect to the transformed spectra by applying a standard clustering method. We also devise a criterion to determine the number of clusters by defining a pseudo-entropy for clusters. Preliminary experiments with synthesized data were conducted and the results are reported.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Amaral, L.A.N., Scala, A., Barthélémy, M., Stanley, H.E.: Classes of small-world networks. Proceedings of the National Academy of Sciences 97(21), 11149–11152 (2000)
Article Google Scholar
Chakrabarti, S.: Mining the Web: Discovering Knowledge from Hypertext Data. Morgan Kaufmann, San Francisco (2002)
Google Scholar
Clark, P., Niblett, T.: The cn2 induction algorithm. Machine Learning 3, 261–283 (1989)
Google Scholar
Cook, D.J., Holder, L.B.: Graph-based data mining. IEEE Intelligent Systems 15(2), 32–41 (2000)
Article Google Scholar
Dehaspe, L., Toivonen, H., King, R.D.: Finding frequent substructures in chemical compound. In: Proc. the 4th International conference on Knowledge Discovery and Data Mining, pp. 30–36 (1998)
Google Scholar
Inokuchi, A., Washio, T., Motoda, H.: Complete mining of frequent patterns from graphs: Mining graph data. Machine Learning 50(3), 321–354 (2003)
Article MATH Google Scholar
Kuramochi, M., Karypis, G.: Frequent subgraph discovery. In: Proc. of the 1st IEEE ICDM, pp. 313–320 (2001)
Google Scholar
Matsuda, T., Motoda, H., Yoshida, T., Washio, T.: Mining patterns from structured data by beam-wise graph-based induction. In: Lange, S., Satoh, K., Smith, C.H. (eds.) DS 2002. LNCS, vol. 2534, pp. 422–429. Springer, Heidelberg (2002)
Chapter Google Scholar
Matsuda, T., Yoshida, T., Motoda, H., Washio, T.: Beam-wise graph-based induction for structured data mining. In: International Workshop on Active Mining (AM 2002): working notes, pp. 23–30 (2002)
Google Scholar
Michalski, R.S.: Learning flexible concepts: Fundamental ideas and a method based on two-tiered representaion. Machine Learning: An Artificial Intelligence Approach 3, 63–102 (1990)
Google Scholar
Muggleton, S., de Raedt, L.: Inductive logic programming: Theory and methods. Journal of Logic Programming 19(20), 629–679 (1994)
Article MathSciNet Google Scholar
Nomura, S., Miki, T., Ishida, T.: Comparative Study of Web Citation Analysis and Bibliographical Citation Analysis in Community Mining. IEICE Transaction J87-D-I(3), 382–389 (2004) (in Japanese)
Google Scholar
Palmer, C.R., Gibbons, P.B., Faloutsos, C.: ANF: A fast and scalable tool for data mining in massive graphs. In: Proc. of the KDD 2002 (2002)
Google Scholar
Quinlan, J.R.: Induction of decision trees. Machine Learning 1, 81–106 (1986)
Google Scholar
Quinlan, J.R.: C4.5:Programs For Machine Learning. Morgan Kaufmann Publishers, San Francisco (1993)
Google Scholar
Raymond, J.W., Blankley, C.J., Willett, P.: Comparison of chemical clustering methods using graph- and fingerprint-based similarity measures. Molecular Graphics and Modelling 21(5), 421–433 (2003)
Article Google Scholar
Takahashi, Y., Ohoka, H., Ishiyama, Y.: Structural similarity analysis based on topological fragement spectra. Adavances in Molecular Similarity 2, 93–104 (1998)
Google Scholar
Watts, D.J.: Small Worlds: The Dynamics of Networks Between Order and Randomness. Princeton University Press, Princeton (2004)
MATH Google Scholar
Watts, D.J., Strogatz, S.H.: Collective dynamics of ‘small-world’ networks. Nature 393, 440–442 (1998)
Article Google Scholar
Yoshida, T., Warodom, G., Mogi, A., Ohara, K., Motoda, H., Washio, T., Yokoi, H., Takabayashi, K.: Preliminary analysis of interferon therapy by graph-based induction. In: Working note of International Workshop on Active Mining (AM 2004), pp. 31–40 (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Information Science and Technology, Hokkaido University, N-14 W-9, Sapporo, 060-0814, Japan
Tetsuya Yoshida
Institute of Scientific and Industrial Research, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka, 567-0047, Japan
Ryosuke Shoda & Hiroshi Motoda

Authors

Tetsuya Yoshida
View author publications
You can also search for this author in PubMed Google Scholar
Ryosuke Shoda
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Motoda
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Meme Media Laboratory, Hokkaido University Sapporo, Kita 13, Nishi 8, Kita-ku, 060-8628, Sapporo, Japan
Klaus P. Jantke
Meme Media Laboratory, Hokkaido University, 060-8628, Sapporo, Japan
Aran Lunzer
Laboratoire de Recherche en Informatique, Université Paris-Sud, Orsay Cedex, France
Nicolas Spyratos
Meme Media Laboratory, Hokkaido University, N13 W8, 0608628, Sapporo, Japan
Yuzuru Tanaka

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yoshida, T., Shoda, R., Motoda, H. (2006). Graph Clustering Based on Structural Similarity of Fragments. In: Jantke, K.P., Lunzer, A., Spyratos, N., Tanaka, Y. (eds) Federation over the Web. Lecture Notes in Computer Science(), vol 3847. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11605126_6

Download citation

DOI: https://doi.org/10.1007/11605126_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31018-1
Online ISBN: 978-3-540-32587-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics