Skip to main content

Classification of Query Graph Using Maximum Connected Component

  • Conference paper
  • First Online:
Book cover Smart Innovations in Communication and Computational Sciences

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 851))

  • 643 Accesses

Abstract

Graphs have been successfully applied in several applications such as classification of protein molecules or chemical compounds, software bug detection, text classification etc. to model structural relationships in the data. Existing graph classification methods have focused on either graph kernels or frequent subgraph patterns for classification. Graph kernel approaches work with fixed input dimension whereas subgraph feature based methods suffer from exponential pattern search space and high complexity due to graph isomorphism. To overcome these limitations, the proposed approach of classification using maximum connected component (CMCC) builds DFS tree to store only maximum substructures based on DFS canonical code approach of gSpan. Once DFS trees are built for all the classes, the query graph is classified based on its maximum match with input graph set. Advantages of using DFS trees are faster graph matching as compared to graph isomorphism test, less storage requirements and speed-up in execution time. Experimental analysis on five real-world datasets shows that the proposed method CMCC significantly enhances the classification accuracy as compared to other established methods on most of the datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://cdb.ics.uci.edu/cgibin/LearningDatasetsWeb.py.

References

  1. Deshpande, M., Kuromochi, M., Wale, N., Karypis, G.: Frequent substructure-based approaches for classifying chemical compounds. IEEE Trans. Knowl. Data Eng. 17(8), 1036–1050 (2005)

    Google Scholar 

  2. Jin, N., Young, C.: Graph Classification Based on Pattern Co-occurrence. CIKM, Hong Kong, China (2009)

    Book  Google Scholar 

  3. Jin, N., Young, C.: Graph Classification Using Evolutionary Computation. SIGMOD, USA (2010)

    Google Scholar 

  4. Ranu, S., Hoang, M., Singh, A.K.: Mining discriminative subgraphs from global-state networks. In Proceedings of the 19th ACM International Conference on Knowledge Discovery and Data Mining, pp. 509–517 (2013)

    Google Scholar 

  5. Dang, X.H., Singh, A.K., Bogdanov, P., You, H., Hsu, B.: Discriminative subnetworks with regularized spectral learning for global-state network data. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 290–306. Springer, Berlin, Heidelberg (2014)

    Google Scholar 

  6. Dang, X.H., You, H., Bogdanov, P., Singh, A.K.: Learning predictive substructures with regularization for network data. In: IEEE International Conference on Data Mining (ICDM), pp. 81–90 (2015)

    Google Scholar 

  7. Pan, S., Zhu, X., Zhang, C., Philip, S.Y.: Graph stream classification using labeled and unlabeled graphs. In: IEEE 29th International Conference on Data Engineering (ICDE), pp. 398–409 (2013)

    Google Scholar 

  8. Inokuchi, A., Washio, T., Motoda, H.: An apriori-based algorithm for mining frequent substructures from graph data. In: Principles of Data Mining and Knowledge Discovery, pp. 13–23 (2000)

    Google Scholar 

  9. Kuramochi, M., Karypis, G.: Frequent subgraph discovery. In: Proceedings of the IEEE International Conference on Data Mining ICDM, pp. 313–320 (2001)

    Google Scholar 

  10. Huan, J., Wang, W., Prins, J.: Efficient mining of frequent subgraphs in the presence of isomorphism. In: IEEE 3rd International Conference on Data Mining, ICDM, pp. 549–552 (2003)

    Google Scholar 

  11. Yan,X., Han, J.: gspan: graph-based substructure pattern mining. In: Proceedings of the IEEE International Conference on Data Mining, ICDM, pp. 721–724 (2002)

    Google Scholar 

  12. Nijssen, S., Kok, J.N.: A quickstart in frequent structure mining can make a difference. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge discovery and data mining, pp. 647–652 (2004)

    Google Scholar 

  13. Schölkopf, B., Tsuda, K., Vert, J.P. (eds.): Kernel Methods in Computational Biology. MIT press (2004)

    Google Scholar 

  14. Mahé, P., Ueda, N., Akutsu, T., Perret, J.L.: Extensions of marginalized graph kernels. In: Proceedings of the 21st ACM International Conference Machine Learning, p. 70 (2004)

    Google Scholar 

  15. Seeland, M., Karwath, A., Kramer, S.: A structural cluster kernel for learning on graphs. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge discovery and data mining, pp. 516–524 (2012)

    Google Scholar 

  16. Shervashidze, N., Schweitzer, P., Leeuwen, E.J.V., Mehlhorn, K., Borgwardt, K.M.: Weisfeiler-lehman graph kernels. J. Mach. Learn. Res. 12, 2539–2561 (2011)

    Google Scholar 

  17. Neumann, M., Garnett, R., Bauckhage, C., Kersting, K.: Propagation kernels: efficient graph kernels from propagated information. Mach. Learn. 102(2), 209–245 (2016)

    Article  MathSciNet  Google Scholar 

  18. Riesen, K., Bunke, H.: Graph classification by means of Lipschitz embedding. IEEE Trans. Syst. Man Cybern. Part B (Cybernetics) 39(6), 1472–1483 (2009)

    Google Scholar 

  19. Kong, X., Philip, S.Y.: gMLC: a multi-label feature selection framework for graph classification. Knowl. Inf. Syst. 31(2), 281–305 (2012)

    Article  Google Scholar 

  20. Zhou, K.: gspan algorithm in data mining. https://github.com/Jokeren/DataMining-gSpan. Accessed 20 Nov 2017

  21. Zhu, Y., Yu, J.X., Cheng, H., Qin, L.: Graph classification: a diversified discriminative feature selection approach. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 205–214 (2012)

    Google Scholar 

  22. Ketkar, N.S., Holder, L.B., Cook, D.J.: Subdue: compression-based frequent pattern discovery in graph data. In: Proceedings of the 1st international workshop on Open Source Data Mining: Frequent Pattern Mining Implementations, pp. 71–76 (2005)

    Google Scholar 

  23. Moonesinghe, H.D.K., Valizadegan, H., Fodeh, S., Tan, P.N.: A probabilistic substructure-based approach for graph classification. In: 19th IEEE Conference on ICTAI 2007, vol. 1, pp. 346–349 (2007)

    Google Scholar 

  24. Ketkar, N.S., Holder, L.B., Cook, D.J.: Empirical comparison of graph classification algorithms. In: IEEE Symposium on Computational Intelligence and Data Mining (CIDM), pp. 259–266. IEEE (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Parnika Paranjape .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Paranjape, P., Dhabu, M., Pathak, R., Funde, N., Deshpande, P. (2019). Classification of Query Graph Using Maximum Connected Component. In: Tiwari, S., Trivedi, M., Mishra, K., Misra, A., Kumar, K. (eds) Smart Innovations in Communication and Computational Sciences. Advances in Intelligent Systems and Computing, vol 851. Springer, Singapore. https://doi.org/10.1007/978-981-13-2414-7_38

Download citation

Publish with us

Policies and ethics