Classification of Query Graph Using Maximum Connected Component

Paranjape, Parnika; Dhabu, Meera; Pathak, Rushikesh; Funde, Nitesh; Deshpande, Parag

doi:10.1007/978-981-13-2414-7_38

Parnika Paranjape¹⁹,
Meera Dhabu¹⁹,
Rushikesh Pathak²⁰,
Nitesh Funde¹⁹ &
…
Parag Deshpande¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 851))

644 Accesses

Abstract

Graphs have been successfully applied in several applications such as classification of protein molecules or chemical compounds, software bug detection, text classification etc. to model structural relationships in the data. Existing graph classification methods have focused on either graph kernels or frequent subgraph patterns for classification. Graph kernel approaches work with fixed input dimension whereas subgraph feature based methods suffer from exponential pattern search space and high complexity due to graph isomorphism. To overcome these limitations, the proposed approach of classification using maximum connected component (CMCC) builds DFS tree to store only maximum substructures based on DFS canonical code approach of gSpan. Once DFS trees are built for all the classes, the query graph is classified based on its maximum match with input graph set. Advantages of using DFS trees are faster graph matching as compared to graph isomorphism test, less storage requirements and speed-up in execution time. Experimental analysis on five real-world datasets shows that the proposed method CMCC significantly enhances the classification accuracy as compared to other established methods on most of the datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
http://cdb.ics.uci.edu/cgibin/LearningDatasetsWeb.py.

References

Deshpande, M., Kuromochi, M., Wale, N., Karypis, G.: Frequent substructure-based approaches for classifying chemical compounds. IEEE Trans. Knowl. Data Eng. 17(8), 1036–1050 (2005)
Google Scholar
Jin, N., Young, C.: Graph Classification Based on Pattern Co-occurrence. CIKM, Hong Kong, China (2009)
Book Google Scholar
Jin, N., Young, C.: Graph Classification Using Evolutionary Computation. SIGMOD, USA (2010)
Google Scholar
Ranu, S., Hoang, M., Singh, A.K.: Mining discriminative subgraphs from global-state networks. In Proceedings of the 19th ACM International Conference on Knowledge Discovery and Data Mining, pp. 509–517 (2013)
Google Scholar
Dang, X.H., Singh, A.K., Bogdanov, P., You, H., Hsu, B.: Discriminative subnetworks with regularized spectral learning for global-state network data. In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 290–306. Springer, Berlin, Heidelberg (2014)
Google Scholar
Dang, X.H., You, H., Bogdanov, P., Singh, A.K.: Learning predictive substructures with regularization for network data. In: IEEE International Conference on Data Mining (ICDM), pp. 81–90 (2015)
Google Scholar
Pan, S., Zhu, X., Zhang, C., Philip, S.Y.: Graph stream classification using labeled and unlabeled graphs. In: IEEE 29th International Conference on Data Engineering (ICDE), pp. 398–409 (2013)
Google Scholar
Inokuchi, A., Washio, T., Motoda, H.: An apriori-based algorithm for mining frequent substructures from graph data. In: Principles of Data Mining and Knowledge Discovery, pp. 13–23 (2000)
Google Scholar
Kuramochi, M., Karypis, G.: Frequent subgraph discovery. In: Proceedings of the IEEE International Conference on Data Mining ICDM, pp. 313–320 (2001)
Google Scholar
Huan, J., Wang, W., Prins, J.: Efficient mining of frequent subgraphs in the presence of isomorphism. In: IEEE 3rd International Conference on Data Mining, ICDM, pp. 549–552 (2003)
Google Scholar
Yan,X., Han, J.: gspan: graph-based substructure pattern mining. In: Proceedings of the IEEE International Conference on Data Mining, ICDM, pp. 721–724 (2002)
Google Scholar
Nijssen, S., Kok, J.N.: A quickstart in frequent structure mining can make a difference. In: Proceedings of the 10th ACM SIGKDD International Conference on Knowledge discovery and data mining, pp. 647–652 (2004)
Google Scholar
Schölkopf, B., Tsuda, K., Vert, J.P. (eds.): Kernel Methods in Computational Biology. MIT press (2004)
Google Scholar
Mahé, P., Ueda, N., Akutsu, T., Perret, J.L.: Extensions of marginalized graph kernels. In: Proceedings of the 21st ACM International Conference Machine Learning, p. 70 (2004)
Google Scholar
Seeland, M., Karwath, A., Kramer, S.: A structural cluster kernel for learning on graphs. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge discovery and data mining, pp. 516–524 (2012)
Google Scholar
Shervashidze, N., Schweitzer, P., Leeuwen, E.J.V., Mehlhorn, K., Borgwardt, K.M.: Weisfeiler-lehman graph kernels. J. Mach. Learn. Res. 12, 2539–2561 (2011)
Google Scholar
Neumann, M., Garnett, R., Bauckhage, C., Kersting, K.: Propagation kernels: efficient graph kernels from propagated information. Mach. Learn. 102(2), 209–245 (2016)
Article MathSciNet Google Scholar
Riesen, K., Bunke, H.: Graph classification by means of Lipschitz embedding. IEEE Trans. Syst. Man Cybern. Part B (Cybernetics) 39(6), 1472–1483 (2009)
Google Scholar
Kong, X., Philip, S.Y.: gMLC: a multi-label feature selection framework for graph classification. Knowl. Inf. Syst. 31(2), 281–305 (2012)
Article Google Scholar
Zhou, K.: gspan algorithm in data mining. https://github.com/Jokeren/DataMining-gSpan. Accessed 20 Nov 2017
Zhu, Y., Yu, J.X., Cheng, H., Qin, L.: Graph classification: a diversified discriminative feature selection approach. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management, pp. 205–214 (2012)
Google Scholar
Ketkar, N.S., Holder, L.B., Cook, D.J.: Subdue: compression-based frequent pattern discovery in graph data. In: Proceedings of the 1st international workshop on Open Source Data Mining: Frequent Pattern Mining Implementations, pp. 71–76 (2005)
Google Scholar
Moonesinghe, H.D.K., Valizadegan, H., Fodeh, S., Tan, P.N.: A probabilistic substructure-based approach for graph classification. In: 19th IEEE Conference on ICTAI 2007, vol. 1, pp. 346–349 (2007)
Google Scholar
Ketkar, N.S., Holder, L.B., Cook, D.J.: Empirical comparison of graph classification algorithms. In: IEEE Symposium on Computational Intelligence and Data Mining (CIDM), pp. 259–266. IEEE (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Visvesvaraya National Institute of Technology, Nagpur, India
Parnika Paranjape, Meera Dhabu, Nitesh Funde & Parag Deshpande
Nutanix Technologies India Pvt. Ltd., Bangalore, Karnataka, India
Rushikesh Pathak

Authors

Parnika Paranjape
View author publications
You can also search for this author in PubMed Google Scholar
Meera Dhabu
View author publications
You can also search for this author in PubMed Google Scholar
Rushikesh Pathak
View author publications
You can also search for this author in PubMed Google Scholar
Nitesh Funde
View author publications
You can also search for this author in PubMed Google Scholar
Parag Deshpande
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Parnika Paranjape .

Editor information

Editors and Affiliations

Computer Science and Engineering Department, ABES Engineering College, Ghaziabad, Uttar Pradesh, India
Shailesh Tiwari
Department of Information Technology, Rajkiya Engineering College, Azamgarh, Uttar Pradesh, India
Munesh C. Trivedi
Department of Computer Science and Engineering, Motilal Nehru National Institute of Technology, Allahabad, Uttar Pradesh, India
Krishn K. Mishra
Department of Computer Science and Engineering, Motilal Nehru National Institute of Technology, Allahabad, Uttar Pradesh, India
A. K. Misra
Department of Computer Science and Engineering, University of Mauritius, Moka, Mauritius
Khedo Kavi Kumar

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Paranjape, P., Dhabu, M., Pathak, R., Funde, N., Deshpande, P. (2019). Classification of Query Graph Using Maximum Connected Component. In: Tiwari, S., Trivedi, M., Mishra, K., Misra, A., Kumar, K. (eds) Smart Innovations in Communication and Computational Sciences. Advances in Intelligent Systems and Computing, vol 851. Springer, Singapore. https://doi.org/10.1007/978-981-13-2414-7_38

Download citation

DOI: https://doi.org/10.1007/978-981-13-2414-7_38
Published: 20 November 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-2413-0
Online ISBN: 978-981-13-2414-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics