Skip to main content

A General Powerful Graph Pattern Matching System for Data Analysis

  • Conference paper
  • First Online:

Part of the book series: Studies in Computational Intelligence ((SCI,volume 813))

Abstract

Graph pattern matching is a powerful mechanism for searching on network data. Most of the graph pattern matching tools available are based on subgraph isomorphism, i.e. finding a one-to-one correspondence between nodes of a query graph and nodes of a target graph. Often this approach is not flexible enough, since it does not allow the query to represent sets of nodes of any size that share the same profile of connectivity. We propose a novel powerful graph matching approach that overcomes the existing limits and allows the user to define complex patterns in a simple and intuitive way. In our approach, queries are expressed as graphs, whose nodes and edges specify structural constraints and filtering criteria. We show that, despite its simplicity, the proposed approach can solve a large variety of practical problems.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Our approach can also be applied to undirected graphs or multi-graphs. For simplicity of exposition, in this paper we refer to directed graphs only.

  2. 2.

    In this formulation we do not make any distinction between fixed nodes and variable nodes since the number of candidate valid associations depends on f.

  3. 3.

    Filtering conditions can be imposed on edges as well, though for simplicity of exposition this is not considered in our description.

  4. 4.

    https://github.com/mauricioaniche/repodriller.

References

  1. Angles, R., Arenas, M., Barceló, P., Hogan, A., Reutter, J., Vrgoč, D.: Foundations of modern query languages for graph databases. ACM Comput. Surv. (CSUR) 50(5), 68 (2017)

    Article  Google Scholar 

  2. Cohen, E., Datar, M., Fujiwara, S., Gionis, A., Indyk, P., Motwani, R., Ullman, J.D., Yang, C.: Finding interesting associations without support pruning. IEEE Trans. Knowl. Data Eng. 13(1), 64–78 (2001)

    Article  Google Scholar 

  3. Cordella, L.P., Foggia, P., Sansone, C., Vento, M.: A (sub) graph isomorphism algorithm for matching large graphs. IEEE Trans. Pattern Anal. Mach. Intell. 26(10), 1367–1372 (2004)

    Article  Google Scholar 

  4. Di Natale, R., Ferro, A., Giugno, R., Mongiovì, M., Pulvirenti, A., Shasha, D.: Sing: Subgraph search in non-homogeneous graphs. BMC Bioinform. 11(1), 1 (2010)

    Article  Google Scholar 

  5. Fan, W., Li, J., Ma, S., Tang, N., Wu, Y., Wu, Y.: Graph pattern matching: from intractable to polynomial time. Proc. VLDB Endow. 3(1–2), 264–275 (2010)

    Article  Google Scholar 

  6. Gamma, E., Helm, R., Johnson, R., Vlissides, J.: Design Patterns: Elements of Reusable Object-oriented Software. Addison-Wesley, Boston (1994)

    Google Scholar 

  7. He, H., Singh, A.K.: Closure-tree: an index structure for graph queries. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 38–38. IEEE (2006)

    Google Scholar 

  8. Henzinger, M.R., Henzinger, T.A., Kopke, P.W.: Computing simulations on finite and infinite graphs. In: Proceedings of Annual Symposium on Foundations of Computer Science, pp. 453–462. IEEE (1995)

    Google Scholar 

  9. Ma, S., Cao, Y., Fan, W., Huai, J., Wo, T.: Capturing topology in graph pattern matching. Proc. VLDB Endow. 5(4), 310–321 (2011)

    Article  Google Scholar 

  10. Ma, S., Cao, Y., Fan, W., Huai, J., Wo, T.: Strong simulation: capturing topology in graph pattern matching. ACM Trans. Database Syst. (TODS) 39(1), 4 (2014)

    Article  MathSciNet  Google Scholar 

  11. Mongiovì, M., Di Natale, R., Giugno, R., Pulvirenti, A., Ferro, A., Sharan, R.: Sigma: a set-cover-based inexact graph matching algorithm. J. Bioinform. Comput. Biol. 8(02), 199–218 (2010)

    Article  Google Scholar 

  12. Mongiovì, M., Pappalardo, G., Tramontana, E.: Specifying and identifying widely used crosscutting concerns. Knowl.-Based Syst. 126, 20–32 (2017)

    Article  Google Scholar 

  13. Peng, P., Zou, L., Chen, L., Lin, X., Zhao, D.: Answering subgraph queries over massive disk resident graphs. World Wide Web 19(3), 417–448 (2016)

    Article  Google Scholar 

  14. Pienta, R., Tamersoy, A., Tong, H., Chau, D.H.: Mage: Matching approximate patterns in richly-attributed graphs. In: Proceedings of International Conference on Big Data (Big Data), pp. 585–590. IEEE (2014)

    Google Scholar 

  15. Pinter, R.Y., Rokhlenko, O., Yeger-Lotem, E., Ziv-Ukelson, M.: Alignment of metabolic pathways. Bioinformatics (Oxford, England) 21(16), 3401–8 (2005)

    Article  Google Scholar 

  16. Zhang, S., Hu, M., Yang, J.: Treepi: A novel graph indexing method. In: Proceedings of International Conference on Data Engineering (ICDE), pp. 966–975. IEEE (2007)

    Google Scholar 

  17. Zhang, S., Li, S., Yang, J.: GADDI: distance index based subgraph matching in biological networks. In: Proceedings of International Conference on Extending Database Technology: Advances in Database Technology, pp. 192–203. ACM (2009)

    Google Scholar 

  18. Zhang, S., Yang, J., Jin, W.: Sapper: subgraph indexing and approximate matching in large graphs. Proc. VLDB Endow. 3(1–2), 1185–1194 (2010)

    Article  Google Scholar 

Download references

Acknowledgement

This work is supported by the CLARA - CLoud plAtform and smart underground imaging for natural Risk Assessment - project, SCN 00451, funded by the Italian Ministry of Education, Universities and Research, within the “Smart Cities and Communities and Social Innovation” initiative.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Misael Mongiovì .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Fornaia, A., Mongiovì, M., Pappalardo, G., Tramontana, E. (2019). A General Powerful Graph Pattern Matching System for Data Analysis. In: Aiello, L., Cherifi, C., Cherifi, H., Lambiotte, R., Lió, P., Rocha, L. (eds) Complex Networks and Their Applications VII. COMPLEX NETWORKS 2018. Studies in Computational Intelligence, vol 813. Springer, Cham. https://doi.org/10.1007/978-3-030-05414-4_4

Download citation

Publish with us

Policies and ethics