Abstract
Network querying is a powerful approach to mine molecular interaction networks. Most network querying tools support queries in the form of a template sub-network, in case of topology-constrained queries, or a set of colored vertices in case of topology-free queries. A third approach is grammar-based queries, which are more flexible and expressive as they allow the addition of logic rules to the query. Previous grammar-based querying tools defined queries via string grammars and identified paths in graphs. In this paper, we extend the scope of grammar-based queries to regular tree grammar (RTG), and the scope of the identified sub-graphs from paths to trees. We introduce a new problem and propose a novel algorithm to search a given graph for the k highest scoring sub-graphs matching a tree accepted by an RTG. Our algorithm is based on dynamic programming and combines an extension to k-best parsing optimization with color coding. We implement the new algorithm and exemplify its application to mining the human-viral interaction network. Our code is available at http://www.cs.bgu.ac.il/~smolyi/RTGnet/.
These authors contributed equally to the paper.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abbas, Y.M., Pichlmair, A., Górna, M.W., Superti-Furga, G., Nagar, B.: Structural basis for viral 5 [prime]-PPP-RNA recognition by human IFIT proteins. Nature 494, 60–64 (2013)
Alon, N., Yuster, R., Zwick, U.: Color-coding. J. ACM (JACM) 42(4), 844–856 (1995)
Arimoto, K.I., Konishi, H., Shimotohno, K.: Ubch8 regulates ubiquitin and ISG15 conjugation to RIG-I. Mol. Immunol. 45(4), 1078–1084 (2008)
Ashburner, M., Ball, C., Blake, J., et al.: Gene ontology: tool for the unification of biology. The gene ontology consortium database resources of the national center for biotechnology information. Nucleic Acids Res. 34 (2006)
Barabási, A.L., Gulbahce, N., Loscalzo, J.: Network medicine: a network-based approach to human disease. Nat. Rev. Genet. 12(1), 56–68 (2011)
Belshe, R.B.: Implications of the emergence of a novel H1 influenza virus. N. Engl. J. Med. 360(25), 2667–2668 (2009)
Bruckner, S., Hüffner, F., Karp, R.M., Shamir, R., Sharan, R.: Topology-free querying of protein interaction networks. J. Comput. Biol. 17(3), 237–252 (2010)
Carme, J., Niehren, J., Tommasi, M.: Querying unranked trees with stepwise tree automata. In: van Oostrom, V. (ed.) RTA 2004. LNCS, vol. 3091, pp. 105–118. Springer, Heidelberg (2004)
Comon, H., Dauchet, M., Gilleron, R., Löding, C., Jacquemard, F., Lugiez, D., Tison, S., Tommasi, M.: Tree automata techniques and applications (2007) (release 12 October 2007)
Dost, B., Shlomi, T., Gupta, N., Ruppin, E., Bafna, V., Sharan, R.: Qnet: a tool for querying protein interaction networks. J. Comput. Biol. 15(7), 913–925 (2008)
Ea, C.K., Deng, L., Xia, Z.P., Pineda, G., Chen, Z.J.: Activation of IKK by TNF\(\alpha \) requires site-specific ubiquitination of RIP1 and polyubiquitin binding by NEMO. Mol. Cell 22(2), 245–257 (2006)
Fan, W., Li, J., Ma, S., Tang, N., Wu, Y.: Adding regular expressions to graph reachability and pattern queries. In: 2011 IEEE 27th International Conference on Data Engineering (ICDE), pp. 39–50. IEEE (2011)
Finkelstein, A., Roytberg, M.: Computation of biopolymers: a general approach to different problems. BioSystems 30(1), 1–19 (1993)
Gack, M.U., Shin, Y.C., Joo, C.H., Urano, T., Liang, C., Sun, L., Takeuchi, O., Akira, S., Chen, Z., Inoue, S., et al.: TRIM25 RING-finger E3 ubiquitin ligase is essential for RIG-I-mediated antiviral activity. Nature 446(7138), 916–920 (2007)
Gack, M.U., Albrecht, R.A., Urano, T., Inn, K.S., Huang, I.C., Carnero, E., Farzan, M., Inoue, S., Jung, J.U., GarcÃa-Sastre, A.: Influenza a virus NS1 targets the ubiquitin ligase TRIM25 to evade recognition by the host viral RNA sensor RIG-I. Cell Host Microbe 5(5), 439–449 (2009)
Gesmundo, A., Henderson, J.: Faster cube pruning. In: IWSLT, pp. 267–274. Citeseer (2010)
Guan, R., Ma, L.C., Leonard, P.G., Amer, B.R., Sridharan, H., Zhao, C., Krug, R.M., Montelione, G.T.: Structural basis for the sequence-specific recognition of human ISG15 by the NS1 protein of influenza B virus. Proc. Natl. Acad. Sci. 108(33), 13468–13473 (2011)
Haas, T.L., Emmerich, C.H., Gerlach, B., Schmukle, A.C., Cordier, S.M., Rieser, E., Feltham, R., Vince, J., Warnken, U., Wenger, T., et al.: Recruitment of the linear ubiquitin chain assembly complex stabilizes the TNF-R1 signaling complex and is required for TNF-mediated gene induction. Mol. Cell 36(5), 831–844 (2009)
Huang, L., Chiang, D.: Better k-best parsing. In: Proceedings of the Ninth International Workshop on Parsing Technology, pp. 53–64. Association for Computational Linguistics (2005)
Inn, K.S., Gack, M.U., Tokunaga, F., Shi, M., Wong, L.Y., Iwai, K., Jung, J.U.: Linear ubiquitin assembly complex negatively regulates RIG-I-and TRIM25-mediated type I interferon induction. Mol. cell 41(3), 354–365 (2011)
Kanehisa, M., Goto, S., Sato, Y., Kawashima, M., Furumichi, M., Tanabe, M.: Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res. 42(D1), D199–D205 (2014)
Koschmieder, A., Leser, U.: Regular path queries on large graphs. In: Ailamaki, A., Bowers, S. (eds.) SSDBM 2012. LNCS, vol. 7338, pp. 177–194. Springer, Heidelberg (2012)
Lacroix, V., Fernandes, C.G., Sagot, M.F.: Motif search in graphs: application to metabolic networks. IEEE/ACM Trans. Comput. Biol. Bioinform. 3(4), 360–368 (2006)
Lange, M., Leiß, H.: To CNF or not to CNF? An efficient yet presentable version of the CYK algorithm. Informatica Didactica 8, 2008–2010 (2009)
Martens, W., Niehren, J.: Minimizing tree automata for unranked trees. In: Bierman, G., Koch, C. (eds.) DBPL 2005. LNCS, vol. 3774, pp. 232–246. Springer, Heidelberg (2005)
Mendelzon, A.O., Wood, P.T.: Finding regular simple paths in graph databases. SIAM J. Comput. 24(6), 1235–1258 (1995)
Patro, R., Kingsford, C.: Predicting protein interactions via parsimonious network history inference. Bioinformatics 29(13), i237–i246 (2013)
Pinter, R.Y., Rokhlenko, O., Yeger-Lotem, E., Ziv-Ukelson, M.: Alignment of metabolic pathways. Bioinformatics 21(16), 3401–3408 (2005)
Ponty, Y., Saule, C.: A combinatorial framework for designing (pseudoknotted) RNA algorithms. In: Przytycka, T.M., Sagot, M.-F. (eds.) WABI 2011. LNCS, vol. 6833, pp. 250–269. Springer, Heidelberg (2011)
Scott, J., Ideker, T., Karp, R.M., Sharan, R.: Efficient algorithms for detecting signaling pathways in protein interaction networks. J. Comput. Biol. 13(2), 133–144 (2006)
Sevon, P., Eronen, L.: Subgraph queries by context-free grammars. J. Integr. Bioinform. 5(2), 100 (2008)
Shapira, S.D., Gat-Viks, I., Shum, B.O., Dricot, A., de Grace, M.M., Wu, L., Gupta, P.B., Hao, T., Silver, S.J., Root, D.E., et al.: A physical and regulatory map of host-influenza interactions reveals pathways in H1N1 infection. Cell 139(7), 1255–1267 (2009)
Shlomi, T., Segal, D., Ruppin, E., Sharan, R.: Qpath: a method for querying pathways in a protein-protein interaction network. BMC Bioinform. 7(1), 199 (2006)
Verhelst, J., Parthoens, E., Schepens, B., Fiers, W., Saelens, X.: Interferon-inducible protein MX1 inhibits influenza virus by interfering with functional viral ribonucleoprotein complex assembly. J. Virol. 86(24), 13445–13455 (2012)
Zhao, C., Denison, C., Huibregtse, J.M., Gygi, S., Krug, R.M.: Human ISG15 conjugation targets both ifn-induced and constitutively expressed proteins functioning in diverse cellular pathways. Proc. Natl. Acad. Sci. USA 102(29), 10200–10205 (2005)
Acknowledgments
We thank the anonymous WABI referees for their many helpful comments.The work of Ilan Smoly, Amir Carmel and Michal Ziv-Ukelson was partially supported by the Frankel Center for Computer Science at Ben Gurion University of the Negev and by the Israel Science Foundation (ISF 179/14).
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Smoly, I., Carmel, A., Shemer-Avni, Y., Yeger-Lotem, E., Ziv-Ukelson, M. (2015). Algorithms for Regular Tree Grammar Network Search and Their Application to Mining Human-Viral Infection Patterns. In: Pop, M., Touzet, H. (eds) Algorithms in Bioinformatics. WABI 2015. Lecture Notes in Computer Science(), vol 9289. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-48221-6_4
Download citation
DOI: https://doi.org/10.1007/978-3-662-48221-6_4
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-48220-9
Online ISBN: 978-3-662-48221-6
eBook Packages: Computer ScienceComputer Science (R0)