Skip to main content

Learning Bayesian Network Using Parse Trees for Extraction of Protein-Protein Interaction

  • Conference paper
Computational Linguistics and Intelligent Text Processing (CICLing 2013)

Abstract

Extraction of protein-protein interactions from scientific papers is a relevant task in the biomedical field. Machine learning-based methods such as kernel-based represent the state-of-the-art in this task. Many efforts have focused on obtaining new types of kernels in order to employ syntactic information, such as parse trees, to extract interactions from sentences. These methods have reached the best performances on this task. Nevertheless, parse trees were not exploited by other machine learning-based methods such as Bayesian networks. The advantage of using Bayesian networks is that we can exploit the structure of the parse trees to learn the Bayesian network structure, i.e., the parse trees provide the random variables and also possible relations among them. Here we use syntactic relation as a causal dependence between variables. Hence, our proposed method learns a Bayesian network from parse trees. The evaluation was carried out over five protein-protein interaction benchmark corpora. Results show that our method is competitive in comparison with state-of-the-art methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Chowdhary, R., Zhang, J., Liu, J.S.: Bayesian inference of protein-protein interactions from biological literature. Bioinformatics 25(12), 1536–1542 (2009)

    Article  Google Scholar 

  2. Airola, A., Pyysalo, S., Björne, J., Pahikkala, T., Ginter, F., Salakoski, T.: A graph kernel for protein-protein interaction extraction. In: Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing. BioNLP 2008, Stroudsburg, PA, USA, pp. 1–9. Association for Computational Linguistics (2008)

    Google Scholar 

  3. Choi, S.P., Myaeng, S.H.: Simplicity is better: revisiting single kernel ppi extraction. In: Proceedings of the 23rd International Conference on Computational Linguistics, COLING 2010, Stroudsburg, PA, USA, pp. 206–214. Association for Computational Linguistics (2010)

    Google Scholar 

  4. Rosario, B., Hearst, M.A.: Multi-way relation classification: Application to protein-protein interactions. In: Proceedings of HLTNAAC05 (2002)

    Google Scholar 

  5. Koller, D., Friedman, N.: Probabilistic Graphical Models: Principles and Techniques. MIT Press (2009)

    Google Scholar 

  6. Bui, Q.C.C., Katrenko, S., Sloot, P.M.: A hybrid approach to extract protein-protein interactions. Bioinformatics 27(2), 259–265 (2011)

    Article  Google Scholar 

  7. Miwa, M., Saetre, R., Miyao, Y., Tsujii, J.: Entity-focused sentence simplification for relation extraction. In: Proceedings of the 23rd International Conference on Computational Linguistics, COLING 2010, Stroudsburg, PA, USA, pp. 788–796. Association for Computational Linguistics (2010)

    Google Scholar 

  8. Leitner, F., Mardis, S.A., Krallinger, M., Cesareni, G., Hirschman, L.A., Valencia, A.: An overview of biocreative ii.5. IEEE/ACM Trans. Comput. Biol. Bioinformatics 7(3), 385–399 (2010)

    Article  Google Scholar 

  9. Pyysalo, S., Airola, A., Heimonen, J., Bjorne, J., Ginter, F., Salakoski, T.: Comparative analysis of five protein-protein interaction corpora. BMC Bioinformatics 9, S6 (2008)

    Google Scholar 

  10. Miwa, M., Saetre, R., Miyao, Y., Tsujii, J.: A rich feature vector for protein-protein interaction extraction from multiple corpora. In: Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, EMNLP 2009, Stroudsburg, PA, USA, vol. 1, pp. 121–130. Association for Computational Linguistics (2009)

    Google Scholar 

  11. Miwa, M., Saetre, R., Miyao, Y., Tsujii, J.: Protein-protein interaction extraction by leveraging multiple kernels and parsers. International Journal of Medical Informatics 78(12), e39–e46 (2009)

    Google Scholar 

  12. Liu, B., Qian, L., Wang, H., Zhou, G.: Dependency-driven feature-based learning for extracting protein-protein interactions from biomedical text. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters. COLING 2010, Stroudsburg, PA, USA, pp. 757–765. Association for Computational Linguistics (2010)

    Google Scholar 

  13. Kim, S., Yoon, J., Yang, J.: Kernel approaches for genic interaction extraction. Bioinformatics 24(1), 118–126 (2008)

    Article  Google Scholar 

  14. Rinaldi, F., Schneider, G., Kaljurand, K., Clematide, S., Vachon, T., Romacker, M.: Ontogene in biocreative ii.5. IEEE/ACM Trans. Comput. Biol. Bioinformatics 7(3), 472–480 (2010)

    Article  Google Scholar 

  15. Cooper, G.F., Herskovits, E.: A bayesian method for the induction of probabilistic networks from data. Mach. Learn. 9, 309–347 (1992)

    MATH  Google Scholar 

  16. Zhang, M., Zhou, G., Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods. Inf. Process. Manage. 44(2), 687–701 (2008)

    Article  Google Scholar 

  17. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann Series in Data Management Systems. Morgan Kaufmann Publishers Inc., San Francisco (2005)

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shiguihara-Juárez, P.N., de Andrade Lopes, A. (2013). Learning Bayesian Network Using Parse Trees for Extraction of Protein-Protein Interaction. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7817. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37256-8_29

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-37256-8_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-37255-1

  • Online ISBN: 978-3-642-37256-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics