From Literature to Knowledge: Exploiting PubMed to Answer Biomedical Questions in Natural Language

Bhaskar, Pinaki; Buzzi, Marina; Geraci, Filippo; Pellegrini, Marco

doi:10.1007/978-3-319-22741-2_1

Pinaki Bhaskar¹⁷,
Marina Buzzi¹⁷,
Filippo Geraci¹⁷ &
…
Marco Pellegrini¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9267))

Included in the following conference series:

International Conference on Information Technology in Bio- and Medical Informatics

566 Accesses
2 Citations

Abstract

Researchers, practitioners and the general public strive to be constantly up to date with the latest developments in the subjects of bio-medical research of their interest. Meanwhile the collection of high quality research papers freely available on the Web has increase dramatically in the last few years and this trend is likely to continue. This state of facts brings about opportunities as well as challenges for the construction of effective web-based searching tools. Question/Answering systems based on user interactions in Natural Language have emerged as a promising alternative to traditional keyword based search engines. However this technology still needs to mature in order to fulfill its promises. In this paper we present and test a new graph-based proof-of-concept paradigm for processing the knowledge base and the user queries expressed in natural Language. The user query is mapped as a subgraph matching problem onto the internal graph representation, and thus can handle efficiently also partial matches. Preliminary user-based output quality measurements confirm the viability of our method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 34.99; Price excludes VAT (USA)

Softcover Book: USD 44.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

Allam, A., Schulz, P., Nakamoto, K.: The impact of search engine selection and sorting criteria on vaccination beliefs and attitudes: two experiments manipulating google output. J. Med. Internet Res. 16(4), e100 (2014)
Article Google Scholar
Athenikos, S.J., Han, H.: Biomedical question answering: a survey. Comput. Meth. Prog. biomed. 99(1), 1–24 (2010)
Article Google Scholar
Bauer, M., Berleant, D.: Usability survey of biomedical question answering systems. Hum. Genomics 6(1), 17 (2012)
Article Google Scholar
Bleik, S., Mishra, M., Huan, J., Song, M.: Text categorization of biomedical data sets using graph kernels and a controlled vocabulary. IEEE/ACM Trans. Comput. Biol. Bioinf. 10(5), 1211–1217 (2013)
Article Google Scholar
Can, A.B., Baykal, N.: Medicoport: a medical search engine for all. Comput. Meth. Programs Biomed. 86(1), 73–86 (2007)
Article MATH Google Scholar
Cao, Y., Liu, F., Simpson, P., Antieau, L., Bennett, A., Cimino, J.J., Ely, J., Yu, H.: Askhermes: an online question answering system for complex clinical questions. J. Biomed. Inf. 44(2), 277–288 (2011)
Article MATH Google Scholar
Celi, L.A., Zimolzak, A.J., Stone, D.J.: Dynamic clinical data mining: search engine-based decision support. JMIR Med. Inf. 2(1), e13 (2014)
Article Google Scholar
Cohen, A.M., Hersh, W.R.: A survey of current work in biomedical text mining. Briefings Bioinf. 6(1), 57–71 (2005)
Article Google Scholar
Cruchet, S., Gaudinat, A., Boyer, C.: Supervised approach to recognize question type in a QA system for health. Stud. Health Technol. Inf. 136, 407–412 (2008)
Google Scholar
Gobeill, J., Patsche, E., Theodoro, D., Veuthey, A.L., Lovis, C., Ruch, P.: Question answering for biology and medicine. In: 9th International Conference on Information Technology and Applications in Biomedicine, 2009. ITAB 2009, pp. 1–5, November 2009
Google Scholar
Gori, M., Maggini, M., Sarti, L.: Exact and approximate graph matching using random walks. IEEE Trans. Pattern Anal. Mach. Intell. 27(7), 1100–1111 (2005)
Article Google Scholar
Güting, R.H.: GraphDB: modeling and querying graphs in databases. In: VLDB, vol. 94, pp. 12–15. Citeseer (1994)
Google Scholar
Hristovski, D., Dinevski, D., Kastrin, A., Rindflesch, T.C.: Biomedical question answering using semantic relations. BMC Bioinf. 16(1), 16 (2015)
Article Google Scholar
Kaplan, I.L., Abdulla, G.M., Brugger, S.T., Kohn, S.R.: Implementing graph pattern queries on a relational database. Technical report, Lammerce Livermore National Laboratory (2008)
Google Scholar
Kipper, K., Korhonen, A., Ryant, N., Palmer, M.: Extending verbnet with novel verb classes. In: Proceedings of LREC, vol. 2006, p. 1. Citeseer (2006)
Google Scholar
Kolomiyets, O., Moens, M.F.: A survey on question answering technology from an information retrieval perspective. Inf. Sci. 181(24), 5412–5434 (2011)
Article MathSciNet Google Scholar
Luo, G., Tang, C., Yang, H., Wei, X.: Medsearch: a specialized search engine for medical information retrieval. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management, pp. 143–152. ACM (2008)
Google Scholar
Nielsen, J.: Designing Web Usability: The Practice of Simplicity. New Riders Publishing, Thousand Oaks (1999)
Google Scholar
Peñas, A., Forner, P., Sutcliffe, R., Rodrigo, Á., Forăscu, C., Alegria, I., Giampiccolo, D., Moreau, N., Osenova, P.: Overview of respubliQA 2009: question answering evaluation over european legislation. In: Peters, C., Di Nunzio, G.M., Kurimo, M., Mandl, T., Mostefa, D., Peñas, A., Roda, G. (eds.) CLEF 2009. LNCS, vol. 6241, pp. 174–196. Springer, Heidelberg (2010)
Google Scholar
Regev, Y., Finkelstein-Landau, M., Feldman, R., Gorodetsky, M., Zheng, X., Levy, S., Charlab, R., Lawrence, C., Lippert, R.A., Zhang, Q., Shatkay, H.: Rule-based extraction of experimental evidence in the biomedical domain: the KDD cup 2002 (task 1). SIGKDD Explor. Newsl. 4(2), 90–92 (2002)
Article Google Scholar
Sackett, D.L., Rosenberg, W.M.C., Gray, J.A.M., Haynes, R.B., Richardson, W.S.: Evidence based medicine: what it is and what it isn’t. BMJ 312(7023), 71–72 (1996)
Article Google Scholar
Simpson, M.S., Demner-Fushman, D.: Biomedical text mining: a survey of recent progress. In: Aggarwal, C.C., Zhai, C. (eds.) Mining Text Data, pp. 465–517. Springer, New York (2012)
Chapter Google Scholar
Soldaini, L., Cohan, A., Yates, A., Goharian, N., Frieder, O.: Retrieving medical literature for clinical decision support. In: Hanbury, A., Kazai, G., Rauber, A., Fuhr, N. (eds.) ECIR 2015. LNCS, vol. 9022, pp. 538–549. Springer, Heidelberg (2015)
Google Scholar
Voorhees, E.M., Tice, D.M.: Building a question answering test collection. In: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2000, pp. 200–207. ACM, New York (2000)
Google Scholar
Voorhees, E.M., et al.: The TREC-8 question answering track report. In: TREC. vol. 99, pp. 77–82 (1999)
Google Scholar
Wang, L., Wang, J., Wang, M., Li, Y., Liang, Y., Xu, D.: Using internet search engines to obtain medical information: a comparative study. J. Med. Internet Res. 14(3), e74 (2012)
Article Google Scholar
Zweigenbaum, P., Demner-Fushman, D., Yu, H., Cohen, K.B.: Frontiers of biomedical text mining: current progress. Briefings in Bioinf. 8(5), 358–375 (2007)
Article MATH Google Scholar

Download references

Acknowledgments

We acknowledge the support of the Italian Registry of ccTLD “.it” and the ERCIM ‘Alain Bensoussan’ Fellowship Programme.

Author information

Authors and Affiliations

CNR, Institute for Informatics and Telematics, Via G. Moruzzi 1, Pisa, Italy
Pinaki Bhaskar, Marina Buzzi, Filippo Geraci & Marco Pellegrini

Authors

Pinaki Bhaskar
View author publications
You can also search for this author in PubMed Google Scholar
Marina Buzzi
View author publications
You can also search for this author in PubMed Google Scholar
Filippo Geraci
View author publications
You can also search for this author in PubMed Google Scholar
Marco Pellegrini
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Filippo Geraci .

Editor information

Editors and Affiliations

Institute of Informatics and Telematics, Pisa, Italy
M. Elena Renda
Czech Technical University in Prague, Prague, Czech Republic
Miroslav Bursa
Medical University Graz, Graz, Austria
Andreas Holzinger
San Jose State University, San Jose, California, USA
Sami Khuri

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bhaskar, P., Buzzi, M., Geraci, F., Pellegrini, M. (2015). From Literature to Knowledge: Exploiting PubMed to Answer Biomedical Questions in Natural Language. In: Renda, M., Bursa, M., Holzinger, A., Khuri, S. (eds) Information Technology in Bio- and Medical Informatics. ITBAM 2015. Lecture Notes in Computer Science(), vol 9267. Springer, Cham. https://doi.org/10.1007/978-3-319-22741-2_1

Download citation

DOI: https://doi.org/10.1007/978-3-319-22741-2_1
Published: 11 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-22740-5
Online ISBN: 978-3-319-22741-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics