Summarizing Citation Contexts of Scientific Publications

Mitrović, Sandra; Müller, Henning

doi:10.1007/978-3-319-24027-5_13

Sandra Mitrović²¹ &
Henning Müller²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9283))

Included in the following conference series:

International Conference of the Cross-Language Evaluation Forum for European Languages

1849 Accesses
2 Citations

Abstract

As the number of publications is increasing rapidly, it becomes increasingly difficult for researchers to find existing scientific papers most relevant for their work, even when the domain is limited. To overcome this, it is common to use paper summarization techniques in specific domains. In difference to approaches that exploit the paper content itself, in this paper we perform summarization of the citation context of a paper. For this, we adjust and apply existing summarization techniques and we come up with a hybrid method, based on clustering and latent semantic analysis. We apply this on medical informatics publications and compare performance of methods that outscore other techniques on a standard database. Summarization of the citation context can be complementary to full text summarization, particularly to find candidate papers. The reached performance seems good for routine use even though it was only tested on a small database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aliguliyev, R.: A new sentence similarity measure and sentence based extractive technique for automatic text summarization. Expert Systems with Applications 36(4), 7764–7772 (2009)
Article Google Scholar
Bergmark, D.: Automatic extraction of reference linking information from online documents. Tech. rep., Cornell University, Ithaca, NY, USA (2000)
Google Scholar
Bird, S.: NLTK: the natural language toolkit. In: Proceedings of the Coling/ACL on Interactive Presentation Sessions, pp. 69–72, Stroudsburg, PA, USA (2006)
Google Scholar
Bradshaw, S.: Reference directed indexing: redeeming relevance for subject search in citation indexes. In: Koch, T., Sølvberg, I.T. (eds.) ECDL 2003. LNCS, vol. 2769, pp. 499–510. Springer, Heidelberg (2003)
Chapter Google Scholar
Conroy, J., O’Leary, D.: Text summarization via Hidden Markov models. In: Proceedings of the 24th Annual International ACM SIGIR Conference, pp. 406–407, New York, NY, USA (2001)
Google Scholar
Elkiss, A., Shen, S., Fader, A., Erkan, G., States, D., Radev, D.: Blind men and elephants: What do citation summaries tell us about a research article? Journal of the American Society Information Science and Technology 59(1), 51–62 (2008)
Article Google Scholar
Frey, B., Dueck, D.: Clustering by passing messages between data points. Science 315(5814), 972–976 (2007)
Article MathSciNet MATH Google Scholar
Jing, H., Barzilay, R., McKeown, K., Elhadad, M.: Summarization evaluation methods: experiments and analysis. In: AAAI Symposium on Intelligent Summarization, pp. 51–59 (1998)
Google Scholar
Jurafsky, D., Martin, J.: Speech & Language Processing. Pearson Education India (2000)
Google Scholar
Lin, C.Y.: Rouge: a package for automatic evaluation of summaries. In: Text Summarization Branches Out: Proceedings of the ACL-04 Workshop, pp. 74–81 (2004)
Google Scholar
Miller, G.: Wordnet: A lexical database for english. Communications of the ACM 38(11), 39–41 (1995)
Article Google Scholar
Haddou ou Moussa, K., Mayr, P.: Automatische referenzextraktion mit parscit. In: Social Media and Web Science - Das Web als Lebensraum, DGI, pp. 425–428 (2012)
Google Scholar
Nenkova, A., Passonneau, R.: Evaluating content selection in summarization: the pyramid method. In: Proceedings of Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 145–152 (2004)
Google Scholar
Niwa, Y., Nitta, Y.: Co-occurrence vectors from corpora vs. distance vectors from dictionaries. In: Proceedings of the 15th Conference on Computational Linguistics, COLING ’94, vol. 1, pp. 304–309 (1994)
Google Scholar
Ozsoy, M.G., Cicekli, I., Alpaslan, F.N.: Text summarization of turkish texts using latent semantic analysis. In: Huang, C.R., Jurafsky, D. (eds.) Proceedings of the 23rd International Conference on Computational Linguistics, pp. 869–876. Tsinghua University Press (2010)
Google Scholar
Pei-Ying, Z., Cun-He, L.: Automatic text summarization based on sentences clustering and extraction. In: Proceedings of 2nd IEEE International Conference on the Computer Science and Information Technology, pp. 167–170. IEEE (2009)
Google Scholar
Qazvinian, V., Radev, D.: Scientific paper summarization using citation summarynetworks. In: Proceedings of the 22nd International Conference on Computational Linguistics, vol. 1, pp. 689–696 (2008)
Google Scholar
Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, vol. 1, pp. 448–453 (1995)
Google Scholar
Ritchie, A., Robertson, S., Teufel, S.: Comparing citation contexts for information retrieval. In: Proceedings of the 17th ACM Conference on Information and Knowledge Management. CIKM ’08, pp. 213–222. ACM, New York (2008)
Google Scholar
Saad, S.M., Kamarudin, S.S.: Comparative analysis of similarity measures for sentence level semantic measurement of text. In: IEEE International Conference on Control System, Computing and Engineering, pp. 90–94. IEEE (2013)
Google Scholar
Steinberger, J., Ježek, K.: Using latent semantic analysis in text summarization and summary evaluation. In: Proceedings of Industrial Management, ISIM ’04, pp. 93–100 (2004)
Google Scholar
Svore, K.M., Vanderwende, L., Burges, C.: Enhancing single-document summarization by combining ranknet and third-party sources. In: Proceedings of Conference on Empirical Methods on Natural Language Processing and Computational Natural Language Learning, pp. 448–457 (2007)
Google Scholar
Torres-Moreno, J.M., Saggion, H., da Cunha, I., SanJuan, E.: Summary Evaluation With and Without References. Polibits: Research Journal on Computer Science and Computer Engineering with Applications 42, 13–19 (2010)
Article Google Scholar
Zechner, K.: Fast generation of abstracts from general domain text corpora by extracting relevant sentences. In: Proceedings of the 16th Conference on Computational Linguistics, vol. 2, pp. 986–989 (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Applied Sciences Western Switzerland (HES–SO), Sierre, Switzerland
Sandra Mitrović & Henning Müller

Authors

Sandra Mitrović
View author publications
You can also search for this author in PubMed Google Scholar
Henning Müller
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sandra Mitrović .

Editor information

Editors and Affiliations

Institut de Recherche en Informatique de Toulouse, Toulouse , France
Josanne Mothe
Department of Computer Science, University of Neuchatel, Neuchâtel, Switzerland
Jacques Savoy
Faculteit der Geesteswetenschappen, Universiteit Amsterdam, Amsterdam, The Netherlands
Jaap Kamps
Institut de Recherche en Informatique de Toulouse, Toulouse, France
Karen Pinel-Sauvagnat
School of Computing, Dublin City University, Dublin, Ireland
Gareth Jones
LIA - CERI, Université d'Avignon et des Pays de Vaucluse, Avignon, France
Eric San Juan
Department of Information Engineering, University of Padua, Padua, Italy
Linda Capellato
of Information Engineering (DEI), University of Padua, Department, Padova, Italy
Nicola Ferro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mitrović, S., Müller, H. (2015). Summarizing Citation Contexts of Scientific Publications. In: Mothe, J., et al. Experimental IR Meets Multilinguality, Multimodality, and Interaction. CLEF 2015. Lecture Notes in Computer Science(), vol 9283. Springer, Cham. https://doi.org/10.1007/978-3-319-24027-5_13

Download citation

DOI: https://doi.org/10.1007/978-3-319-24027-5_13
Published: 20 November 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24026-8
Online ISBN: 978-3-319-24027-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics