Encyclopedia of Database Systems

2018 Edition
| Editors: Ling Liu, M. Tamer Özsu

Text Summarization

  • Dou Shen
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_424

Synonyms

Document summarization

Definition

Text summarization is the process of distilling the most important information from a text to produce an abridged version for a particular task and user [9].

Historical Background

With more and more digitalized text being available, especially with the development of the Internet, people are being overwhelmed with data. How to help people effectively and efficiently capture the information from the data becomes extremely important. Many techniques have been proposed for this goal and text summarization is one of them.

Text summarization in some form has been in existence since the 1950s [8]. Two main influences have dominated the research in this area, as summarized by Mani in [10]. Work in library science, office automation, and information retrieval has resulted in a focus on methods for producing extracts from scientific papers, including the use of “shallow” linguistic analysis and the use of term statistics. The other influence has been...

This is a preview of subscription content, log in to check access.

Recommended Reading

  1. 1.
    Berry MW, Dumais ST, O’Brien GW. Using linear algebra for intelligent information retrieval. SIAM Rev. 1995;37(4):573–95.MathSciNetzbMATHCrossRefGoogle Scholar
  2. 2.
    Brandowa R, Mitzeb K, Rauc LF. Automatic condensation of electronic publications by sentence selection. Inform Process Manage. 1995;41(6):675–85.CrossRefGoogle Scholar
  3. 3.
    Conroy JM, O’leary DP. Text summarization via hidden markov models. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2001. p. 406–7.Google Scholar
  4. 4.
    Goldstein J, Kantrowitz M, Mittal V, Carbonell J. Summarizing text documents: sentence selection and evaluation metrics. In: Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1999. p. 121–8.Google Scholar
  5. 5.
    Gong Y, Liu X. Generic text summarization using relevance measure and latent semantic analysis. In: Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2001. p. 19–25.Google Scholar
  6. 6.
    Kupiec J, Pedersen J, Chen F. A trainable document summarizer. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 1995. p. 68–73.Google Scholar
  7. 7.
    Lin C-Y, Hovy E. Automatic evaluation of summaries using n-gram co-occurrence statistics. In: Proceedings of the Human Language Technology Conference of the North American Chapter of Association Computational Linguistics; 2003. p. 71–8.Google Scholar
  8. 8.
    Luhn HP. The automatic creation of literature abstracts. IBM J Res Dev. 1958;2(2):159–65.MathSciNetCrossRefGoogle Scholar
  9. 9.
    Mani I. Advances in automatic text summarization. Cambridge: MIT; 1999.Google Scholar
  10. 10.
    Mani I. Recent developments in text summarization. In: Proceedings of the 10th International Conference on Information and Knowledge Management; 2001. p. 529–31.Google Scholar
  11. 11.
    Marcu D. From discourse structures to text summaries. In: Proceedings of the ACL Workshop on Intelligent Scalable Text Summarization; 1997. p. 82–8.Google Scholar
  12. 12.
    Mihalcea R. Language independent extractive summarization. In: Proceedings of the 20th National Conference on AI and 17th Innovative Applications of AI Conference; 2005. p. 1688–9.Google Scholar
  13. 13.
    Shen D, Chen Z, Yang Q, Zeng H-J, Zhang B, Lu Y, Ma W-Y. Web-page classification through summarization. In: Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval; 2004. p. 242–9.Google Scholar
  14. 14.
    Shen D, Sun J-T, Li H, Yang Q, Chen Z. Document summarization using conditional random fields. In: Proceedings of the 20th International Joint Conference on AI; 2007. p. 2862–7.Google Scholar
  15. 15.
    Teufel S, Moens M. Sentence extraction as a classification task. In: Proceedings of the ACL Workshop on Intelligent Text Summarization; 1997. p. 58–65.Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Microsoft CorporationRedmondUSA
  2. 2.Baidu, Inc.Beijing CityChina

Section editors and affiliations

  • Zheng Chen
    • 1
  1. 1.Microsoft Research AsiaMicrosoft CorporationBeijingChina