Skip to main content

Graph-Based Multi-Document Summarization: An Initial Investigation

  • Conference paper
  • First Online:
Research and Development in Intelligent Systems XXXII (SGAI 2015)

Abstract

With the explosion of information on the Internet, manual summarization has become an unrealistic solution, and the need for automatic summarization–more specifically, multi-document summarization–has grown. Automatic summarization provides the most significant and relevant information while saving time and effort. This paper proposes a graph-based multi-document summarization model to produce extractive multi-document summaries. The model formulates the summarization problem as a capacitated profitable tour problem to optimize the coverage and coherence of the resulting summary. The solution is approximated using swarm intelligence meta-heuristics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.nlpir.nist.gov/projects/duc/index.html.

References

  1. Alguliev, R.M., Aliguliyev, R.M., Isazade, N.R.: Formulation of document summarization as a 0–1 nonlinear programming problem. Comput. Ind. Eng. 64(1), 94–102 (2013)

    Article  Google Scholar 

  2. Archetti, C., Feillet, D., Hertz, A., Speranza, M.G.: The capacitated team orienteering and profitable tour problems. J. Oper. Res. Soc. 60(6), 831–842 (2009)

    Article  MATH  Google Scholar 

  3. Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30(1–7), 107–117 (1998)

    Article  Google Scholar 

  4. Conroy, J.M., Schlesinger, J.D., Oleary, D.P., Goldstein, J.: Back to basics: classy 2006. In: Proceedings of 2006 Document Understanding Conference (2006)

    Google Scholar 

  5. Huang, L., He, Y., Wei, F., Li, W.: Modeling document summarization as multi-objective optimization. In: Intelligent Information Technology and Security Informatics (IITSI), 2010 Third International Symposium on, pp. 382–386, April (2010)

    Google Scholar 

  6. Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, 2nd edn. Prentice-Hall Inc., Upper Saddle River (2009)

    Google Scholar 

  7. Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM 46(5), 604–632 (1999)

    Article  MathSciNet  MATH  Google Scholar 

  8. Li, P., Deng, G., Zhu, Q.: Using context inference to improve sentence ordering for multi-document summarization. In: Proceedings of 5th International Joint Conference on Natural Language Processing, pp. 1055–1061, Chiang Mai, Thailand, November 2011. Asian Federation of Natural Language Processing (2011)

    Google Scholar 

  9. Lin, H., Bilmes, J.: A class of submodular functions for document summarization. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT’11, vol. 1, pp. 510–520, Stroudsburg. Association for Computational Linguistics (2011)

    Google Scholar 

  10. McDonald, R.: A study of global inference algorithms in multi-document summarization. In: Proceedings of the 29th European Conference on IR Research, ECIR’07, pp. 557–564. Springer, Berlin (2007)

    Google Scholar 

  11. Nenkova, A., McKeown, K.: A survey of text summarization techniques. In: Aggarwal, C.C., Zhai, C. (eds.) Mining Text Data, pp. 43–76. Springer, New York (2012)

    Chapter  Google Scholar 

  12. Nishikawa, H., Hasegawa, T., Matsuo, Y., Kikui, G.: Opinion summarization with integer linear programming formulation for sentence extraction and ordering. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters, COLING’10, pp. 910–918, Stroudsburg. Association for Computational Linguistics (2010)

    Google Scholar 

  13. Nishino, M., Yasuda, N., Hirao, T., Suzuki, J., Nagata, M.: Text summarization while maximizing multiple objectives with lagrangian relaxation. In: Serdyukov, P., Braslavski, P., Kuznetsov, S., Kamps, J., Rger, S., Agichtein, E., Segalovich, I., Yilmaz, E. (eds.) Advances in Information Retrieval. Lecture Notes in Computer Science, vol. 7814, pp. 772–775. Springer, Berlin (2013)

    Google Scholar 

  14. Shen, C., Li, T.: Multi-document summarization via the minimum dominating set. In: Proceedings of the 23rd International Conference on Computational Linguistics, COLING’10, pp. 984–992, Stroudsburg. Association for Computational Linguistics (2010)

    Google Scholar 

  15. Takamura, H., Okumura, M.: Text summarization model based on maximum coverage problem and its variant. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL’09, pp. 781–789, Stroudsburg. Association for Computational Linguistics (2009)

    Google Scholar 

  16. Wan, X., Yang, J., Xiao, J.: Towards an iterative reinforcement approach for simultaneous document summarization and keyword extraction. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp. 552–559, Prague, June 2007. Association for Computational Linguistics (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Asma Bader Al-Saleh .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Al-Saleh, A.B., Menai, M.E.B. (2015). Graph-Based Multi-Document Summarization: An Initial Investigation. In: Bramer, M., Petridis, M. (eds) Research and Development in Intelligent Systems XXXII. SGAI 2015. Springer, Cham. https://doi.org/10.1007/978-3-319-25032-8_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25032-8_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25030-4

  • Online ISBN: 978-3-319-25032-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics