Abstract
With the explosion of information on the Internet, manual summarization has become an unrealistic solution, and the need for automatic summarization–more specifically, multi-document summarization–has grown. Automatic summarization provides the most significant and relevant information while saving time and effort. This paper proposes a graph-based multi-document summarization model to produce extractive multi-document summaries. The model formulates the summarization problem as a capacitated profitable tour problem to optimize the coverage and coherence of the resulting summary. The solution is approximated using swarm intelligence meta-heuristics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alguliev, R.M., Aliguliyev, R.M., Isazade, N.R.: Formulation of document summarization as a 0–1 nonlinear programming problem. Comput. Ind. Eng. 64(1), 94–102 (2013)
Archetti, C., Feillet, D., Hertz, A., Speranza, M.G.: The capacitated team orienteering and profitable tour problems. J. Oper. Res. Soc. 60(6), 831–842 (2009)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. Comput. Netw. ISDN Syst. 30(1–7), 107–117 (1998)
Conroy, J.M., Schlesinger, J.D., Oleary, D.P., Goldstein, J.: Back to basics: classy 2006. In: Proceedings of 2006 Document Understanding Conference (2006)
Huang, L., He, Y., Wei, F., Li, W.: Modeling document summarization as multi-objective optimization. In: Intelligent Information Technology and Security Informatics (IITSI), 2010 Third International Symposium on, pp. 382–386, April (2010)
Jurafsky, D., Martin, J.H.: Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, 2nd edn. Prentice-Hall Inc., Upper Saddle River (2009)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM 46(5), 604–632 (1999)
Li, P., Deng, G., Zhu, Q.: Using context inference to improve sentence ordering for multi-document summarization. In: Proceedings of 5th International Joint Conference on Natural Language Processing, pp. 1055–1061, Chiang Mai, Thailand, November 2011. Asian Federation of Natural Language Processing (2011)
Lin, H., Bilmes, J.: A class of submodular functions for document summarization. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, HLT’11, vol. 1, pp. 510–520, Stroudsburg. Association for Computational Linguistics (2011)
McDonald, R.: A study of global inference algorithms in multi-document summarization. In: Proceedings of the 29th European Conference on IR Research, ECIR’07, pp. 557–564. Springer, Berlin (2007)
Nenkova, A., McKeown, K.: A survey of text summarization techniques. In: Aggarwal, C.C., Zhai, C. (eds.) Mining Text Data, pp. 43–76. Springer, New York (2012)
Nishikawa, H., Hasegawa, T., Matsuo, Y., Kikui, G.: Opinion summarization with integer linear programming formulation for sentence extraction and ordering. In: Proceedings of the 23rd International Conference on Computational Linguistics: Posters, COLING’10, pp. 910–918, Stroudsburg. Association for Computational Linguistics (2010)
Nishino, M., Yasuda, N., Hirao, T., Suzuki, J., Nagata, M.: Text summarization while maximizing multiple objectives with lagrangian relaxation. In: Serdyukov, P., Braslavski, P., Kuznetsov, S., Kamps, J., Rger, S., Agichtein, E., Segalovich, I., Yilmaz, E. (eds.) Advances in Information Retrieval. Lecture Notes in Computer Science, vol. 7814, pp. 772–775. Springer, Berlin (2013)
Shen, C., Li, T.: Multi-document summarization via the minimum dominating set. In: Proceedings of the 23rd International Conference on Computational Linguistics, COLING’10, pp. 984–992, Stroudsburg. Association for Computational Linguistics (2010)
Takamura, H., Okumura, M.: Text summarization model based on maximum coverage problem and its variant. In: Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL’09, pp. 781–789, Stroudsburg. Association for Computational Linguistics (2009)
Wan, X., Yang, J., Xiao, J.: Towards an iterative reinforcement approach for simultaneous document summarization and keyword extraction. In: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pp. 552–559, Prague, June 2007. Association for Computational Linguistics (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Al-Saleh, A.B., Menai, M.E.B. (2015). Graph-Based Multi-Document Summarization: An Initial Investigation. In: Bramer, M., Petridis, M. (eds) Research and Development in Intelligent Systems XXXII. SGAI 2015. Springer, Cham. https://doi.org/10.1007/978-3-319-25032-8_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-25032-8_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25030-4
Online ISBN: 978-3-319-25032-8
eBook Packages: Computer ScienceComputer Science (R0)