Abstract
Manifold ranking is one of the most competitive approaches for query-focused multi-document summarization. Despite its success for this task, it usually constructs a sentence affinity graph first based on inter-sentence content similarity, and then perform manifold ranking on the graph to score each sentence with the assumption that all the sentences live on a single manifold. Actually, for a document set to be summarized, the distribution of the sentences might form different, but related manifolds. This paper aims to generalize the basic manifold-ranking based approach to the more generic setting by introducing a novel affinity graph to estimate the similarity between sentences, which leverages both the local geometric structures and the contents of sentences jointly. Preliminary experimental results on the DUC datasets demonstrate the good effectiveness of the proposed approach.
Preview
Unable to display preview. Download preview PDF.
References
Wan, X.J., Yang, J.W., Xiao, J.G.: Manifold-ranking based topic-focused multi-document summarization. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007), pp. 2903–2908 (2007)
Wan, X.J., Xiao, J.G.: Graph-based multi-modality learning for topic-focused multi-document summarization. In: Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI 2009), pp. 1586–1591 (2009)
Saggion, H., Bontcheva, K., Cunningham, H.: Robust generic and query-based summarization. In: Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2003), pp. 2903–2908 (2003)
Radev, D.R., Jing, H.Y., Stys, M., Tam, D.: Centroid-Based Summarization of Multiple Documents. Information Processing and Management 40, 919–938 (2004)
Lin, C.Y., Hovy, E.: From single to multi-document summarization: a prototype system and its evaluation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics (ACL 2002), pp. 457–464 (2002)
Nenkova, A., Louis, A.: Can you summarize this? identifying correlates of input difficulty for generic multi-document summarization. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL 2008), pp. 825–833 (2008)
Celikyilmaz, A., Hakkani-Tur, D.: A hybrid hierarchical model for multi-document summarization. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), pp. 815–824 (2010)
Gillick, D., Favre, B.: A scalable global model for summarization. In: Proceedings of the Workshop on Integer Linear Programming for Natural Language Processing (ILP 2009), pp. 10–18 (2009)
Erkan, G., Radev, D.R.: LexRank: Graph-Based Centrality as Salience in Text Summarization. Journal of Artificial Intelligence Research 22, 457–479 (2004)
Mihalcea, R., Tarau, P.: TextRank-bringing order into text. In: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP 2004), pp. 404–411 (2004)
Wan, X.J., Yang, J.W.: Multi-document summarization using cluster-based link analysis. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 299–306 (2008)
Haveliwala, T.: Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search. IEEE Transactions on Knowledge and Data Engineering 15, 784–796 (2003)
Zhao, L., Wu, L.D., Huang, X.J.: Query Expansion in Graph-Based Approach for Query-Focused Multi-Document Summarization. Information Processing and Management 45, 35–41 (2009)
Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 1998), pp. 335–336 (1998)
Wang, D.D., Li, T., Zhu, S.H., Ding, C.: Multi-Document summarization via sentence-level semantic analysis and symmetric matrix factorization. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 307–314 (2008)
Tang, J., Yao, L.M., Chen, D.W.: Multi-topic based query-oriented summarization. In: Proceedings of the 9th SIAM International Conference on Data Mining (SDM 2009), pp. 1147–1158 (2009)
Wei, F., Li, W., Lu, Q., He, Y.: A cluster-sensitive graph model for query-oriented multi-document summarization. In: Plachouras, V., Macdonald, C., Ounis, I., White, R.W., Ruthven, I. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 446–453. Springer, Heidelberg (2008)
Tong, H.H., He, J.R., Li, M.J., Zhang, C.S., Ma, W.Y.: Graph based multi-modality learning. In: Proceedings of the 13th Annual ACM International Conference on Multimedia (MM 2005), pp. 862–871 (2005)
Zhou, D., Weston, J., Gretton, A., Bousquet, O., Scholkopf, B.: Ranking on Data Manifolds. Advances in Neural Information Processing Systems 16, 169–176 (2004)
Goldberg, A.B., Zhu, X.J., Singh, A., Xu, Z., Nowak, R.: Multi-Manifold Semi-Supervised Learning. Journal of Machine Learning Research 5, 169–176 (2009). Proceedings Track
Lin, C.Y., Hovy, E.: Automatic evaluation of summaries using N-gram cooccurrence statistics. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (NAACL 2003), pp. 71–78 (2003)
Cheng, X.Q., Du, P., Guo, J.F., Zhu, X.F., Chen, Y.X.: Ranking on Data Manifold with Sink Points. IEEE Transactions on Knowledge and Data Engineering 25, 177–191 (2013)
Cai, X.Y., Li, W.J.: Mutually Reinforced Manifold-Ranking Based Relevance Propagation Model for Query-Focused Multi-Document Summarization. IEEE Transactions on Audio, Speech and Language Processing 20, 1597–1607 (2012)
Tenenbaum, J.B., Silva, V., Langford, J.C.: A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science 290, 2319–2323 (2000)
Roweis, S., Saul, L.: Nonlinear Dimensionality Reduction by Locally Linear Embedding. Science 290, 2323–2326 (2000)
Hinton, G., Roweis, S.: Stochastic Neighbor Embedding. Advances in Neural Information Processing Systems 15, 833–840 (2003)
van der Maaten, L.J.P., Hinton, G.: Visualizing Data Using t-SNE. Journal of Machine Learning Research 9, 2579–2605 (2008)
Georgia, A., Elias, I., Alexandros, P.: Low-dimensional manifold distributional semantic models. In: Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014), pp. 731–740 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Hu, P., He, J., Zhang, Y. (2015). Graph-Based Query-Focused Multi-document Summarization Using Improved Affinity Graph. In: Zhang, S., Wirsing, M., Zhang, Z. (eds) Knowledge Science, Engineering and Management. KSEM 2015. Lecture Notes in Computer Science(), vol 9403. Springer, Cham. https://doi.org/10.1007/978-3-319-25159-2_31
Download citation
DOI: https://doi.org/10.1007/978-3-319-25159-2_31
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25158-5
Online ISBN: 978-3-319-25159-2
eBook Packages: Computer ScienceComputer Science (R0)