Skip to main content

Graph-Based Query-Focused Multi-document Summarization Using Improved Affinity Graph

  • Conference paper
  • First Online:
Book cover Knowledge Science, Engineering and Management (KSEM 2015)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9403))

Abstract

Manifold ranking is one of the most competitive approaches for query-focused multi-document summarization. Despite its success for this task, it usually constructs a sentence affinity graph first based on inter-sentence content similarity, and then perform manifold ranking on the graph to score each sentence with the assumption that all the sentences live on a single manifold. Actually, for a document set to be summarized, the distribution of the sentences might form different, but related manifolds. This paper aims to generalize the basic manifold-ranking based approach to the more generic setting by introducing a novel affinity graph to estimate the similarity between sentences, which leverages both the local geometric structures and the contents of sentences jointly. Preliminary experimental results on the DUC datasets demonstrate the good effectiveness of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Wan, X.J., Yang, J.W., Xiao, J.G.: Manifold-ranking based topic-focused multi-document summarization. In: Proceedings of the 20th International Joint Conference on Artificial Intelligence (IJCAI 2007), pp. 2903–2908 (2007)

    Google Scholar 

  2. Wan, X.J., Xiao, J.G.: Graph-based multi-modality learning for topic-focused multi-document summarization. In: Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI 2009), pp. 1586–1591 (2009)

    Google Scholar 

  3. Saggion, H., Bontcheva, K., Cunningham, H.: Robust generic and query-based summarization. In: Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2003), pp. 2903–2908 (2003)

    Google Scholar 

  4. Radev, D.R., Jing, H.Y., Stys, M., Tam, D.: Centroid-Based Summarization of Multiple Documents. Information Processing and Management 40, 919–938 (2004)

    Article  MATH  Google Scholar 

  5. Lin, C.Y., Hovy, E.: From single to multi-document summarization: a prototype system and its evaluation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics (ACL 2002), pp. 457–464 (2002)

    Google Scholar 

  6. Nenkova, A., Louis, A.: Can you summarize this? identifying correlates of input difficulty for generic multi-document summarization. In: Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL 2008), pp. 825–833 (2008)

    Google Scholar 

  7. Celikyilmaz, A., Hakkani-Tur, D.: A hybrid hierarchical model for multi-document summarization. In: Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL 2010), pp. 815–824 (2010)

    Google Scholar 

  8. Gillick, D., Favre, B.: A scalable global model for summarization. In: Proceedings of the Workshop on Integer Linear Programming for Natural Language Processing (ILP 2009), pp. 10–18 (2009)

    Google Scholar 

  9. Erkan, G., Radev, D.R.: LexRank: Graph-Based Centrality as Salience in Text Summarization. Journal of Artificial Intelligence Research 22, 457–479 (2004)

    Article  Google Scholar 

  10. Mihalcea, R., Tarau, P.: TextRank-bringing order into text. In: Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP 2004), pp. 404–411 (2004)

    Google Scholar 

  11. Wan, X.J., Yang, J.W.: Multi-document summarization using cluster-based link analysis. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 299–306 (2008)

    Google Scholar 

  12. Haveliwala, T.: Topic-Sensitive PageRank: A Context-Sensitive Ranking Algorithm for Web Search. IEEE Transactions on Knowledge and Data Engineering 15, 784–796 (2003)

    Article  Google Scholar 

  13. Zhao, L., Wu, L.D., Huang, X.J.: Query Expansion in Graph-Based Approach for Query-Focused Multi-Document Summarization. Information Processing and Management 45, 35–41 (2009)

    Article  Google Scholar 

  14. Carbonell, J., Goldstein, J.: The use of MMR, diversity-based reranking for reordering documents and producing summaries. In: Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 1998), pp. 335–336 (1998)

    Google Scholar 

  15. Wang, D.D., Li, T., Zhu, S.H., Ding, C.: Multi-Document summarization via sentence-level semantic analysis and symmetric matrix factorization. In: Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), pp. 307–314 (2008)

    Google Scholar 

  16. Tang, J., Yao, L.M., Chen, D.W.: Multi-topic based query-oriented summarization. In: Proceedings of the 9th SIAM International Conference on Data Mining (SDM 2009), pp. 1147–1158 (2009)

    Google Scholar 

  17. Wei, F., Li, W., Lu, Q., He, Y.: A cluster-sensitive graph model for query-oriented multi-document summarization. In: Plachouras, V., Macdonald, C., Ounis, I., White, R.W., Ruthven, I. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 446–453. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  18. Tong, H.H., He, J.R., Li, M.J., Zhang, C.S., Ma, W.Y.: Graph based multi-modality learning. In: Proceedings of the 13th Annual ACM International Conference on Multimedia (MM 2005), pp. 862–871 (2005)

    Google Scholar 

  19. Zhou, D., Weston, J., Gretton, A., Bousquet, O., Scholkopf, B.: Ranking on Data Manifolds. Advances in Neural Information Processing Systems 16, 169–176 (2004)

    Google Scholar 

  20. Goldberg, A.B., Zhu, X.J., Singh, A., Xu, Z., Nowak, R.: Multi-Manifold Semi-Supervised Learning. Journal of Machine Learning Research 5, 169–176 (2009). Proceedings Track

    Google Scholar 

  21. Lin, C.Y., Hovy, E.: Automatic evaluation of summaries using N-gram cooccurrence statistics. In: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology (NAACL 2003), pp. 71–78 (2003)

    Google Scholar 

  22. Cheng, X.Q., Du, P., Guo, J.F., Zhu, X.F., Chen, Y.X.: Ranking on Data Manifold with Sink Points. IEEE Transactions on Knowledge and Data Engineering 25, 177–191 (2013)

    Article  Google Scholar 

  23. Cai, X.Y., Li, W.J.: Mutually Reinforced Manifold-Ranking Based Relevance Propagation Model for Query-Focused Multi-Document Summarization. IEEE Transactions on Audio, Speech and Language Processing 20, 1597–1607 (2012)

    Article  Google Scholar 

  24. Tenenbaum, J.B., Silva, V., Langford, J.C.: A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science 290, 2319–2323 (2000)

    Article  Google Scholar 

  25. Roweis, S., Saul, L.: Nonlinear Dimensionality Reduction by Locally Linear Embedding. Science 290, 2323–2326 (2000)

    Article  Google Scholar 

  26. Hinton, G., Roweis, S.: Stochastic Neighbor Embedding. Advances in Neural Information Processing Systems 15, 833–840 (2003)

    Google Scholar 

  27. van der Maaten, L.J.P., Hinton, G.: Visualizing Data Using t-SNE. Journal of Machine Learning Research 9, 2579–2605 (2008)

    MATH  Google Scholar 

  28. Georgia, A., Elias, I., Alexandros, P.: Low-dimensional manifold distributional semantic models. In: Proceedings of the 25th International Conference on Computational Linguistics (COLING 2014), pp. 731–740 (2014)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Po Hu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Hu, P., He, J., Zhang, Y. (2015). Graph-Based Query-Focused Multi-document Summarization Using Improved Affinity Graph. In: Zhang, S., Wirsing, M., Zhang, Z. (eds) Knowledge Science, Engineering and Management. KSEM 2015. Lecture Notes in Computer Science(), vol 9403. Springer, Cham. https://doi.org/10.1007/978-3-319-25159-2_31

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-25159-2_31

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-25158-5

  • Online ISBN: 978-3-319-25159-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics