Text Summarization of Single Documents Based on Syntactic Sequences

Villavicencio, Paul; Watanabe, Toyohide

doi:10.1007/978-3-642-22158-3_31

Paul Villavicencio⁶ &
Toyohide Watanabe⁶

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 11))

757 Accesses

Abstract

In this paper we propose a summarization method for scientific articles from the viewpoint of the syntactic sequences. The objective is to generate an extractive summary by ranking sentences according to their informative content, on the basis of the idea that the writing styles of authors create syntactic patterns which may contain important information about topics explained in a research paper. We use two main document features in our summarizing algorithm: syntactic sequences and frequent terms per section. We present an evaluation of our proposed algorithm by comparing it with existing summarization methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Barzilay, R., Elhadad, M.: Using Lexical Chains for Text Summarization. In: Proceedings of the Intelligent Scalable Text Summarization Workshop, Madrid, pp. 10–17 (1997)
Google Scholar
Bawakid, A., Oussalah, M.: A semantic summarization system: University of Birmingham at TAC 2008. In: Proceedings of the First Text Analysis Conference, Maryland, USA (2008)
Google Scholar
Hercules Dalianis. SweSum - A Text Summarizer for Swedish. Technical report, NADA, KTH, Stockholm (2000), http://people.dsv.su.se/hercules/papers/Textsumsummary.html
Hunston, S.: Starting with the small words Patterns, lexis and semantic sequences. International journal of corpus linguistics 13(3), 271–295 (2008)
Article Google Scholar
Gledhill, C.J.: Collocations in science writing. Narr Verlag, Tübingen (2000)
Google Scholar
Lioma, C., Ounis, I.: A syntactically-based query reformulation technique for information retrieval. Information Processing and Management: an International Journal 44(1), 143–162 (2008)
Article MATH Google Scholar
Liu, X., Webster, J.J., Kit, C.: An Extractive Text Summarizer Based on Significant Words. In: Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy, pp. 168–178. Springer, Heidelberg (2009)
Chapter Google Scholar
Matsuo, Y., Ishizuka, M.: Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools 13(1), 157–169 (2004)
Article Google Scholar
Papineni, K., Roukos, S., Ward, T., Zhu, W.-J.: BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318. Association for Computational Linguistics, Stroudsburg (2002)
Google Scholar
Radev, D., Allison, T., Blair-Goldensohn, S., Blitzer, J., Çelebi, A., Dimitrov, S., Drabek, E., Hakim, A., Lam, W., Liu, D., Otterbacher, J., Qi, H., Saggion, H., Teufel, S., Topper, M., Winkel, A., Zhang, Z.: MEAD - a platform for multidocument multilingual text summarization. In: Proceedings of the 4th International Conference on Language Resources and Evaluation, Lisbon, Portugal (2004)
Google Scholar
Nadav Rotem. Open Text Summarizer, http://libots.sourceforge.net/
Saggion, H., Lapalme, G.: Generating indicative-informative summaries with sumUM. Computational Linguistics 28(4), 497–526 (2002)
Article Google Scholar
Schuemie, M.J., Weeber, M., Schijvenaars, B.A., van Mulligen, E.M., Christiaan, C., van der Eijk, Jelier, R., Mons, B., Kors, J.A.: Distribution of information in biomedical abstracts and full-text publications.. Bioinformatics 20(16), 2597–2604 (2004)
Article Google Scholar
Silber, H.G., McCoy, K.F.: Efficient text summarization using lexical chains. In: Proceedings of the 5th International Conference on Intelligent user Interfaces, pp. 252–255. ACM Press, New York (2000)
Chapter Google Scholar
Strobelt, H., Oelke, D., Rohrdantz, C., Stoffel, A., Keim, D., Deussen, O.: Document Cards: A Top Trumps Visualization for Documents. IEEE Transactions on Visualization and Computer Graphics 15(6), 1145–1152 (2009)
Article Google Scholar
Teufel, S., Moens, M.: Summarizing scientific articles: experiments with relevance and rhetorical status. Computational Linguistics 28(4), 409–445 (2002)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Systems and Social Informatics, Graduate School of Information Science, Nagoya University, Nagoya, 464-8603, Japan
Paul Villavicencio & Toyohide Watanabe

Authors

Paul Villavicencio
View author publications
You can also search for this author in PubMed Google Scholar
Toyohide Watanabe
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics, University of Piraeus, 80 Karaoli & Dimitriou St., 18534, Piraeus, Greece
George A. Tsihrintzis & Maria Virvou &
School of Electrical and Information Engineering, University of South Australia, Mawson Lakes Campus, SA 5095, Adelaide, South Australia, Australia
Lakhmi C. Jain
KES International, 2115, BN43 9AF, Shoreham-by-sea, West Sussex, United Kingdom
Robert J. Howlett

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Villavicencio, P., Watanabe, T. (2011). Text Summarization of Single Documents Based on Syntactic Sequences. In: Tsihrintzis, G.A., Virvou, M., Jain, L.C., Howlett, R.J. (eds) Intelligent Interactive Multimedia Systems and Services. Smart Innovation, Systems and Technologies, vol 11. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22158-3_31

Download citation

DOI: https://doi.org/10.1007/978-3-642-22158-3_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22157-6
Online ISBN: 978-3-642-22158-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics