Abstract
In this paper we propose a summarization method for scientific articles from the viewpoint of the syntactic sequences. The objective is to generate an extractive summary by ranking sentences according to their informative content, on the basis of the idea that the writing styles of authors create syntactic patterns which may contain important information about topics explained in a research paper. We use two main document features in our summarizing algorithm: syntactic sequences and frequent terms per section. We present an evaluation of our proposed algorithm by comparing it with existing summarization methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Barzilay, R., Elhadad, M.: Using Lexical Chains for Text Summarization. In: Proceedings of the Intelligent Scalable Text Summarization Workshop, Madrid, pp. 10–17 (1997)
Bawakid, A., Oussalah, M.: A semantic summarization system: University of Birmingham at TAC 2008. In: Proceedings of the First Text Analysis Conference, Maryland, USA (2008)
Hercules Dalianis. SweSum - A Text Summarizer for Swedish. Technical report, NADA, KTH, Stockholm (2000), http://people.dsv.su.se/hercules/papers/Textsumsummary.html
Hunston, S.: Starting with the small words Patterns, lexis and semantic sequences. International journal of corpus linguistics 13(3), 271–295 (2008)
Gledhill, C.J.: Collocations in science writing. Narr Verlag, Tübingen (2000)
Lioma, C., Ounis, I.: A syntactically-based query reformulation technique for information retrieval. Information Processing and Management: an International Journal 44(1), 143–162 (2008)
Liu, X., Webster, J.J., Kit, C.: An Extractive Text Summarizer Based on Significant Words. In: Proceedings of the 22nd International Conference on Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy, pp. 168–178. Springer, Heidelberg (2009)
Matsuo, Y., Ishizuka, M.: Keyword extraction from a single document using word co-occurrence statistical information. International Journal on Artificial Intelligence Tools 13(1), 157–169 (2004)
Papineni, K., Roukos, S., Ward, T., Zhu, W.-J.: BLEU: a method for automatic evaluation of machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318. Association for Computational Linguistics, Stroudsburg (2002)
Radev, D., Allison, T., Blair-Goldensohn, S., Blitzer, J., Çelebi, A., Dimitrov, S., Drabek, E., Hakim, A., Lam, W., Liu, D., Otterbacher, J., Qi, H., Saggion, H., Teufel, S., Topper, M., Winkel, A., Zhang, Z.: MEAD - a platform for multidocument multilingual text summarization. In: Proceedings of the 4th International Conference on Language Resources and Evaluation, Lisbon, Portugal (2004)
Nadav Rotem. Open Text Summarizer, http://libots.sourceforge.net/
Saggion, H., Lapalme, G.: Generating indicative-informative summaries with sumUM. Computational Linguistics 28(4), 497–526 (2002)
Schuemie, M.J., Weeber, M., Schijvenaars, B.A., van Mulligen, E.M., Christiaan, C., van der Eijk, Jelier, R., Mons, B., Kors, J.A.: Distribution of information in biomedical abstracts and full-text publications.. Bioinformatics 20(16), 2597–2604 (2004)
Silber, H.G., McCoy, K.F.: Efficient text summarization using lexical chains. In: Proceedings of the 5th International Conference on Intelligent user Interfaces, pp. 252–255. ACM Press, New York (2000)
Strobelt, H., Oelke, D., Rohrdantz, C., Stoffel, A., Keim, D., Deussen, O.: Document Cards: A Top Trumps Visualization for Documents. IEEE Transactions on Visualization and Computer Graphics 15(6), 1145–1152 (2009)
Teufel, S., Moens, M.: Summarizing scientific articles: experiments with relevance and rhetorical status. Computational Linguistics 28(4), 409–445 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Villavicencio, P., Watanabe, T. (2011). Text Summarization of Single Documents Based on Syntactic Sequences. In: Tsihrintzis, G.A., Virvou, M., Jain, L.C., Howlett, R.J. (eds) Intelligent Interactive Multimedia Systems and Services. Smart Innovation, Systems and Technologies, vol 11. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22158-3_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-22158-3_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22157-6
Online ISBN: 978-3-642-22158-3
eBook Packages: EngineeringEngineering (R0)