Advertisement

Evolving Systems

, Volume 10, Issue 3, pp 409–424 | Cite as

Extractive summarization using semigraph (ESSg)

  • Sheetal SonawaneEmail author
  • Parag Kulkarni
  • Charusheela Deshpande
  • Bhagyashree Athawale
Original Paper
  • 68 Downloads

Abstract

Summary is the meaningful concise version of a text document. Generally existing statistical, knowledge based and discourse based extractive summarization methods use sentence similarity to extract informative sentences. This paper presents an innovative application of semigraph which includes the processes of semigraph construction and sentence extraction. Multilevel association among significant features of the text document can be represented using semigraph. Multi vertices property of semigraph helps in finding linear and nonlinear relationship between features. Some variation in semigraph in context of text document is proposed in this paper. The threshold for sentence length is calculated dynamically based on the sentence score. Challenge of measuring and analyzing performance is countered using proposed HIT ratio and ROUGE measures. Substantial experiments on benchmark dataset demonstrate that the proposed solution achieves encouraging performance. Multi directed mapping among summaries generated, using existing method is used to calculate effective index.

Keywords

Summarization Extractive summarization Semigraph Graph model 

Notes

References

  1. Baralis E, Cagliero L, Mahoto N, Alessandro F (2013) GRAPHSUM: discovering correlations among multiple terms for graph-based summarization. Inf Sci 249:96–109MathSciNetCrossRefGoogle Scholar
  2. Bhave SS, Ban BY, Deshpande CM (2009) Line semigraph of a semigraph. Adv Stud Contemp Math (Kyungshang) 2(2)Google Scholar
  3. Binwahlan MS, Salim N, Suanmali L (2010) Fuzzy swarm diversity hybrid model for text summarization. Inf Process Manag 46(5):571–588CrossRefGoogle Scholar
  4. Das D, Martins AFT (2007) A survey on automatic text summarization. In: Literature survey for the language and statistics II course at CMU 4, pp 192–195Google Scholar
  5. Deshpande CM, Gaidhani YS, Athawale BP (2015) Incidence matrix of semigraph. Br J Math Comput Sci 9(1):12–20CrossRefzbMATHGoogle Scholar
  6. El-Kilany A, Saleh I (2012) Unsupervised document summarization using clusters of dependency graph nodes. In: 12th international conference on intelligent systems design and applications (ISDA), pp 557–561Google Scholar
  7. Erkan G, Radev DR (2004) LexRank: graph-based lexical centrality as salience in text summarization. J Artif Intell Res 22:457–479CrossRefGoogle Scholar
  8. Ferreira Rafael et al (2013) Assessing sentence scoring techniques for extractive text summarization. Expert Syst Appl 40(14):5755–5764CrossRefGoogle Scholar
  9. Ferreira R, Freitas F, de Cabral LS, Lins RD, Lima R (2013) A four dimension graph model for automatic text summarization. In: IEEE/WIC/acm international conferences on web intelligence (WI) and intelligent agent technology (IAT), vol 1. IEEE, pp 389–396Google Scholar
  10. Gaidhani YS, Deshpande CM, Athawale BP (2017) Adjacency matrix of a semigraph. Electron Notes Discrete Math 63:399–406 (ISSN: 1571-0653)MathSciNetCrossRefzbMATHGoogle Scholar
  11. Ge SS, Zhang Z, He H (2011) Weighted graph model based sentence clustering and ranking for document summarization. In: 4th international conference on interaction sciences (ICIS), pp 90–95Google Scholar
  12. Goyal P, Behera L, McGinnity TM (2013) A context-based word indexing model for document summarization. IEEE Trans Knowl Data Eng 25(8):1693–1705CrossRefGoogle Scholar
  13. Hong K, Conroy JM, Favre B, Kulesza A, Lin H, May Nenkova A (2014) A repository of state of the art and competitive baseline summaries for generic news summarization. In: LREC, pp 1608–1616Google Scholar
  14. Jones KS (1998) Automatic summarising: factors and directions. In: Advances in Automatic text summarization. MIT Press, pp 1–12Google Scholar
  15. Joshi SG, Sonawane SS (2015) A survey of extractive summarization approaches using graph model. Int J Comput Eng Appl 9(4):145–156Google Scholar
  16. Kamat SS, Bhat RS (2003) Domination in semigraph. ELectron Notes Discrete Math 15:106–111MathSciNetCrossRefGoogle Scholar
  17. Kumar YJ, Salim N (2012) Automatic multi document summarization approaches. J Comput Sci ISSN 1549–3636:33–140Google Scholar
  18. Lin C-Y (2004) Rouge: a package for automatic evaluation of summaries. In: Text summarization branches out: Proceedings of the ACL-04 workshop, vol 8Google Scholar
  19. Mihalcea R, Tarau P (2004) TextRank: bringing order into texts. Association for Computational Linguistics, BarcelonaGoogle Scholar
  20. Moawad IF, Aref M (2012) Semantic graph reduction approach for abstractive text summarization. In: Seventh international conference on computer engineering and systems (ICCES), pp 132–138Google Scholar
  21. Radev DR, Hovy E, McKeown K (2002) Introduction to the special issue on summarization. Computat Linguist 28(4):399–408CrossRefGoogle Scholar
  22. Radev DR, Jing H, Stys M, Tam D (2004) Centroid-based summarization of multiple documents. Inf Process Manag 40:919–938CrossRefzbMATHGoogle Scholar
  23. Ramesh A, Srinivasa KG, Pramod N (2014) SentenceRank—a graph based approach to summarize text. In: 2014 fifth international conference on applications of digital information and web technologies (ICADIWT), pp 177–182Google Scholar
  24. Sampathkumar E (2000) Semigraphs and their applications. Report on the DST projectGoogle Scholar
  25. Saravanan M (2008) Ontology-based retrieval and automatic summarization of legal judgments. In: IITMGoogle Scholar
  26. Sirkol SS, Hampholi PR, Kaliwal M (2016) Dominating functions in semigraphs. Int J Eng Res Technol 5(2):28–31Google Scholar
  27. Sonawane S (2018) Extractive summarization dataset. Mendeley Data.  https://doi.org/10.17632/z59vy3rb2r.1 Google Scholar
  28. Sonawane SS, Kulkarni PA (2014) Graph based representation and analysis of text document: a survey of techniques. Int J Comput Appl 96(19):1–8Google Scholar
  29. Sonawane SS, Kulkarni PA (2016) Context-based co-reference resolution for text document using graph model (cont-graph). Int J Knowl Eng Data Min 4:1–17CrossRefGoogle Scholar
  30. Sonawane S, Kulkarni P (2016) The role of conference resolution in extractive summarization. In: International conference in computing, analytics and security trends (CAST). IEEE, pp 351–356Google Scholar
  31. Steinberger J, Kabadjov MA, Poesio M, Sanchez-Graillet O (2005) Improving LSA-based summarization with anaphora resolution. In: Proceedings of the conference on human language technology and empirical methods in natural language processing. Association for Computational Linguistics pp. 1–8Google Scholar
  32. Surajit KR, Das P (2013) Matching in semigraph. Int J Comput Appl 3(6):21–38Google Scholar
  33. Wei Y (2012) Document summarization method based on heterogeneous graph. In: 9th international conference on fuzzy systems and knowledge discovery, pp 1285–1289Google Scholar
  34. Weu F, He Y, Li W, Lu Q (2008) A query-sensitive graph-based sentence ranking algorithm for query-oriented multi-document summarization. In: International symposiums on information processing (ISIP), pp 9–13Google Scholar
  35. Wu Z, Palmer M (1994) Verbs semantics and lexical selection. In: Proceedings of the 32nd annual meeting on Association for Computational Linguistics, pp 133–138Google Scholar
  36. Zhang J, Cheng X, Xu H (2008) GSPSummary: a graph-based sub-topic partition algorithm for summarization. In: Information retrieval technology, pp 321–334Google Scholar

Copyright information

© Springer-Verlag GmbH Germany, part of Springer Nature 2018

Authors and Affiliations

  • Sheetal Sonawane
    • 1
    • 2
    Email author
  • Parag Kulkarni
    • 2
    • 3
  • Charusheela Deshpande
    • 4
  • Bhagyashree Athawale
    • 4
  1. 1.Department of Computer EngineeringPune Institute of Computer TechnologyPuneIndia
  2. 2.College of EngineeringPuneIndia
  3. 3.iKnowlation Research Labs Pvt. Ltd.PuneIndia
  4. 4.Department of MathematicsCollege of EngineeringPuneIndia

Personalised recommendations