Skip to main content

Single Document Extractive Text Summarization Using Neural Networks and Genetic Algorithm

  • Conference paper
  • First Online:
Intelligent Computing (SAI 2018)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 858))

Included in the following conference series:

Abstract

The presented paper proposes an extractive text summarization technique for single documents using Neural Networks and Genetic Algorithms. The Neural Network helps to define a fitness function to express mathematically the quality of the generated summary through six desired properties which are theme similarity, cohesion, sentiment, readability, aggregate similarity and sentence position. Genetic Algorithm maximizes the above-mentioned fitness function, and extracts the most important sentences to create the extractive summary. The results are compiled using DUC2002 data as a benchmark and calculated using the precision-recall technique. They are compared with techniques using Genetic Algorithm, Neural Network and a summarizer made by Microsoft. The comparison between the results clearly demonstrates the superiority of the technique and is very encouraging for future work in this area.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Li, S., Karatzoglou, A., Gentile, C.: Collaborative filtering bandits. In: The 39th ACM SIGIR, pp. 539–548 (2016)

    Google Scholar 

  2. Gentile, C., Li, S., Kar, P., Karatzoglou, A., Etrue, E., Zappella, G.: On context-dependent clustering of bandits. In: The 34th ICML, pp. 1253–1262 (2017)

    Google Scholar 

  3. Korda, N., Szorenyi, B., Li, S.: Distributed clustering of linear bandits in peer to peer networks. In: The 33rd ICML, pp. 1301–1309 (2016)

    Google Scholar 

  4. Mani, I.: Automatic Summarization. John Benjamins Publishing Company (2001)

    Google Scholar 

  5. Luhn, H.P.: The automatic creation of literature abstracts. IBM J. Res. Dev. 2(2), 159–165 (1958)

    Article  MathSciNet  Google Scholar 

  6. Edmundson, H.P.: New methods in automatic extracting. J. ACM 16(2), 264–285 (1969)

    Article  Google Scholar 

  7. Barzilay, R., Elhadad, M.: Using lexical chains for text summarization. In: Mani, I., Maybury, M.T. (eds.) Advances in Automatic Text Summarization, pp. 111–121. MIT Press (1999)

    Google Scholar 

  8. Fattah, M.A., Ren, F.: GA, MR, FFNN, PNN and GMM based models for automatic text summarization. Comput. Speech Lang. 23(1), 126–144 (2009)

    Article  Google Scholar 

  9. Chatterjee, N., Bhardwaj, A.: Single document text summarization using random indexing and neural networks. In: KEOD 2010, pp. 171–176 (2010)

    Google Scholar 

  10. Chatterjee, N., Mittal, A., Goyal, S.: Single document extractive text summarization using genetic algorithms. In: Third International Conference on Emerging Applications of Information Technology (EAIT) (2012)

    Google Scholar 

  11. Qazvinian, V., Hasaanabadi, L.S., Halavati, R.: Summarising text with a genetic algorithm-based sentence extraction. Int. J. Knowl. Manag. Stud. 2(4), 426–444 (2008)

    Article  Google Scholar 

  12. Yates, R.B., Neto, B.R.: Modern Information Retrieval. Addison Wesley (1999)

    Google Scholar 

  13. Bird, S., Klein, E., Loper, E.: Natural Language Processing with Python. O’Reilly Media (2009)

    Google Scholar 

  14. Mitra, M., Singhal, A., Buckley, C.: Automatic text summarization by paragraph extraction. In: ACL Workshop on Intelligent and Scalable Text Summarization, Madrid Spain, pp. 39–46 (1997)

    Google Scholar 

  15. Cormen, T.H., Leiserson, C.E., Rivest, R.L.: Introduction to Algorithms, 2nd edn. The MIT Press, Cambridge (2009)

    MATH  Google Scholar 

  16. Goldberg, D.E.: Genetic Algorithms. Addison Wiley Longman Inc. (1999)

    Google Scholar 

  17. Spears, W.M., Anand, V.: A study of crossover operators in genetic programming. In: Proceedings of the 6th International Symposium on Methodologies for Intelligent Systems, ser. ISMIS 1991, Springer, London, pp. 409–418 (1991)

    Google Scholar 

  18. Jizba, R.: Measuring Search Effectiveness, Precision and Recall. Creighton University (2007)

    Google Scholar 

  19. Deb, K., Pratap, A., Agarwal, S., Meyarivan, T.: A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 6(2), 182–197 (2002)

    Article  Google Scholar 

  20. Perumal, K., Chaudhuri, B.B.: Language independent sentence extraction based text summarization. In: ICON-2011: 9th International Conference on Natural Language Processing, pp. 213–217. Macmillan, India (2011)

    Google Scholar 

  21. Kupiec, J., Pedersen, J., Chen, F.: A trainable document summarizer. In: Proceedings of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, ser. SIGIR 1995, pp. 68–73. ACM, New York (1995)

    Google Scholar 

  22. Nielsen, F.Å.: A new ANEW: evaluation of a word list for sentiment analysis in microblogs. In: Proceedings of the ESWC2011 Workshop on ‘Making Sense of Microposts’: Big Things Come in Small Packages 718 in CEUR Workshop Proceedings, pp. 93–98, May 2011

    Google Scholar 

  23. Ramanujam, N., Kaliappan, M.: An automatic multidocument text summarization approach based on Naïve Bayesian classifier using timestamp strategy. Sci. World J. 2016 (2016). Article ID 1784827

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Niladri Chatterjee .

Editor information

Editors and Affiliations

A Appendix

A Appendix

Fig. 3.
figure 3

Graph comparing the results of all the different techniques.

figure b
figure c

Document No. 17 of the DUC 2002 data set containing 28 sentences.

figure d

Ideal Summary for Document No.17, containing sentences 1, 2, 3, 4, 5, 6, 7, 8, 9, 12, 14, 15, 17, 18, 23, 24.

figure e

Summary for Document No.17 generated by our algorithm, containing sentences 1, 2, 3, 4, 5, 6, 7, 8, 10, 11, 12, 16, 17, 18, 25, 26. Precision = 0.6875.

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chatterjee, N., Jain, G., Bajwa, G.S. (2019). Single Document Extractive Text Summarization Using Neural Networks and Genetic Algorithm. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Computing. SAI 2018. Advances in Intelligent Systems and Computing, vol 858. Springer, Cham. https://doi.org/10.1007/978-3-030-01174-1_26

Download citation

Publish with us

Policies and ethics