Graph Based Technique for Hindi Text Summarization

Kumar, K. Vimal; Yadav, Divakar; Sharma, Arun

doi:10.1007/978-81-322-2250-7_29

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 339)

Abstract

Automatic summarization is the process of generating a summary of an input document or extracting its important sentences. Many such systems already exist for English, so the proposed system focuses on Hindi. The basic idea is to identify the important sentences and extract them based on their relevance to the other sentences. The sentences in a summary should be meaningful and relevant to one another, which is achieved using sentential semantic analysis. A graph-based approach is found to be more appropriate for finding the relations between sentences and analyzing their importance. Sentences are ranked by the frequency with which their words occur in the input document, and the ranks are used to identify the important sentences. The relevance of each sentence to the other sentences in the document is measured using semantic similarity. Two different sentences with a very high semantic similarity score may convey the same information; such sentences should appear only once in the output, so an analysis is performed over the semantically similar sentences. Finally, the relevant sentences are merged using their ranks and the semantic analysis. The selected sentences are then rearranged to produce a meaningful summary and to avoid textual discontinuity in the output. The system is found to perform well in terms of precision, recall and F-measure on various input documents.
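
The pipeline described in the abstract (frequency-based ranking, a sentence graph weighted by similarity, removal of near-duplicate sentences, and restoring document order) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the semantic similarity measure, so Jaccard word overlap stands in for it, and the scoring formula, the `redundancy_threshold` parameter, and all function names are assumptions made for illustration. Stemming and stop-word removal are omitted.

```python
import re
from collections import Counter


def split_sentences(text):
    # Hindi sentences usually end with the danda "।"; also accept ?, ! and .
    parts = re.split(r"[।?!.]+", text)
    return [p.strip() for p in parts if p.strip()]


def tokenize(sentence):
    # Whitespace tokenization only (assumption: no stemming, no stop-word list).
    return sentence.split()


def jaccard(a, b):
    # Stand-in for semantic similarity: word overlap between two sentences.
    a, b = set(a), set(b)
    return len(a & b) / len(a | b) if a | b else 0.0


def summarize(text, k=2, redundancy_threshold=0.8):
    sentences = split_sentences(text)
    tokens = [tokenize(s) for s in sentences]
    freq = Counter(w for t in tokens for w in t)

    # Frequency-based score: mean document frequency of a sentence's words.
    freq_score = [sum(freq[w] for w in t) / len(t) for t in tokens]

    # Graph: nodes are sentences, edge weights are pairwise similarities.
    n = len(sentences)
    sim = [[jaccard(tokens[i], tokens[j]) if i != j else 0.0
            for j in range(n)] for i in range(n)]
    centrality = [sum(row) for row in sim]

    # Combined rank (assumed formula): frequency score scaled by connectivity.
    score = [freq_score[i] * (1.0 + centrality[i]) for i in range(n)]

    # Greedily pick top-ranked sentences, skipping near-duplicates of picks.
    chosen = []
    for i in sorted(range(n), key=lambda i: score[i], reverse=True):
        if all(sim[i][j] < redundancy_threshold for j in chosen):
            chosen.append(i)
        if len(chosen) == k:
            break

    # Restore original document order to keep the summary readable.
    return [sentences[i] for i in sorted(chosen)]
```

Calling `summarize(text, k=2)` returns at most `k` sentences in their original order. A sentence that repeats an already selected one, with overlap at or above the threshold, is kept only once.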

Author information

Authors and Affiliations

Jaypee Institute of Information Technology, Noida, India
K. Vimal Kumar & Divakar Yadav
Gautam Buddha University, Greater Noida, India
Arun Sharma

Corresponding author

Correspondence to K. Vimal Kumar.

Editor information

Editors and Affiliations

University of Kalyani, Kalyani, West Bengal, India
J. K. Mandal
Department of Computer Science and Engineering, Anil Neerukonda Institute of Technology and Sciences, Vishakapatnam, India
Suresh Chandra Satapathy
Dean, Faculty of Engineering, Technology, University of Kalyani, Kalyani, West Bengal, India
Manas Kumar Sanyal
Engineering and Technological Studies, University of Kalyani, Kalyani, West Bengal, India
Partha Pratim Sarkar
Department of Computer Science & Engineering, University of Kalyani, Kalyani, India
Anirban Mukhopadhyay

Cite this paper

Kumar, K.V., Yadav, D., Sharma, A. (2015). Graph Based Technique for Hindi Text Summarization. In: Mandal, J., Satapathy, S., Kumar Sanyal, M., Sarkar, P., Mukhopadhyay, A. (eds) Information Systems Design and Intelligent Applications. Advances in Intelligent Systems and Computing, vol 339. Springer, New Delhi. https://doi.org/10.1007/978-81-322-2250-7_29

DOI: https://doi.org/10.1007/978-81-322-2250-7_29
Published: 21 January 2015
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-2249-1
Online ISBN: 978-81-322-2250-7
eBook Packages: Engineering, Engineering (R0)
