Automatic Text Summarization of Video Lectures Using Subtitles

  • Shruti GargEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 555)


Text summarization can be defined as a process of reducing a text document using computer program in order to generate a summary of original document that consists of most important things covered in that. An example of summarization technology is search engines such as Google. This paper orients for analyzing and producing text summary of video lectures by harnessing the subtitles file provided along with the lectures. Extractive text summarization method has been adopted to produce the summaries from the source subtitles. This would help user in deciding whether a particular lecture is relevant to them or not, thereby saving their time and aiding them in quick decision making. Experiments were conducted on various subtitle files belonging to different lectures, and it has been found that extractive text summarization reduces the content of original subtitle file up to sixty percent by tf-idf approach.


Summarization tf-idf Information retrieval Subtitles 


  1. 1.
    Arnulfo, R., Herandez, G., Ledeneva, Y.,: Word Sequence Models for Single Text Summarization, Second International Conferences on Advances in Computer-Human Interactions, IEEE, pp. 44–48, (2009).Google Scholar
  2. 2.
    Luhn, H. P.,: The Automatic Creation of Literature Abstracts. In Inderjeet Mani and Mark Marbury, editors, Advances in Automatic Text Summarization. MIT Press, (1999).Google Scholar
  3. 3.
    Lloret, E., Palomar, M.,: Text summarization in progress: a literature review, ACM journal of Artificial Intelligence Review, pp. 1–41, vol. 37 (1), (January 2012).Google Scholar
  4. 4.
    Radev, R., Blair-goldensohn, S., Zhang, Z.,: Experiments in Single and Multi-Document Summarization using MEAD. In First Document Under-standing Conference, New Orleans, LA, (2001).Google Scholar
  5. 5.
    Radev, D., Weiguo, F., Zhang, Z., Web in essence: A personalized web-based multi-document summarization and recommendation system. In NAACL Workshop on automatic Summarization, Pittsburg, (2001).Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2017

Authors and Affiliations

  1. 1.BITRanchiIndia

Personalised recommendations