Skip to main content
  • 706 Accesses

Text provides crucial cues for understanding content. For example, the closed captions in broadcast television programs and subtitles in DVD movies facilitate video consumption for viewers. When a transcript is not available for certain content, automatic speech recognition can be used to extract linguistic information. Text information is much more concise than corresponding audio or video. The reason is that we need language knowledge to understand text, and the knowledge itself does not need to be embedded in the text data. For example, we only need five characters to express a “plane,” but to show a video clip of plane takes millions of bytes. Text streams contain very rich semantic information. How to effectively extract information from text is an important component in video content analysis.

In this chapter, we will introduce some fundamentals in text processing that are relevant to content analysis, information extraction, and information retrieval. Specifically, we will discuss part of speech tagging, named entity extraction, text capitalization, stemming, term weighting, and document ranking. We will also present a few methods for story segmentation and text summarization.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer Berlin Heidelberg

About this chapter

Cite this chapter

(2008). Text Processing. In: Introduction to Video Search Engines. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-79337-3_8

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-79337-3_8

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-79336-6

  • Online ISBN: 978-3-540-79337-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics