Text Driven Temporal Segmentation of Cricket Videos

Pramod Sankar, K.; Pandey, Saurabh; Jawahar, C. V.

doi:10.1007/11949619_39

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 4338)

Abstract

In this paper we address the problem of temporal segmentation of videos. We present a multi-modal approach in which clues from different information sources are merged to perform the segmentation. Specifically, we segment videos based on textual descriptions, or commentaries, of the action in the video. Such parallel information is available for cricket videos, a class of videos where visual-feature-based (bottom-up) scene segmentation algorithms generally fail because of the lack of visual dissimilarity across space and time. With additional top-down information from the textual domain, these ambiguities can be resolved to a large extent. The video is segmented into meaningful entities, or scenes, using the scene-level descriptions provided by the commentary. These segments can then be automatically annotated with the respective descriptions. This allows semantic access to and retrieval of video segments, which is difficult to obtain with existing visual-feature-based approaches. We also present techniques for automatic highlight generation using our scheme.
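
The abstract does not spell out the alignment algorithm, so the following is only a minimal sketch of one way text-driven segmentation could work, not the authors' method. It assumes that each commentary entry has been reduced to a scene label and that some visual classifier gives every video shot a score per label; the function name, the label names, and the score dictionaries are all hypothetical. Dynamic programming then splits the shots into contiguous, ordered segments, one per commentary entry, so that the total score is maximal.

```python
from typing import Dict, List, Sequence, Tuple


def segment_by_commentary(
    shot_scores: Sequence[Dict[str, float]],
    commentary_labels: Sequence[str],
) -> List[Tuple[int, int]]:
    """Split shots into one contiguous segment per commentary entry.

    shot_scores[i][label] is an assumed score for how well shot i matches
    a scene label (hypothetical classifier output). Returns (start, end)
    shot index pairs, end exclusive, in commentary order.
    """
    n, k = len(shot_scores), len(commentary_labels)
    if k == 0 or n < k:
        raise ValueError("need at least one shot per commentary entry")

    # prefix[j][i] = summed score of shots 0..i-1 under the j-th label
    prefix = []
    for label in commentary_labels:
        row = [0.0]
        for scores in shot_scores:
            row.append(row[-1] + scores.get(label, 0.0))
        prefix.append(row)

    neg_inf = float("-inf")
    # best[j][i] = best total score covering the first i shots with the
    # first j commentary entries; back[j][i] = start of the j-th segment
    best = [[neg_inf] * (n + 1) for _ in range(k + 1)]
    back = [[0] * (n + 1) for _ in range(k + 1)]
    best[0][0] = 0.0
    for j in range(1, k + 1):
        for i in range(j, n + 1):
            for m in range(j - 1, i):  # segment j covers shots m..i-1
                cand = best[j - 1][m] + prefix[j - 1][i] - prefix[j - 1][m]
                if cand > best[j][i]:
                    best[j][i] = cand
                    back[j][i] = m

    # Walk the back-pointers from the last entry to recover boundaries.
    segments = []
    i = n
    for j in range(k, 0, -1):
        m = back[j][i]
        segments.append((m, i))
        i = m
    segments.reverse()
    return segments
```

With five shots whose scores favour the hypothetical labels "dot", "dot", "four", "four", "wicket" in that order, the commentary sequence ["dot", "four", "wicket"] yields the segments (0, 2), (2, 4), (4, 5). The text fixes the number and order of scenes (the top-down information), and the visual scores only decide where the boundaries fall.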

Author information

Authors and Affiliations

Centre for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
K. Pramod Sankar, Saurabh Pandey & C. V. Jawahar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, IIT Delhi, New Delhi, India
Prem K. Kalra
School of Computer Science and Engineering, The Hebrew University of Jerusalem, 91904, Jerusalem, Israel
Shmuel Peleg

About this paper

Cite this paper

Pramod Sankar, K., Pandey, S., Jawahar, C.V. (2006). Text Driven Temporal Segmentation of Cricket Videos. In: Kalra, P.K., Peleg, S. (eds) Computer Vision, Graphics and Image Processing. Lecture Notes in Computer Science, vol 4338. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11949619_39

DOI: https://doi.org/10.1007/11949619_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68301-8
Online ISBN: 978-3-540-68302-5
eBook Packages: Computer Science, Computer Science (R0)
