Semantic Annotation and Indexing of News and Sports Videos

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2540)

Abstract

Broadcasters are demonstrating interest in systems that ease the process of annotating the huge amount of live and archived video material. Exploitation of such assets is considered a key method for improving production quality. Developing systems that support effective retrieval of videos by content requires performing a wide spectrum of operations on video streams, including temporal segmentation, analysis of the audio and video tracks, and identification and recognition of text. Low-level features are then processed to provide a higher-level description of video content, as most user queries relate to higher-level syntax and semantics rather than to the lower lexical level. The specificity of different application domains requires that different solutions be adopted in different contexts. This may affect both the choice of low-level features to be extracted and the modeling of the specific domain knowledge required to address higher-level semantics.

In this paper we will report on our experience in the application contexts of news and soccer videos. We will show solutions adopted to cope with specific requirements of different application domains.

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Assfalg, J., Bertini, M., Colombo, C., Del Bimbo, A., Nunziati, W. (2002). Semantic Annotation and Indexing of News and Sports Videos. In: Grosky, W.I., Plášil, F. (eds) SOFSEM 2002: Theory and Practice of Informatics. SOFSEM 2002. Lecture Notes in Computer Science, vol 2540. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36137-5_6

  • DOI: https://doi.org/10.1007/3-540-36137-5_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-00145-4

  • Online ISBN: 978-3-540-36137-4

  • eBook Packages: Springer Book Archive
