Semantic Annotation and Indexing of News and Sports Videos

Assfalg, Jürgen; Bertini, Marco; Colombo, Carlo; Del Bimbo, Alberto; Nunziati, Walter

doi:10.1007/3-540-36137-5_6

Conference paper
First Online: 27 November 2002

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2540)

Abstract

Broadcasters are showing interest in systems that ease the annotation of the huge amount of live and archived video material. Exploitation of such assets is considered a key method for improving production quality. Developing systems that support effective content-based retrieval of videos requires a wide spectrum of operations on video streams, including temporal segmentation, analysis of the audio and video tracks, and identification and recognition of text. Low-level features are then processed to provide a higher-level description of video content, since most user queries relate to higher-level syntax and semantics rather than to the lower lexical level. The specificity of different application domains requires that different solutions be adopted in different contexts. This may affect both the choice of low-level features to be extracted and the modeling of the domain knowledge required to address higher-level semantics.

In this paper we will report on our experience in the application contexts of news and soccer videos. We will show solutions adopted to cope with specific requirements of different application domains.

Author information

Authors and Affiliations

Università di Firenze, 50139, Firenze, Italia
Jürgen Assfalg, Marco Bertini, Carlo Colombo, Alberto Del Bimbo & Walter Nunziati

Editor information

Editors and Affiliations

Department of Computer and Information Science, University of Michigan - Dearborn, 4901 Evergreen Road, Dearborn, 48128, Michigan, USA
William I. Grosky
Department of Software Engineering, School of Computer Science, Charles University, Malostranské nám. 25, 118 00, Prague, Czech Republic
František Plášil

About this paper

Cite this paper

Assfalg, J., Bertini, M., Colombo, C., Del Bimbo, A., Nunziati, W. (2002). Semantic Annotation and Indexing of News and Sports Videos. In: Grosky, W.I., Plášil, F. (eds) SOFSEM 2002: Theory and Practice of Informatics. SOFSEM 2002. Lecture Notes in Computer Science, vol 2540. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36137-5_6

DOI: https://doi.org/10.1007/3-540-36137-5_6
Published: 27 November 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00145-4
Online ISBN: 978-3-540-36137-4
eBook Packages: Springer Book Archive
