Abstract
Broadcasters are showing interest in systems that ease the process of annotating the huge amount of live and archived video material. Exploitation of such assets is considered a key method for improving production quality. Developing systems that support effective retrieval of videos by content requires performing a wide spectrum of operations on video streams, including temporal segmentation, analysis of the audio and video tracks, and identification and recognition of text. Low-level features are then processed to provide a higher-level description of video content, since most user queries relate to higher-level syntax and semantics rather than to the lower, lexical level. The specificity of different application domains requires that different solutions be adopted in different contexts. This may affect both the choice of low-level features to be extracted and the modeling of the specific domain knowledge required to address higher-level semantics.
In this paper we report on our experience in the application contexts of news and soccer videos, and show the solutions adopted to cope with the specific requirements of these application domains.
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Assfalg, J., Bertini, M., Colombo, C., Del Bimbo, A., Nunziati, W. (2002). Semantic Annotation and Indexing of News and Sports Videos. In: Grosky, W.I., Plášil, F. (eds) SOFSEM 2002: Theory and Practice of Informatics. SOFSEM 2002. Lecture Notes in Computer Science, vol 2540. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36137-5_6
DOI: https://doi.org/10.1007/3-540-36137-5_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00145-4
Online ISBN: 978-3-540-36137-4
eBook Packages: Springer Book Archive