Abstract
Highlight event detection is a fundamental step of semantic based video retrieval and personalized sports video browsing. In this paper, an enhanced hidden Markov models (EHMM) based soccer video event detection method is proposed. Firstly, each soccer video shot is classified into one of the thirteen middle level semantics. Then the sequential soccer video sequence is segmented into event clips. Finally, HMMs are utilized to model the defined four highlights (goal, shoot, foul, and placed kick) and a normal kick. Not only the transitions of the middle level semantics and but also the overall features of an event clip are fused by HMMs to determine the event type. Comparisons are made with some existing soccer video event detection approaches. Experimental results show the effectiveness of the proposed EHMM based soccer video event detection approach. The influences of hidden state number and overall feature types to the event detection performances are discussed.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Li, B., Errico, J., Pan, H., Sezan, M.: Bridging the semantic gap in sports video retrieval and summarization. J. Vis. Commun. Image R. 17, 393–424 (2004)
Rabiner, L.: A tutorial on hidden markov models and selected applications in speech recognition. Proceedings of the IEEE 77(2), 257–285 (1989)
Pan, H., Li, B., Sezan, M.: Automatic detection of replay segments in broadcast sports programs by detecting of logos in scene transitions. In: Proc. Int. Conf. Acoustics, Speech, and Signal Processing, May 2002, vol. 4, pp. 3385–3388 (2002)
Zhao, Z., Jiang, S., Huang, Q., Zhu, G.: Highlight summarization in sports video based on replay detection. In: Proc. Int. Conf. Mulmedia and Expo., Toronto, Ontario, Canada, July 2006, pp. 1613–1616 (2006)
Cheng, C., Hsu, C.: Fusion of audio and motion information on HMM-based highlight extraction for baseball games. IEEE Trans. Multimedia 8(3), 585–599 (2006)
Xie, L., Chang, S., Divakaran, A., Sun, H.: Structure analysis of soccer video with hidden Markov models. In: Proc. Int. Conf. Acoustics, Speech, and Signal Processing, pp. 4096–4099 (2002)
Ekin, Tekalp, A.: Generic play-break event detection for summarization and hierarchical sports video analysis. In: Proc. Int. Conf. Mulmedia and Expo., vol. 1, pp. 169–172 (2003)
Snoek, Worring, M.: Multimedia event-based video indexing using time intervals. IEEE Trans. Multimedia 7(4), 638–647 (2005)
Zhu, G., Xu, C., Huang, Q., Rui, Y., Jiang, S., Gao, W., Yao, H.: Event Tactic Analysis Based on Broadcast Sport Video. IEEE Trans. Multimedia 11(1), 49–67 (2009)
Chen, S., Chen, M., Zhang, C., Shyu, M.: Exciting event detection using multi-level multimodal descriptors and data classification. In: Proc. ISM (2006)
Wang, T., Li, J., Diao, Q., Hu, W., Zhang, Y., Dulong, C.: Semantic event detection using conditional random fields. In: Proc. Computer Vision and Pattern Recognition Workshop, pp. 109–115 (2006)
Nan, N., Liu, G., Qian, X., Wang, C.: An SVM-based soccer video shot classification scheme using projection histograms. In: Huang, Y.-M.R., Xu, C., Cheng, K.-S., Yang, J.-F.K., Swamy, M.N.S., Li, S., Ding, J.-W. (eds.) PCM 2008. LNCS, vol. 5353, pp. 883–886. Springer, Heidelberg (2008)
Wickramaratna, K., Chen, M., Chen, S., Shyu, M.: Neural network based framework for goal event detection in soccer videos. In: Proc. Int. Symposium on Multimedia, December 2005, pp. 21–28 (2005)
Duan, L., Xu, M., Chua, T., Tian, Q., Xu, C.: A mid-level representation framework for semantic sports video analysis. In: Proc. ACM Multimedia, pp. 29–32 (2003)
Sadlier, D., O’Connor, N.: Event detection in field sports video using audio-visual features and a support vector Machine. IEEE Trans. Circuits Syst. Video Technol. 15(10), 602–615 (2005)
Xu, P., Xie, L., Chang, S.: Algorithms and systems for segmentation and structure analysis in soccer video. In: Proc. Int. Conf. Multimedia & Expo., pp. 184–187 (2001)
Xu, C., Wang, J., Lu, H., Zhang, Y.: A Novel Framework for Semantic Annotation and Personalized Retrieval of Sports Video. IEEE Transactions on Multimedia 10(3), 421–436 (2008)
Duan, L., Xu, M., Tian, Q., Xu, C., Jin, J.S.: A unified framework for semantic shot classification in sports video. IEEE Trans. Multimedia 7(6), 1066–1083 (2005)
Ding, Y., Fan, G., Bryan, W.: Two-layer generative models for sport video mining. In: Proc. Int. Conf. Multimedia & Expo., pp. 1731–1734 (2007)
Ekin, Tekalp, A., Mehrotra, R.: Automatic soccer video analysis and summarization. IEEE Trans. Image Processing 12(7), 796–807 (2003)
Dao, M., Babaguchi, N.: Sports event detection using temporal patterns mining and web-casting text. In: Proc. ACM AREA, pp. 33–40 (2008)
Zhu, X., Wu, X., Elmagarmid, A., Feng, Z., Wu, L.: Video data mining semantic indexing and event detection from the association perspective. IEEE Trans. Knowledge and Data Engineering 17(5), 665–677 (2005)
Xiong, Z., Radhakrishnan, R., Divakaran, A., Huang, T.: Highlights extraction from sports video based on an audio-visual marker detection framework. In: Proc. Int. Conf. Multimedia & Expo., pp. 29–32 (2005)
Xu, C., Zhang, Y., Zhu, G., Rui, Y., Lu, H., Huang, Q.: Using Webcast Text for Semantic Event Detection in Broadcast Sports Video. IEEE Trans. Multimedia 10(7), 1342–1345 (2008)
Wang, Y., Liu, Z., Huang, J.: Multimedia content analysis using both audio and video clues. IEEE Signal Processing Magazine (2000)
Huang, C., Shih, H., Chao, C.: Semantic analysis of soccer video using dynamic Bayesian network. IEEE Trans. Multimedia 8(4), 749–760 (2006)
Zhang, D., Chang, S.: Event detection in baseball video using superimposed caption recognition. In: Proc. ACM Multimedia, Juan-les-Pins, France, November 1, pp. 315–318 (2002)
Su, Y., Sun, M., Hsu, V.: Global motion estimation from coarsely sampled motion vector field and the applications. IEEE Trans. Circuits Syst. Video Technol. 15(2), 232–242 (2005)
Lyu, M., Song, J., Cai, M.: A comprehensive method for text detection, localization, and extraction. IEEE Trans. Circuits and Systems for Video Technology 15(2), 243–255 (2005)
Wang, J., Xu, C., Chng, E., Tian, Q.: Sports highlight detection from keyword sequences using HMM. In: ICME 2004 (2004)
Lienhart, R., Wernicke, A.: Localizing and segmenting text in images and videos. IEEE Trans. Circuits Syst. Video Technol. 12(4), 256–267 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Qian, X., Liu, G., Wang, H., Li, Z., Wang, Z. (2010). Soccer Video Event Detection by Fusing Middle Level Visual Semantics of an Event Clip. In: Qiu, G., Lam, K.M., Kiya, H., Xue, XY., Kuo, CC.J., Lew, M.S. (eds) Advances in Multimedia Information Processing - PCM 2010. PCM 2010. Lecture Notes in Computer Science, vol 6298. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15696-0_41
Download citation
DOI: https://doi.org/10.1007/978-3-642-15696-0_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15695-3
Online ISBN: 978-3-642-15696-0
eBook Packages: Computer ScienceComputer Science (R0)