Abstract
This paper presents a model to represent a broadcasted sports video in a semantic way and proposes a method of automatically generating semantic descriptions of significant scenes. Representation of a video should clarify the semantic content of the video as accurately as possible. Our model structurizes the video and specifies suitable semantic descriptions for video segments paying attention to the structure of both a sports game and a sports TV program. As the elements of these semantic descriptions, the proposed method tries to obtain the information about the plays and their related players from the closed-caption stream by searching key phrases. Finding the corresponding segments of the video by means of template matching for the image stream attaches these textual descriptions to the proper portion of the video. In this paper, we discuss some experimental results of our method and the potentiality for integrating these results into the standardized MPEG-7 description tools.
Similar content being viewed by others
References
N. Babaguchi, Y. Kawai, and T. Kitahashi, "Event based indexing of broadcasted sports video by intermodal collaboration," IEEE Transaction on Multimedia, Vol. 4, No. 1, pp. 68–75,2001
A.B. Benitez, D. Zhong, S.-F. Chang, and J.R. Smith, "MPEG-7 MDS content description tools and applica-tions," Lecture Notes in Computer Science.
S.-F. Chang, T. Sikora, and A. Puri, "Overview of the MPEG-7 Standard," IEEE Transactions on Circuits and Systems for Video Technology, Vol. 11, No. 6, pp. 688–695,2001
Y. Chang, W. Zeng, I. Kamel, and R. Alonso, "Integrated image and speech analysis for content-based video indexing," in Proc. IEEE ICMCS'96, 1996, pp. 306–313.
Q. Huang, Z. Liu, A. Rosenberg, D. Gibbon, and B. Shahraray, "Automated generation of news content hierarchy by integrating audio, video, and text information", in Proc. ICASSP'99, 1999, Vol. 6, pp. 3025–3028.
M. Lazarescu, S. Venkatesh, G. West, and T. Caelli, "On the automated interpretation and indexing of American football," in Proc. IEEE ICMCS'99, 1999, Vol. 1, pp. 802–806.
R. Lienhart, S. Pfeiffer, and W. Effelsberg, "Video abstracting," Communications of the ACM, Vol. 40, No. 12, pp. 55–62,1997
I. Mani, D. House, M.T. Maybury, and M. Green, "Towards content-based browsing of broadcast" in Intelligent Multimedia Information Retrieval, The MIT Press, 1997, pp. 241–258.
MPEG MDS Group, "Text of 15938-5 FCD information technology-multimedia content description interface-part 5 multimedia description schemes," ISO/IEC JTC1/SC29/WG11 MPEG01/M7009, Singa-pore, 2001.
Y. Nakamura and T. Kanade, "Semantic analysis for video contents extraction-spotting by association in news video," in Proc. of The Fifth ACM International Multimedia Conference, 1997, pp. 393–402.
N. Nitta, N. Babaguchi, and T. Kitahashi, "Extracting actors, actions and events from sports video-a funda-mental approach to story tracking-," in ICPR'00, 2000, pp. 718–721.
S. Satoh, Y. Nakamura, and T. Kanade, "Name-it: naming and detecting faces in news videos," IEEE Multi-media, pp. 22–35,1999
M.A. Smith and T. Kanade, "Video skimming and characterization through the combination of image and language understanding techniques," in Proc. of CVPR97, 1997, pp. 775–781.
P. Xu, L. Xie, S.F. Chang, A. Divakaran, A. Vetro, and H. Sun, "Algorithms and system for segmentation and structure analysis in soccer video," in Proc. of IEEE ICME'01, 2001, pp. 928–931.
D. Zhong and S.F. Chang, "Structure analysis of sports video using domain models," in Proc. of IEEE ICME'01, 2001, pp. 920–923.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Nitta, N., Babaguchi, N. & Kitahashi, T. Generating Semantic Descriptions of Broadcasted Sports Videos Based on Structures of Sports Games and TV Programs. Multimedia Tools and Applications 25, 59–83 (2005). https://doi.org/10.1023/B:MTAP.0000046382.62218.e1
Issue Date:
DOI: https://doi.org/10.1023/B:MTAP.0000046382.62218.e1