Abstract
This paper proposes a new idea and approach for the story-based news video retrieval, i.e. clip-based retrieval. Generally speaking, clip-based retrieval can be divided into two phases: feature representation and similarity ranking. The existing methods only adopt the content-based features and pairwise similarity measure for clip-based retrieval. The main deficiencies are: (1) In feature representation, the concept-based features is still not used to represent the content of video clip; (2) In similarity ranking, the learning-based method is not considered to rank the similar clips with the query. To address the above issues, in this paper, on one hand, we consider jointly the concept-based and content-based features to represent adequately the news story; on the other hand, we consider jointly the learning classifier and pairwise similarity measure to rank effectively the similar stories with the query. Both are the main novelty of this paper. The model construction of learning classifier for story-based retrieval is our focus, which is constructed as follows: given one query story, we can use its all keyframes as the set of positive examples of its topic, and the retrieval data set in which most of the keyframes are irrelevant to the topic as the candidates of negative examples. The multi-bag SVM is employed to compute the score of all keyframes in the data set, and then the stories in the data set are ranked according to the average score of their keyframes, which reflects their similarity with the query story. We compare and evaluate the performance of our approach on 1334 stories from TRECVID 2005 benchmark, and the results show our approach can achieve superior performance.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cheung, S.C., Zakhor, A.: Efficient Video Similarity Measurement with Video Signature. IEEE Trans. on Circuits and Systems for Video Technology (CSVT) 13(1) (January 2003)
Yuan, J., Duan, L.-Y., Tian, Q., Xu, C.: Fast and Robust Short Video Clip Search Using an Index Structure. In: ACM International Workshop on Multimedia Information Retrieval (MIR) (October 2004)
Chen, L., Chua, T.S.: A Match and Tiling Approach to Content-based Video Retrieval. In: IEEE International Conference on Multimedia and Expo. (ICME), pp. 417–420 (2001)
Jain, A.K., Vailaya, A., Xiong, W.: Query by Video Clip. Multimedia System 7, 369–384 (1999)
Lienhart, R., Effelsberg, W.: A Systematic Method to Compare and Retrieve Video Sequences. Multimedia Tools and Applications 10(1) (January 2000)
Liu, X., Zhuang, Y., Pan, Y.: A New Approach to Retrieve Video by Example Video Clip. In: ACM Multimedia Conference, MM (1999)
Peng, Y., Ngo, C.-W.: Clip-Based Similarity Measure for Query-Dependent Clip Retrieval and Video Summarization. IEEE Trans. on Circuits and Systems for Video Technology (CSVT) 16(5) (2006)
Peng, Y., Ngo, C.-W.: EMD-Based Video Clip Retrieval by Many-to-Many Matching. In: Leow, W.-K., Lew, M., Chua, T.-S., Ma, W.-Y., Chaisorn, L., Bakker, E.M. (eds.) CIVR 2005. LNCS, vol. 3568, pp. 71–81. Springer, Heidelberg (2005)
Wu, X., Hauptmann, A.G., Ngo, C.-W.: Novelty Detection for Cross-Lingual News Stories with Visual Duplicates and Speech Transcripts. In: ACM Multimedia Conference, MM (2007)
Yanagawa, A., Chang, S.-F., Kennedy, L., Hsu, W.: Columbia University’s Baseline Detectors for 374 LSCOM Semantic Visual Concepts. Columbia University ADVENT Technical Report #222-2006-8, March 20 (2007)
Yanagawa, A., Hsu, W., Chang, S.-F.: Brief Descriptions of Visual Features for Baseline TRECVID Concept Detectors. Columbia University ADVENT Technical Report #219-2006-5 (July 2006)
Tesic, J., Natssev, A., Smith, J.R.: Cluster-based data modeling for semantic video search. In: ACM International Conference on Image and Video Retrieval, CIVR (2007)
Naphade, M.R., Yeung, M.M., Yeo, B.L.: A Novel Scheme for Fast and Efficient Video Sequence Matching Using Compact Signatures. In: SPIE: Storage and Retrieval for Media Databases, pp. 564–572 (2000)
Hsu, W.H., Chang, S.-F.: Topic Tracking across Broadcast News Videos with Visual Duplicates and Semantic Concepts. In: International Conference on Image Processing (ICIP), Atlanta, GA (October 2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Peng, Y., Xiao, J. (2010). Story-Based Retrieval by Learning and Measuring the Concept-Based and Content-Based Similarity. In: Boll, S., Tian, Q., Zhang, L., Zhang, Z., Chen, YP.P. (eds) Advances in Multimedia Modeling. MMM 2010. Lecture Notes in Computer Science, vol 5916. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11301-7_38
Download citation
DOI: https://doi.org/10.1007/978-3-642-11301-7_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11300-0
Online ISBN: 978-3-642-11301-7
eBook Packages: Computer ScienceComputer Science (R0)