Story-Based Retrieval by Learning and Measuring the Concept-Based and Content-Based Similarity

Peng, Yuxin; Xiao, Jianguo

doi:10.1007/978-3-642-11301-7_38

Yuxin Peng²¹ &
Jianguo Xiao²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5916))

Included in the following conference series:

International Conference on Multimedia Modeling

2084 Accesses
2 Citations

Abstract

This paper proposes a new idea and approach for the story-based news video retrieval, i.e. clip-based retrieval. Generally speaking, clip-based retrieval can be divided into two phases: feature representation and similarity ranking. The existing methods only adopt the content-based features and pairwise similarity measure for clip-based retrieval. The main deficiencies are: (1) In feature representation, the concept-based features is still not used to represent the content of video clip; (2) In similarity ranking, the learning-based method is not considered to rank the similar clips with the query. To address the above issues, in this paper, on one hand, we consider jointly the concept-based and content-based features to represent adequately the news story; on the other hand, we consider jointly the learning classifier and pairwise similarity measure to rank effectively the similar stories with the query. Both are the main novelty of this paper. The model construction of learning classifier for story-based retrieval is our focus, which is constructed as follows: given one query story, we can use its all keyframes as the set of positive examples of its topic, and the retrieval data set in which most of the keyframes are irrelevant to the topic as the candidates of negative examples. The multi-bag SVM is employed to compute the score of all keyframes in the data set, and then the stories in the data set are ranked according to the average score of their keyframes, which reflects their similarity with the query story. We compare and evaluate the performance of our approach on 1334 stories from TRECVID 2005 benchmark, and the results show our approach can achieve superior performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cheung, S.C., Zakhor, A.: Efficient Video Similarity Measurement with Video Signature. IEEE Trans. on Circuits and Systems for Video Technology (CSVT) 13(1) (January 2003)
Google Scholar
Yuan, J., Duan, L.-Y., Tian, Q., Xu, C.: Fast and Robust Short Video Clip Search Using an Index Structure. In: ACM International Workshop on Multimedia Information Retrieval (MIR) (October 2004)
Google Scholar
Chen, L., Chua, T.S.: A Match and Tiling Approach to Content-based Video Retrieval. In: IEEE International Conference on Multimedia and Expo. (ICME), pp. 417–420 (2001)
Google Scholar
Jain, A.K., Vailaya, A., Xiong, W.: Query by Video Clip. Multimedia System 7, 369–384 (1999)
Article Google Scholar
Lienhart, R., Effelsberg, W.: A Systematic Method to Compare and Retrieve Video Sequences. Multimedia Tools and Applications 10(1) (January 2000)
Google Scholar
Liu, X., Zhuang, Y., Pan, Y.: A New Approach to Retrieve Video by Example Video Clip. In: ACM Multimedia Conference, MM (1999)
Google Scholar
Peng, Y., Ngo, C.-W.: Clip-Based Similarity Measure for Query-Dependent Clip Retrieval and Video Summarization. IEEE Trans. on Circuits and Systems for Video Technology (CSVT) 16(5) (2006)
Google Scholar
Peng, Y., Ngo, C.-W.: EMD-Based Video Clip Retrieval by Many-to-Many Matching. In: Leow, W.-K., Lew, M., Chua, T.-S., Ma, W.-Y., Chaisorn, L., Bakker, E.M. (eds.) CIVR 2005. LNCS, vol. 3568, pp. 71–81. Springer, Heidelberg (2005)
Google Scholar
Wu, X., Hauptmann, A.G., Ngo, C.-W.: Novelty Detection for Cross-Lingual News Stories with Visual Duplicates and Speech Transcripts. In: ACM Multimedia Conference, MM (2007)
Google Scholar
Yanagawa, A., Chang, S.-F., Kennedy, L., Hsu, W.: Columbia University’s Baseline Detectors for 374 LSCOM Semantic Visual Concepts. Columbia University ADVENT Technical Report #222-2006-8, March 20 (2007)
Google Scholar
Yanagawa, A., Hsu, W., Chang, S.-F.: Brief Descriptions of Visual Features for Baseline TRECVID Concept Detectors. Columbia University ADVENT Technical Report #219-2006-5 (July 2006)
Google Scholar
Tesic, J., Natssev, A., Smith, J.R.: Cluster-based data modeling for semantic video search. In: ACM International Conference on Image and Video Retrieval, CIVR (2007)
Google Scholar
Naphade, M.R., Yeung, M.M., Yeo, B.L.: A Novel Scheme for Fast and Efficient Video Sequence Matching Using Compact Signatures. In: SPIE: Storage and Retrieval for Media Databases, pp. 564–572 (2000)
Google Scholar
Hsu, W.H., Chang, S.-F.: Topic Tracking across Broadcast News Videos with Visual Duplicates and Semantic Concepts. In: International Conference on Image Processing (ICIP), Atlanta, GA (October 2006)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science and Technology, Peking University,
Yuxin Peng & Jianguo Xiao

Authors

Yuxin Peng
View author publications
You can also search for this author in PubMed Google Scholar
Jianguo Xiao
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Oldenburg, Germany
Susanne Boll
University of Texas at San Antonio,, TX, San Antonio, USA
Qi Tian
Microsoft Research Asia, Beijing, P.R. China
Lei Zhang
Southwest University, Beibei, Chongqing, China
Zili Zhang
School of Engineering and Information Technology, Deakin University, 221 Burwood Highway, Vic, 3125, Australia
Yi-Ping Phoebe Chen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Peng, Y., Xiao, J. (2010). Story-Based Retrieval by Learning and Measuring the Concept-Based and Content-Based Similarity. In: Boll, S., Tian, Q., Zhang, L., Zhang, Z., Chen, YP.P. (eds) Advances in Multimedia Modeling. MMM 2010. Lecture Notes in Computer Science, vol 5916. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11301-7_38

Download citation

DOI: https://doi.org/10.1007/978-3-642-11301-7_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11300-0
Online ISBN: 978-3-642-11301-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics