Temporal Video Segmentation to Scene Based on Conditional Random Fileds

Xu, Su; Feng, Bailan; Xu, Bo

doi:10.1007/978-3-642-35728-2_36

Su Xu⁷,
Bailan Feng⁷ &
Bo Xu⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7733))

1963 Accesses
1 Citations

Abstract

In this paper, we propose a novel approach of video segmentation into scenes based on the technique of conditional random fields (CRFs). This approach is built upon the design in which scene segmentation is transformed into a label identification problem by defining three types of shots. To implement our algorithm, three middle-level features including shot difference signal, scene transition graph and audio type are extracted to depict the label properties of each shot, and then CRFs model is employed to identify the labels sequence. The advantage of CRFs model lies in its facility in integrating context information of neighboring shots, which produces accurate results in scene segmentation. The proposed approach is verified by seven types of data covering the most major genres of TV program. Experiments on testing data set yield average 0.88 F-measure, which illustrates that the proposed method can accurately detect most scenes in different genres of programs.

This work was supported by the National Natural Science Foundation of China (Grant No.61202326).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Petersohn, C.: Logical unit and scene detection: a comparative survey. In: Multimedia Content Access: Algorithms and Systems II, vol. 6820, pp. 2–17 (2008)
Google Scholar
Yeung, M., Yeo, B.L., Liu, B.: Segmentation of video by clustering and graph analysis. Computer Vision and Image Understanding 71, 94–109 (1998)
Article Google Scholar
Chong-Wah, N., Yu-Fei, M., Hong-Jiang, Z.: Video summarization and scene detection by graph modeling. IEEE Transactions on Circuits and Systems for Video Technology 15, 296–305 (2005)
Article Google Scholar
Jianbo, S., Malik, J.: Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence 22, 888–905 (2000)
Article Google Scholar
Rasheed, Z., Shah, M.: Detection and representation of scenes in videos. IEEE Transactions on Multimedia 7, 1097–1105 (2005)
Article Google Scholar
Yun, Z., Shah, M.: Video scene segmentation using markov chain monte carlo. IEEE Transactions on Multimedia 8, 686–697 (2006)
Article Google Scholar
Chasanis, V.T., Likas, A.C., Galatsanos, N.P.: Scene detection in videos using shot clustering and sequence alignment. IEEE Transactions on Multimedia 11, 89–100 (2009)
Article Google Scholar
Sakarya, U., Telatar, Z.: Video scene detection using graph-based representations. Signal Processing: Image Communication 25, 774–783 (2010)
Google Scholar
Jinhui, Y., Huiyi, W., Lan, X., Wujie, Z., Jianmin, L., Fuzong, L., Bo, Z.: A formal study of shot boundary detection. IEEE Transactions on Circuits and Systems for Video Technology 17, 168–186 (2007)
Article Google Scholar
Xinbo, G., Xiaoou, T.: Unsupervised video-shot segmentation and model-free anchorperson detection for news video story parsing. IEEE Transactions on Circuits and Systems for Video Technology 12, 765–776 (2002)
Article Google Scholar
Kudo, T.: Crf++: Yet another crf toolkit (2005)
Google Scholar
Li, Y., Dorai, C.: Svm-based audio classification for instructional video analysis. In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), vol. 5, pp. 897–900 (2004)
Google Scholar
Sutton, C., McCallum, A.: An Introduction to Conditional Random Fields. ArXiv e-prints (2010)
Google Scholar
Klinger, R., Tomanek, K., Klinger, R.: Classical probabilistic models and conditional random fields (2007)
Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Automation Chinese Academy of Sciences, Beijing, 100190, China
Su Xu, Bailan Feng & Bo Xu

Authors

Su Xu
View author publications
You can also search for this author in PubMed Google Scholar
Bailan Feng
View author publications
You can also search for this author in PubMed Google Scholar
Bo Xu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Asia, 5 Danling Street, 100080, Beijing, China
Shipeng Li & Tao Mei &
School of Electrical Engineering and Computer Science, University of Ottawa, 800 King Edward, K1N 6N5, Ottawa, ON, Canada
Abdulmotaleb El Saddik
School of Computer and Information, Hefei University of Technology, Road Tunxi 193#, 230009, Hefei, Anhui, China
Meng Wang & Richang Hong &
Department of Information Engineering and Computer Science, University of Trento, ommarive 14, 38100, Trento, Italy
Nicu Sebe
Department of Electrical and Computer Engineering, National University of Singapore, 4 Engineering Drive 3, 117583, Singapore, Singapore
Shuicheng Yan
School of Computing, CLARITY: Centre for Sensor Web Technologies, Dublin City University, Glasnevin, 9, Dublin, Ireland
Cathal Gurrin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xu, S., Feng, B., Xu, B. (2013). Temporal Video Segmentation to Scene Based on Conditional Random Fileds. In: Li, S., et al. Advances in Multimedia Modeling. Lecture Notes in Computer Science, vol 7733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35728-2_36

Download citation

DOI: https://doi.org/10.1007/978-3-642-35728-2_36
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35727-5
Online ISBN: 978-3-642-35728-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics