Anchor Shot Detection with Deep Neural Network

Feng, Bailan; Bai, Jinfeng; Chen, Zhineng; Huang, Xiangsheng; Xu, Bo

doi:10.1007/978-3-319-13168-9_34

Bailan Feng²¹,
Jinfeng Bai²¹,
Zhineng Chen²¹,
Xiangsheng Huang²¹ &
…
Bo Xu²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8879))

Included in the following conference series:

Pacific Rim Conference on Multimedia

2054 Accesses
1 Citations

Abstract

Anchor Shot Detection (ASD) is a key step for segmenting news videos into stories. However, the existing ASD methods are either channel-related or channel-limited which could not satisfy the requirement for achieving effective management of large-scale broadcast news videos. Considering the variety and diversity of large-scale news videos and channels, in this paper we propose a universal scheme based on deep neural network for anchor shot detection (DNN_ASD). Firstly, DNN_ASD consists of a training procedure of deep neural network to learn the appropriate anchor shot detector. Secondly, accompanied with imbalanced sampling strategy and face-assist verification, a universal scheme of anchor shot detection for large-scale news videos and channels is available. Parallel to this, the width and depth of neural network and the transfer ability are empirically discussed respectively as well. Encouraging experimental results on news videos from 30 TV channels demonstrate the effectiveness of the proposed scheme, as well as its superiority on transfer ability over traditional ASD methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hsu, W.H., Kenney, L.S., Chang, S.F., Franz, M., Smith, J.: Columbia-IBM News Video Story Segmentation in TRECVID. In: Proc. ACM CIVR, pp. 1–11 (2005)
Google Scholar
Xu, S., Feng, B.L., Chen, Z.N., Xu, B.: A General Framework of Video Segmentation to Logical Unit based on Conditional Random Fields. In: Proc. ACM ICMR, pp. 247–254 (2013)
Google Scholar
Feng, B.L., Chen, Z.N., Zheng, R., Xu, B.: Multiple Style Exploration for Story Unit Segmentation of Broadcast News Video. Multimedia Systems 20(4), 347–361 (2014)
Article Google Scholar
Smoliar, S.W., Zhang, H.J., Tao, S.Y., Gong, Y.: Automatic Parsing and Indexing of News Video. Multimedia Systems 2(6), 256–265 (1995)
Article Google Scholar
Hanjalic, A., Lagendijk, R.L., Biemond, J.: Semi-Automatic News Analysis, Indexing and Classification System based on Topics Preselection. In: Proc. of SPIE: Electronic Imaging: Storage and Retrieval of Image and Video Databases, San Jose, CA (1999)
Google Scholar
Liu, Z., Huang, Q.: Adaptive Anchor Detection using On-line Trained Audio/Visual Model. In: Proc. of SPIE, Storage and Retrieval for Media Database (2000)
Google Scholar
Santo, M.D., Foggia, P., Percannella, G., Sansone, C., Vento, M.: An Unsupervised Algorithm for Anchor Shot Detection. In: Proc. IEEE ICPR, pp. 1238–1241 (2006)
Google Scholar
Gao, X., Tang, X.: Unsupervised Video-Shot Segmentation and Model-Free Anchorperson Detection for News Video Story Parsing. IEEE Trans. Circuits and System for Video Technology 12(9), 765–776 (2002)
Article Google Scholar
Hsu, W., Chang, S.F., Huang, C.W., Kennedy, L., Lin, C.Y., Iyengar, G.: Discovery and Fusion of Salient Multi-Modal Features towards News Story Segmentation. In: Proc. of SPIE: Symposium on Electronic Imaging: Storage and Retrieval of Image/Video Database, San Jose, USA (2004)
Google Scholar
Ojala, T., Pietikainen, M., Maenpaa, T.: Multiresolution Gray-scale and Rotation Invariant Texture Classification with Local Binary Patterns. IEEE Trans. Pattern Analysis and Machine Intelligence 24(7), 971–987 (2002)
Article Google Scholar
Tesic, J., Natsev, A., Xie, L., Smith, J.R.: Data Modeling Strategy for Imbalanced Learning in Visual Search. In: Proc. IEEE ICME, pp. 1990–1993 (2007)
Google Scholar
Liu, C.L., Sako, H., Fujisawa, H.: Handwritten Chinese Character Recognition: Alternatives to Nonlinear Normalization. In: Proc. ICDAR, pp. 524–528 (2003)
Google Scholar
Bai, J., Chen, Z., Feng, B., Xu, B.: Chinese Image Character Recognition Using DNN and Machine Simulated Training Samples. In: Wermter, S., Weber, C., Duch, W., Honkela, T., Koprinkova-Hristova, P., Magg, S., Palm, G., Villa, A.E.P. (eds.) ICANN 2014. LNCS, vol. 8681, pp. 209–216. Springer, Heidelberg (2014)
Chapter Google Scholar
Intel, Compute-Intensive, Highly Parallel Applications and Uses. Intel Technology Journal 09 (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Interactive Digital Media Technology Research Center, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Bailan Feng, Jinfeng Bai, Zhineng Chen, Xiangsheng Huang & Bo Xu

Authors

Bailan Feng
View author publications
You can also search for this author in PubMed Google Scholar
Jinfeng Bai
View author publications
You can also search for this author in PubMed Google Scholar
Zhineng Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiangsheng Huang
View author publications
You can also search for this author in PubMed Google Scholar
Bo Xu
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, National University of Singapore, 117417, Singapore
Wei Tsang Ooi
Informatics Institute, Intelligent Systems Lab Amsterdam (ISLA), University of Amsterdam, Science Park 904, 1098 GH, Amsterdam, The Netherlands
Cees G. M. Snoek
Department of Computer Science, Universiti Tunku Abdul Rahman, 31900, Kampar, Perak, Malaysia
Hung Khoon Tan
Faculty of Computing and Informatics, Persiaran Multimedia, Multimedia University, 63100, Cyberjaya, Selangor, Malaysia
Chin-Kuan Ho
EURECOM, Campus Sophia Tech, 450 route des Chappes, 06904, Sophia Antipolis, France
Benoit Huet
Department of Computer Science, City University of Hong Kong, Tat Chee Ave, Kowloon, Hong Kong, China
Chong-Wah Ngo

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Feng, B., Bai, J., Chen, Z., Huang, X., Xu, B. (2014). Anchor Shot Detection with Deep Neural Network. In: Ooi, W.T., Snoek, C.G.M., Tan, H.K., Ho, CK., Huet, B., Ngo, CW. (eds) Advances in Multimedia Information Processing – PCM 2014. PCM 2014. Lecture Notes in Computer Science, vol 8879. Springer, Cham. https://doi.org/10.1007/978-3-319-13168-9_34

Download citation

DOI: https://doi.org/10.1007/978-3-319-13168-9_34
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13167-2
Online ISBN: 978-3-319-13168-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics