Skip to main content

Fusion of Audio-Visual Features and Statistical Property for Commercial Segmentation

  • Conference paper
Advances in Multimedia Modeling (MMM 2013)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7732))

Included in the following conference series:

  • 2244 Accesses

Abstract

Commercial segmentation is a primary step of commercial management which is an emerging technology. Relative to general video scene segmentation, commercial segmentation is particular because of dramatic changes in acoustic effect and chromatic composition. Conventional algorithms emphasize on utilizing new audio and visual features to adapt with change over time. In this paper, we have proposed a novel scheme to fuse audio-visual characteristics and statistical property of commercial length to find individual commercial boundaries. First, mid-level descriptors such as Static Shot with Product Information (SSPI) are used to predict the likelihoods of commercial boundary for every shot boundary. And then, Dynamic Programming (DP) refiner with Distribution of Individual Commercial Length (DICL) constraint is applied to find the optimal path of a Markov Chain of these shot boundaries. Experiments on simulated and real datasets show promising results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Lienhart, R., Kuhmunch, C., Effelsberg, W.: On the Detection and Recognition of Television Commercials. In: Proc. of IEEE Conf. on Multimedia Computing and Systems (1997)

    Google Scholar 

  2. Hauptmann, A.G., Witbrock, M.J.: Story segmentation and detection of commercials in broadcast news video. In: Proc. Conf. ADL (1998)

    Google Scholar 

  3. Ajmera, J., McCowan, I., Bourlard, H.: Robust speaker change detection. IEEE Signal Processing Letters 11, 649–651 (2004)

    Article  Google Scholar 

  4. Rasheed, Z., Shah, M.: Detection and representation of scenes in videos. IEEE Transactions on Multimedia 7, 1097–1105 (2005)

    Article  Google Scholar 

  5. Hua, X.S., Lu, L., Zhang, H.J.: Robust Learning-based TV Commercial Detection. In: Proc. ICME (2005)

    Google Scholar 

  6. Mizutani, M., Ebadollahi, S., Chang, S.F.: Commercial Detection in Heterogeneous Video Streams Using Fused Multi-Modal and Temporal Features. In: Pro. ICASSP (2005)

    Google Scholar 

  7. Duan, L.Y., Wang, J.Q., Zheng, Y., Jin, J.S., Lu, H.Q., Xu, C.S.: Segmentation, categorization, and identification of commercials from tv streams using multimodal analysis. In: Proc. ACM MM 2006, pp. 202–210 (2006)

    Google Scholar 

  8. Yuan, J.H., Wang, H.Y., Xiao, L., Zheng, W.J., Li, J.M., Lin, F.Z., Zhang, B.: A Formal Study of Shot Boundary Detection. IEEE Trans. on Circuits and Systems for Video Technology 17, 168–186 (2007)

    Article  Google Scholar 

  9. Duxans, H., Conejero, D., Anguera, X.: Audio-based automatic management of TV commercials. In: Pro. ICASSP (2009)

    Google Scholar 

  10. Chasanis, V.T., Likas, A.C., Galatsanos, N.P.: Scene Detection in Videos Using Shot Clustering and Sequence Alignment. IEEE Transactions on Multimedia 11, 89–100 (2009)

    Article  Google Scholar 

  11. Newstead, K., Romaniuk, J.: Cost Per Second: The Relative Effectiveness of 15- and 30-Second Television Advertisements. Journal of Advertising Research 50, 68–76 (2009)

    Article  Google Scholar 

  12. Liu, N., Zhao, Y., Zhu, Z.F., Lu, H.Q.: Exploiting Visual-Audio-Textual Characteristics for Automatic TV Commercial Block Detection and Segmentation. IEEE Transactions on Multimedia 13, 961–973 (2011)

    Article  Google Scholar 

  13. Sidiropoulos, P., Mezaris, V., Kompatsiaris, I., Meinedo, H., Bugalho, M., Trancoso, I.: Temporal video segmentation to scenes using high-level audiovisual features. IEEE Transactions on Circuits and Systems for Video Technology 21, 1163–1177 (2011)

    Article  Google Scholar 

  14. Libsvm, http://www.csie.ntu.edu.tw/cjlin/libsvm/

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, B., Feng, B., Xu, B. (2013). Fusion of Audio-Visual Features and Statistical Property for Commercial Segmentation. In: Li, S., et al. Advances in Multimedia Modeling. MMM 2013. Lecture Notes in Computer Science, vol 7732. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35725-1_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35725-1_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35724-4

  • Online ISBN: 978-3-642-35725-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics