A Framework for Semantic Classification of Scenes Using Finite State Machines

Zhai, Yun; Rasheed, Zeeshan; Shah, Mubarak

doi:10.1007/978-3-540-27814-6_35

Yun Zhai²⁰,
Zeeshan Rasheed²¹ &
Mubarak Shah²⁰

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3115))

Included in the following conference series:

International Conference on Image and Video Retrieval

992 Accesses
15 Citations

Abstract

We address the problem of classifying scenes from feature films into semantic categories and propose a robust framework for this problem. We propose that the Finite State Machines (FSM) are suitable for detecting and classifying scenes and demonstrate their usage for three types of movie scenes; conversation, suspense and action. Our framework utilizes the structural information of the scenes together with the low and mid-level features. Low level features of video including motion and audio energy and a mid-level feature, face detection, are used in our approach. The transitions of the FSMs are determined by the features of each shot in the scene. Our FSMs have been experimented on over 60 clips and convincing results have been achieved.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Adams, B., Dorai, C., Venkatesh, S.: Novel Approach to Determining Tempo and Dramatic Story Sections in Motion Pictures, ICIP (2000)
Google Scholar
Rasheed, Z., Shah, M.: Scene Detection In Hollywood Movies and TV Shows. In: IEEE Computer Vision and Pattern Recognition Conference, Madison,Wisconsin, June 16-22 (2003)
Google Scholar
Yoshitaka, A., Ishii, T., Hirakawa, M., Ichikawa, T.: Content-Based Retrieval of Video Data by the Grammar of Film. In: IEEE Symposium on Visual Languages (1997)
Google Scholar
Lienhart, R., Pfeiffer, S., Effelsberg, W.: Scene Determination Based on Video and Audio Features. In: Proc. IEEE Conf. on Multimedia Computing and Systems, Florence, Italy (1999)
Google Scholar
Li, Y., Narayanan, S., Jay Kuo, C.-C.: Movie Content Analysis Indexing, and Skimming. In: Video Mining. ch.5, Kluwer Academic Publishers, Dordrecht (2003)
Google Scholar
Viola, P., Jones, M.: Robust Real-Time Object Detection, International Journal of Computer Vision (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science, University of Central Florida, Orlando, Florida, 32828, United States
Yun Zhai & Mubarak Shah
ObjectVideo, 11600, Sunrise Valley Dr. Suite 290, Reston, Virginia, 20191, United States
Zeeshan Rasheed

Authors

Yun Zhai
View author publications
You can also search for this author in PubMed Google Scholar
Zeeshan Rasheed
View author publications
You can also search for this author in PubMed Google Scholar
Mubarak Shah
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Computing, Mathematical and Information Sciences, University of Brighton, UK
Peter Enser
Informatics and Telematics Institute, Centre for Research and Technology-Hellas, 57001, Thessaloniki, Greece
Yiannis Kompatsiaris
Centre for Digital Video Processing, Adaptive Information Cluster, Dublin City University, Ireland
Noel E. O’Connor
Dublin City University, Dublin, Ireland
Alan F. Smeaton
ISLA lab, Informatics Institute, University of Amsterdam, The Netherlands
Arnold W. M. Smeulders

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhai, Y., Rasheed, Z., Shah, M. (2004). A Framework for Semantic Classification of Scenes Using Finite State Machines. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_35

Download citation

DOI: https://doi.org/10.1007/978-3-540-27814-6_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22539-3
Online ISBN: 978-3-540-27814-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics