Conceptual Indexing of Television Images Based on Face and Caption Sizes and Locations

Ronfard, Remi; Garcia, Christophe; Carrive, Jean

doi:10.1007/3-540-40053-2_31

Remi Ronfard⁵,
Christophe Garcia⁶ &
Jean Carrive⁵

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1929))

Included in the following conference series:

International Conference on Advances in Visual Information Systems

505 Accesses

Abstract

Indexing videos by their image content is an important issue for digital audiovisual archives. While much work has been devoted to classification and indexing methods based on perceptual qualities of images, such as color, shape and texture, there is also a need for classi fication and indexing of some structural properties of images. In this paper, we present some methods for image classification in video, based on the presence, size and location of faces and captions. We argue that such classifications are highly domain-dependent, and are best handled using flexible knowledge management systems (in our case, a description logics).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aigrain, Ph., Joly, Ph. and Longueville, V. Medium knowledge-based macrosegmentation of video into sequences Intelligent multimedia information retrieval, AAAI Press-MIT Press, 1997.
Google Scholar
Borgida, A., Brachman, R.J., McGuiness, D.L., Resnick, L.A. 1989. CLASSIC: A Structural Data Model for Objects. ACM SIGMOD Int. Conf. on Management of Data, 1989.
Google Scholar
Bouthemy, P., Garcia C., Ronfard R., Tziritas G., Veneau E. Scene segmentation and image feature extraction for video indexing and retrieval. VISUAL’99, Amsterdam, 1999.
Google Scholar
Carrive, J., Pachet F., Ronfard R. Using Description Logics for Indexing Audiovisual Documents. Proceedings of the International Workshop on Description Logics, Trento, Italy, 1998.
Google Scholar
Carrive, J., Pachet F., Ronfard R. Clavis: a temporal reasoning system for classification of audiovisual sequences. RIAO, Paris, April 2000.
Google Scholar
Carrive, J., Pachet F., Ronfard R. A Language for Audiovisual Template Specification and Recognition Int. Conference on Constraint Programming, Singapore,September 2000.
Google Scholar
Chan, Y. and Lin, S.H. and Tan, Y.P. and Kung, S.Y. Video shot classification using human faces. IEEE Intern. Conference on Image Processing, September 1996.
Google Scholar
Chopra K., Srihari R.K.. Control Structures for Incorporating Picture-Specific Context in Image Interpretation. in: Proceedings of Int’l Joint Conf. on Artificial Intelligence, 1995.
Google Scholar
Garcia C., Zikos G., Tziritas G.. Wavelet Packet Analysis for Face Recognition. To appear in Image and Vision Computing, 18(4).
Google Scholar
Garcia C. and Tziritas G.. Face Detection Using Quantized Skin Color Regions Merging andWavelet Packet Analysis. IEEE Transactions on Multimedia, 1(3):264–277, Sept. 1999.
Article Google Scholar
Garcia C., Apostolidis X.. Text Detection and Segmentation in Complex Color Images. IEEE International Conference on Acoustics, Speech, and Signal, June 5–9 2000, Istanbul, Turkey.
Google Scholar
Ide, I., Yamamoto, K. and Tanaka, H. Automatic indexing to video based on shot classification. Advanced Multimedia Content Processing, LNCS1554, November 1998.
Google Scholar
Jaimes, A. and Chang, S.F. Model-based classification of visual information for content-based retrieval Storage and Retrieval for Image and Video Databases, SPI99, San Jose, January 1999.
Google Scholar
Patel-Schneider, P. and Swartout, B., KRSS Description Logic Specification from the KRSS Effort, http://www.ida.liu.se/labs/iislab/people/patla/DL/, January 1992.
Ronfard, R. Shot-level indexing and matching of video content. Storage and Retrieval for Image and Video Databases, SPIE, October 1997.
Google Scholar
Satoh S., Kanade T.. Name-it: Association of Face and Name in Video. in: Proc. of Computer Vision and Pattern Recognition. IEEE Compu ter Society Press, pp. 368–373, 1997.
Google Scholar
Gunsel, B. and Ferman, A.M. and Tekalp, A.M. Video Indexing Through Integration of Syntactic and Semantic Features. WACV, 1996.
Google Scholar
Ferman, A.M., Tekalp, A.M. and Mehrotra, R. Effective Content Representation for Video IEEE Intern. Conference on Image Processing, October 1998.
Google Scholar
Thomson, R. Grammar of the shot. Media Manual, Focal Press, Oxford, UK, 1998.
Google Scholar
Yeung, M. and Yeo, B.-L. Time-constrained Clustering for Segmentation of Video into Story Units International Conference on Pattern Recognition, 1996.
Google Scholar

Download references

Author information

Authors and Affiliations

INA, 4 avenue de l’Europe, 94366, Bry-sur-Marne, France
Remi Ronfard & Jean Carrive
ICS-FORTH, 10 Heraklion, P.O.Box 1385, GR 711, Crete, Greece
Christophe Garcia

Authors

Remi Ronfard
View author publications
You can also search for this author in PubMed Google Scholar
Christophe Garcia
View author publications
You can also search for this author in PubMed Google Scholar
Jean Carrive
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Claude Bernard University of Lyon, LISI - 502, INSA de Lyon, 69621, Villeurbanne Cedex, France
Robert Laurini

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ronfard, R., Garcia, C., Carrive, J. (2000). Conceptual Indexing of Television Images Based on Face and Caption Sizes and Locations. In: Laurini, R. (eds) Advances in Visual Information Systems. VISUAL 2000. Lecture Notes in Computer Science, vol 1929. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40053-2_31

Download citation

DOI: https://doi.org/10.1007/3-540-40053-2_31
Published: 11 February 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41177-2
Online ISBN: 978-3-540-40053-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics