Abstract
Indexing videos by their image content is an important issue for digital audiovisual archives. While much work has been devoted to classification and indexing methods based on perceptual qualities of images, such as color, shape and texture, there is also a need for classi fication and indexing of some structural properties of images. In this paper, we present some methods for image classification in video, based on the presence, size and location of faces and captions. We argue that such classifications are highly domain-dependent, and are best handled using flexible knowledge management systems (in our case, a description logics).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Aigrain, Ph., Joly, Ph. and Longueville, V. Medium knowledge-based macrosegmentation of video into sequences Intelligent multimedia information retrieval, AAAI Press-MIT Press, 1997.
Borgida, A., Brachman, R.J., McGuiness, D.L., Resnick, L.A. 1989. CLASSIC: A Structural Data Model for Objects. ACM SIGMOD Int. Conf. on Management of Data, 1989.
Bouthemy, P., Garcia C., Ronfard R., Tziritas G., Veneau E. Scene segmentation and image feature extraction for video indexing and retrieval. VISUAL’99, Amsterdam, 1999.
Carrive, J., Pachet F., Ronfard R. Using Description Logics for Indexing Audiovisual Documents. Proceedings of the International Workshop on Description Logics, Trento, Italy, 1998.
Carrive, J., Pachet F., Ronfard R. Clavis: a temporal reasoning system for classification of audiovisual sequences. RIAO, Paris, April 2000.
Carrive, J., Pachet F., Ronfard R. A Language for Audiovisual Template Specification and Recognition Int. Conference on Constraint Programming, Singapore,September 2000.
Chan, Y. and Lin, S.H. and Tan, Y.P. and Kung, S.Y. Video shot classification using human faces. IEEE Intern. Conference on Image Processing, September 1996.
Chopra K., Srihari R.K.. Control Structures for Incorporating Picture-Specific Context in Image Interpretation. in: Proceedings of Int’l Joint Conf. on Artificial Intelligence, 1995.
Garcia C., Zikos G., Tziritas G.. Wavelet Packet Analysis for Face Recognition. To appear in Image and Vision Computing, 18(4).
Garcia C. and Tziritas G.. Face Detection Using Quantized Skin Color Regions Merging andWavelet Packet Analysis. IEEE Transactions on Multimedia, 1(3):264–277, Sept. 1999.
Garcia C., Apostolidis X.. Text Detection and Segmentation in Complex Color Images. IEEE International Conference on Acoustics, Speech, and Signal, June 5–9 2000, Istanbul, Turkey.
Ide, I., Yamamoto, K. and Tanaka, H. Automatic indexing to video based on shot classification. Advanced Multimedia Content Processing, LNCS1554, November 1998.
Jaimes, A. and Chang, S.F. Model-based classification of visual information for content-based retrieval Storage and Retrieval for Image and Video Databases, SPI99, San Jose, January 1999.
Patel-Schneider, P. and Swartout, B., KRSS Description Logic Specification from the KRSS Effort, http://www.ida.liu.se/labs/iislab/people/patla/DL/, January 1992.
Ronfard, R. Shot-level indexing and matching of video content. Storage and Retrieval for Image and Video Databases, SPIE, October 1997.
Satoh S., Kanade T.. Name-it: Association of Face and Name in Video. in: Proc. of Computer Vision and Pattern Recognition. IEEE Compu ter Society Press, pp. 368–373, 1997.
Gunsel, B. and Ferman, A.M. and Tekalp, A.M. Video Indexing Through Integration of Syntactic and Semantic Features. WACV, 1996.
Ferman, A.M., Tekalp, A.M. and Mehrotra, R. Effective Content Representation for Video IEEE Intern. Conference on Image Processing, October 1998.
Thomson, R. Grammar of the shot. Media Manual, Focal Press, Oxford, UK, 1998.
Yeung, M. and Yeo, B.-L. Time-constrained Clustering for Segmentation of Video into Story Units International Conference on Pattern Recognition, 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ronfard, R., Garcia, C., Carrive, J. (2000). Conceptual Indexing of Television Images Based on Face and Caption Sizes and Locations. In: Laurini, R. (eds) Advances in Visual Information Systems. VISUAL 2000. Lecture Notes in Computer Science, vol 1929. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-40053-2_31
Download citation
DOI: https://doi.org/10.1007/3-540-40053-2_31
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41177-2
Online ISBN: 978-3-540-40053-0
eBook Packages: Springer Book Archive