Abstract
In this paper we present a review of recent research and development works, which have been developed in the domain of indexing and mining audio-visual document. We first present the characteristics of the audio-visual documents and the outcomes of digitising this kind of documents. It raises several important issues concerning the new definition of what is a document, what is indexing and what are the numeric principles and technologies available for performing indexing and mining tasks. The analysis of these issue let us introduce the notion of temporal and multimedia objects, and the presentation of the three steps for indexing multimedia documents. It includes the clear distinction between descriptors and indexing. Finally we introduce the MPEG-7 paradigm, which sets the technical environment for developing indexing applications; Then we shortly review current developments, based on the text mining, the XML-Schema, and the event description interface approaches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Auffret, G., Carrive, J., Chevet, O., Dechilly, T., Ronfard, R., Bachimont, B.: Audiovisual-based Hypermedia Authoring - using structured representations for efficient access to AV documents. In: ACM Hypertext (1999)
Akrivas, G., Ioannou, S., Karakoulakis, E., Karpouzis, K., Avrithis, Y., Delopoulos, A., Kollias, S., Vazirgiannis, M., Varlamis, I.: An Intelligent System for Retrieval and Mining of Audiovisual Material Based on the MPEG-7 Description Schemes. In: European Symposium on Intelligent Technologies, Hybrid Systems and their implementation on Smart Adaptive Systems (EUNITE), Spain (2001)
et Dechilly, T.B.B.: Ontologies pour l’indexation conceptuelle et structurelle de l’audiovisuel, Eyrolles, dans Ingénierie des connaissances (1999-2001)
Bachimont, B., Isaac, A., Troncy, R.: Semantic Commitment for Designing Ontologies: a Proposal. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, p. 114. Springer, Heidelberg (2002)
Bachimont, B.: Indexation multimédia, in Assistance intelligente à la recherche d’information, édité par Eric Gaussier et Marie-Hélène Stefanini,, Hermès (2002)
Bachimont, B.: Audiovisual indexing and automatic analysis: problems and perspectives from an archiving point of view Imagina 2002, Monaco, 12 February (2002)
Bachimont, B.: Le document audiovisuel, le numérique, ses usages et son archivage. Le document multimédia en sciences du traitement de l’information. Ecole thématique du CNRSGDR I3, Documents et évolution. Cépaduès-Editions, tome 1, p. 111-128 (2000)
Bachimont, B.: MPEG-7 and Ontologies: an editorial perspective. In: Virtual Systems and Multimedia, 98th edn., Gifu, Japan (October 1998)
Carrive, J., Pachet, F., Ronfard, R.: Using Description Logics for Indexing Audiovisual Documents. In: International Workshop on Description Logics (DL 1998), Trento, Italy, ITC-Irst, pp. 116–120 (1998)
Carrive, J., Pachet, F., Ronfard, R.: Logiques de descriptions pour l’analyse structurelle de film. In: Charlet, J., Zacklad, M., Kassel, G., Bourigault, D. (eds.) Ingéniérie des Connaissances, évolutions récentes et nouveaux défis, Eyrolles, pp. 423–438 (2000b)
Carrive, J., Pachet, F., Ronfard, R.: Clavis: a temporal reasoning system for classification of audiovisual sequences. In: Proceedings of the Content-Based Multimedia Access (RIAO 2000), Paris, France, April 12-14, pp. 1400–1415 (2000a)
Durand, G., Faudemay, P.: Cross-indexing and access to mixed-media contents. In: Proc. CBMI 2001 International Workshop on Content-Based Multimedia Indexing, Brescia, Italy (September 2001)
Erwig, M., Gueting, R.H., Schneider, M., Vazirgiannis, M.: Spatio-Temporal Data Types: An Approach to Modeling and Querying Moving Objects in Databases. GeoInformatica Journal 3(3) (1999)
Kohonen, T., Kaski, S., Lagus, K., Salojärvi, J., Honkela, J., Paatero, V., Saarela, A.: Self Organization of a Massive Document Collection. In: IEEE Transactions on Neural Networks, Special Issue on Neural Networks for Data Mining and Knowledge Discovery, vol. 11(3), pp. 574–585 (May 2000)
Lagus, K.: Text Mining with the WEBSOM. Acta Polytechnica Scandinavica, Mathematics and Computing Series no. 110, Espoo, 54 pp. D.Sc(Tech) Thesis, Helsinki University of Technology, Finland. URL (2000), http://www.cis.hut.fi/krista/thesis/
Lespinasse, K., Bachimont, B.: Is Peritext a Key for Audiovisual Documents? The Use of Texts Describing Television Programs to Assist indexing. In: Gelbukh, A. (ed.) CICLing 2001. LNCS, vol. 2004, p. 505. Springer, Heidelberg (2001)
Markousis, T., Tsirikos, D., Vazirgiannis, M., Stavrakas, G.: A Client-Server Design for Interactive Multimedia Documents based on Java. Elsevier - Computer Communications Journal (2000)
Mirbel, I., Pernici, B., Sellis, T., Tserkezoglou, S., Vazirgiannis, M.: Checking Temporal Integrity of Interactive Multimedia Documents. Very Large Data Bases journal 9(2), 111–130 (2000)
Mirbel, I., Pernici, B., Vazirgiannis, M.: Temporal Integrity Constraints in Interactive Multimedia Documents. In: The proceedings of IEEE-International Conference on Multimedia Computing and Systems (ICMCS 1999), Florence, Italy (June 1999)
Ontrup, J., Ritter, H.: Text Categorization and Semantic Browsing with Self-Organizing Maps on non-euclidean Spaces. In: De Raedt, L., Siebes, A. (eds.) PKDD 2001. LNCS (LNAI), vol. 2168, pp. 338–349. Springer, Heidelberg (2001)
Ontrup, J., Ritter, H.: Hyperbolic Self-Organizing Maps for Semantic Navigation. In: Advances in Neural Information Processing Systems, vol. 14 (2001)
Pampalk, E., Rauber, A., Merkl, D.: Content-based Organization and Visualization of Music Archives. In: Proceedings of the ACM Multimedia 2002, Juan les Pins, France, December 1-6, pp. 570–579 (2002)
Pampalk, E., Rauber, A., Merkl, D.: Using Smoothed Data Histograms for Cluster Visualization in Self-Organizing Maps. In: Dorronsoro, J.R. (ed.) ICANN 2002. LNCS, vol. 2415, pp. 871–876. Springer, Heidelberg (2002)
Varlamis, I., Vazirgiannis, M., Poulos, P.: Using XML as a medium for describing, modifying and querying audiovisual content stored in relational database systems. In: International Workshop on Very Low Bitrate Video Coding (VLBV), Athens (2001)
Varlamis, I., Vazirgiannis, M.: Bridging XML-Schema and relational databases. A system for generating and manipulating relational databases using valid XML documents. In: The Proceedings of ACM Symposium on Document Engineering, Atlanta, USA (November 2001)
Varlamis, I., Vazirgiannis, M.: Web document searching using enhanced hyperlink semantics based on XML. In: Proceedings of IDEAS 2001, Grenoble, France (2001)
Vazirgiannis, M., Tsirikos, D., Markousis, Th., Trafalis, M., Stamati, Y., Hatzopoulos, M., Sellis, T.: Interactive Multimedia Documents: a Modeling, Authoring and Rendering approach. Multimedia Tools and Applications Journal (Kluwer Academic Publishers) (2000)
Vazirgiannis, M., Theodoridis, Y., Sellis, T.: Spatio-Temporal Composition and Indexing for Large Multimedia Applications. ACM/Springer-Verlag Multimedia Systems Journal 6(4) (1998)
Vazirgiannis, M., Mourlas, C.: An object Oriented Model for Interactive Multimedia Applications. The Computer Journal 36(1) (January 1993)
Vazirgiannis, M.: Multimedia Data Base Object and Application Modeling Issues and an Object Oriented Model. In: Multimedia Database Systems: Design and Implementation Strategies, pp. 208–250. Kluwer Academic Publishers, Dordrecht (1996)
Veneau, E., Ronfard, R., Bouthemy, P.: From Video Shot Clustering to Sequence Segmentation. In: Fifteenth International Conference on Pattern Recognition (ICPR 2000), Barcelona (September 2000)
Vodislav, D., Vazirgiannis, M.: Structured Interactive Animation for Multimedia Documents. In: Proceedings of IEEE Visual Languages Symposium, Seattle, USA (September 2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Morizet-Mahoudeaux, P., Bachimont, B. (2005). Indexing and Mining Audiovisual Data. In: Tsumoto, S., Yamaguchi, T., Numao, M., Motoda, H. (eds) Active Mining. Lecture Notes in Computer Science(), vol 3430. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11423270_3
Download citation
DOI: https://doi.org/10.1007/11423270_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26157-5
Online ISBN: 978-3-540-31933-7
eBook Packages: Computer ScienceComputer Science (R0)