Skip to main content

Indexing and Mining Audiovisual Data

  • Conference paper
Active Mining

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3430))

Abstract

In this paper we present a review of recent research and development works, which have been developed in the domain of indexing and mining audio-visual document. We first present the characteristics of the audio-visual documents and the outcomes of digitising this kind of documents. It raises several important issues concerning the new definition of what is a document, what is indexing and what are the numeric principles and technologies available for performing indexing and mining tasks. The analysis of these issue let us introduce the notion of temporal and multimedia objects, and the presentation of the three steps for indexing multimedia documents. It includes the clear distinction between descriptors and indexing. Finally we introduce the MPEG-7 paradigm, which sets the technical environment for developing indexing applications; Then we shortly review current developments, based on the text mining, the XML-Schema, and the event description interface approaches.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Auffret, G., Carrive, J., Chevet, O., Dechilly, T., Ronfard, R., Bachimont, B.: Audiovisual-based Hypermedia Authoring - using structured representations for efficient access to AV documents. In: ACM Hypertext (1999)

    Google Scholar 

  2. Akrivas, G., Ioannou, S., Karakoulakis, E., Karpouzis, K., Avrithis, Y., Delopoulos, A., Kollias, S., Vazirgiannis, M., Varlamis, I.: An Intelligent System for Retrieval and Mining of Audiovisual Material Based on the MPEG-7 Description Schemes. In: European Symposium on Intelligent Technologies, Hybrid Systems and their implementation on Smart Adaptive Systems (EUNITE), Spain (2001)

    Google Scholar 

  3. et Dechilly, T.B.B.: Ontologies pour l’indexation conceptuelle et structurelle de l’audiovisuel, Eyrolles, dans Ingénierie des connaissances (1999-2001)

    Google Scholar 

  4. Bachimont, B., Isaac, A., Troncy, R.: Semantic Commitment for Designing Ontologies: a Proposal. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS (LNAI), vol. 2473, p. 114. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  5. Bachimont, B.: Indexation multimédia, in Assistance intelligente à la recherche d’information, édité par Eric Gaussier et Marie-Hélène Stefanini,, Hermès (2002)

    Google Scholar 

  6. Bachimont, B.: Audiovisual indexing and automatic analysis: problems and perspectives from an archiving point of view Imagina 2002, Monaco, 12 February (2002)

    Google Scholar 

  7. Bachimont, B.: Le document audiovisuel, le numérique, ses usages et son archivage. Le document multimédia en sciences du traitement de l’information. Ecole thématique du CNRSGDR I3, Documents et évolution. Cépaduès-Editions, tome 1, p. 111-128 (2000)

    Google Scholar 

  8. Bachimont, B.: MPEG-7 and Ontologies: an editorial perspective. In: Virtual Systems and Multimedia, 98th edn., Gifu, Japan (October 1998)

    Google Scholar 

  9. Carrive, J., Pachet, F., Ronfard, R.: Using Description Logics for Indexing Audiovisual Documents. In: International Workshop on Description Logics (DL 1998), Trento, Italy, ITC-Irst, pp. 116–120 (1998)

    Google Scholar 

  10. Carrive, J., Pachet, F., Ronfard, R.: Logiques de descriptions pour l’analyse structurelle de film. In: Charlet, J., Zacklad, M., Kassel, G., Bourigault, D. (eds.) Ingéniérie des Connaissances, évolutions récentes et nouveaux défis, Eyrolles, pp. 423–438 (2000b)

    Google Scholar 

  11. Carrive, J., Pachet, F., Ronfard, R.: Clavis: a temporal reasoning system for classification of audiovisual sequences. In: Proceedings of the Content-Based Multimedia Access (RIAO 2000), Paris, France, April 12-14, pp. 1400–1415 (2000a)

    Google Scholar 

  12. Durand, G., Faudemay, P.: Cross-indexing and access to mixed-media contents. In: Proc. CBMI 2001 International Workshop on Content-Based Multimedia Indexing, Brescia, Italy (September 2001)

    Google Scholar 

  13. Erwig, M., Gueting, R.H., Schneider, M., Vazirgiannis, M.: Spatio-Temporal Data Types: An Approach to Modeling and Querying Moving Objects in Databases. GeoInformatica Journal 3(3) (1999)

    Google Scholar 

  14. Kohonen, T., Kaski, S., Lagus, K., Salojärvi, J., Honkela, J., Paatero, V., Saarela, A.: Self Organization of a Massive Document Collection. In: IEEE Transactions on Neural Networks, Special Issue on Neural Networks for Data Mining and Knowledge Discovery, vol. 11(3), pp. 574–585 (May 2000)

    Google Scholar 

  15. Lagus, K.: Text Mining with the WEBSOM. Acta Polytechnica Scandinavica, Mathematics and Computing Series no. 110, Espoo, 54 pp. D.Sc(Tech) Thesis, Helsinki University of Technology, Finland. URL (2000), http://www.cis.hut.fi/krista/thesis/

  16. Lespinasse, K., Bachimont, B.: Is Peritext a Key for Audiovisual Documents? The Use of Texts Describing Television Programs to Assist indexing. In: Gelbukh, A. (ed.) CICLing 2001. LNCS, vol. 2004, p. 505. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  17. Markousis, T., Tsirikos, D., Vazirgiannis, M., Stavrakas, G.: A Client-Server Design for Interactive Multimedia Documents based on Java. Elsevier - Computer Communications Journal (2000)

    Google Scholar 

  18. Mirbel, I., Pernici, B., Sellis, T., Tserkezoglou, S., Vazirgiannis, M.: Checking Temporal Integrity of Interactive Multimedia Documents. Very Large Data Bases journal 9(2), 111–130 (2000)

    Article  Google Scholar 

  19. Mirbel, I., Pernici, B., Vazirgiannis, M.: Temporal Integrity Constraints in Interactive Multimedia Documents. In: The proceedings of IEEE-International Conference on Multimedia Computing and Systems (ICMCS 1999), Florence, Italy (June 1999)

    Google Scholar 

  20. Ontrup, J., Ritter, H.: Text Categorization and Semantic Browsing with Self-Organizing Maps on non-euclidean Spaces. In: De Raedt, L., Siebes, A. (eds.) PKDD 2001. LNCS (LNAI), vol. 2168, pp. 338–349. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  21. Ontrup, J., Ritter, H.: Hyperbolic Self-Organizing Maps for Semantic Navigation. In: Advances in Neural Information Processing Systems, vol. 14 (2001)

    Google Scholar 

  22. Pampalk, E., Rauber, A., Merkl, D.: Content-based Organization and Visualization of Music Archives. In: Proceedings of the ACM Multimedia 2002, Juan les Pins, France, December 1-6, pp. 570–579 (2002)

    Google Scholar 

  23. Pampalk, E., Rauber, A., Merkl, D.: Using Smoothed Data Histograms for Cluster Visualization in Self-Organizing Maps. In: Dorronsoro, J.R. (ed.) ICANN 2002. LNCS, vol. 2415, pp. 871–876. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  24. Varlamis, I., Vazirgiannis, M., Poulos, P.: Using XML as a medium for describing, modifying and querying audiovisual content stored in relational database systems. In: International Workshop on Very Low Bitrate Video Coding (VLBV), Athens (2001)

    Google Scholar 

  25. Varlamis, I., Vazirgiannis, M.: Bridging XML-Schema and relational databases. A system for generating and manipulating relational databases using valid XML documents. In: The Proceedings of ACM Symposium on Document Engineering, Atlanta, USA (November 2001)

    Google Scholar 

  26. Varlamis, I., Vazirgiannis, M.: Web document searching using enhanced hyperlink semantics based on XML. In: Proceedings of IDEAS 2001, Grenoble, France (2001)

    Google Scholar 

  27. Vazirgiannis, M., Tsirikos, D., Markousis, Th., Trafalis, M., Stamati, Y., Hatzopoulos, M., Sellis, T.: Interactive Multimedia Documents: a Modeling, Authoring and Rendering approach. Multimedia Tools and Applications Journal (Kluwer Academic Publishers) (2000)

    Google Scholar 

  28. Vazirgiannis, M., Theodoridis, Y., Sellis, T.: Spatio-Temporal Composition and Indexing for Large Multimedia Applications. ACM/Springer-Verlag Multimedia Systems Journal 6(4) (1998)

    Google Scholar 

  29. Vazirgiannis, M., Mourlas, C.: An object Oriented Model for Interactive Multimedia Applications. The Computer Journal 36(1) (January 1993)

    Google Scholar 

  30. Vazirgiannis, M.: Multimedia Data Base Object and Application Modeling Issues and an Object Oriented Model. In: Multimedia Database Systems: Design and Implementation Strategies, pp. 208–250. Kluwer Academic Publishers, Dordrecht (1996)

    Google Scholar 

  31. Veneau, E., Ronfard, R., Bouthemy, P.: From Video Shot Clustering to Sequence Segmentation. In: Fifteenth International Conference on Pattern Recognition (ICPR 2000), Barcelona (September 2000)

    Google Scholar 

  32. Vodislav, D., Vazirgiannis, M.: Structured Interactive Animation for Multimedia Documents. In: Proceedings of IEEE Visual Languages Symposium, Seattle, USA (September 2000)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Morizet-Mahoudeaux, P., Bachimont, B. (2005). Indexing and Mining Audiovisual Data. In: Tsumoto, S., Yamaguchi, T., Numao, M., Motoda, H. (eds) Active Mining. Lecture Notes in Computer Science(), vol 3430. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11423270_3

Download citation

  • DOI: https://doi.org/10.1007/11423270_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26157-5

  • Online ISBN: 978-3-540-31933-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics