Abstract
This paper describes a novel browsing paradigm that takes advantage of the various types of links (e.g. thematic, temporal, or reference links) that can be automatically built between multimedia documents. This browsing paradigm can help reveal the hidden structures of multimedia archives or expand search results to related media. The paper presents a model for browsing any kind of multimedia archive and then focuses on an archive of meeting recordings, in order to illustrate the advantage of our method for cross-meeting and, more generally, cross-document browsing. First, the structure of the meeting datasets is presented, describing in particular the media involved, the annotations used for cross-document linking, and the major mining techniques integrated in this work. Then, the paper gives an overview of the visual browser we developed, which combines searching and browsing by links. Next, the performance of the current system is discussed, i.e. the automatic indexing and linking processes for the two different meeting corpora, as well as access and browsing performance. Finally, the paper presents the major unsolved issues and our perspectives for future work.
Copyright information
© 2007 IFIP International Federation for Information Processing
About this paper
Cite this paper
Rigamonti, M., Lalanne, D., Ingold, R. (2007). FaericWorld: Browsing Multimedia Events Through Static Documents and Links. In: Baranauskas, C., Palanque, P., Abascal, J., Barbosa, S.D.J. (eds) Human-Computer Interaction – INTERACT 2007. INTERACT 2007. Lecture Notes in Computer Science, vol 4662. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74796-3_11
DOI: https://doi.org/10.1007/978-3-540-74796-3_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74794-9
Online ISBN: 978-3-540-74796-3
eBook Packages: Computer Science, Computer Science (R0)