Abstract
This paper explores two approaches to multimedia indexing that might contribute to the advancement of text-based conceptual search for pictorial information. Insights from relatively mature retrieval areas (spoken document retrieval and cross-language retrieval) are taken as a starting point. for an investigation of the usefulness of the concept of bimodal dictionaries and of clustering features from multi-modal documents into one semantic space. One of the advantages of the presented techniques is that they are domain independent.
Part of the research for this paper has been funded by the Dutch organisation for Scientific Research NWO.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Brachsler, M. and Schäuble, P. Multilingual Information Retrieval Based on Document Alignment Techniques, Proceedings of the second European Digital Libraries Conference, 1998.
Cascia, M. La, Sethi, S., Sclaroff, S., Combining textual and visual cues for content-based image retrieval on the world wide web, In: IEEE Workshop on content-based access of image and video libraries, 1998
Deerwester, S., Dumais, S.T., Harshman, R., Indexing by Latent Semantic Analysis, In: Journal of the American Society for Information Science, 41 (6), pp391–407, 1990.
Dumais, S.T., Landauer, T.K., Littman, M.L., Automatic Cross-Linguistic Information Retrieval using Latent Semantic Indexing, In: Proceedings SIGIR96, Workshop On Cross-Linguistic Information Retrieval, 1996.
Flickner, M., Sawhney, H., Niblack, W. Ashley, J., Huang, Q., Dom, B., Gorkani, M., Hafner, J., Lee, D., Petkovic, D. Steele, D. and Yanker, P. Query by image and video content: the QBIC system, In: Maybury, M.T. (ed.) Intelligent multimedia information retrieval, pages 7–22, 1997.
Forsythe, G.E., Malcolm, M.A. and Moler, C.B., Least squares and the singular value decomposition In: Computer Methods for Mathematical Computations, (Chapter 9 ), Englewood Cliffs, NJ: Prentice Hall, 1977.
Gevers, T. and Smeulders, A.W.M., PicToSeek: A content-based image search engine for the WWW, In: Proceedings of VISUAL’97, 1997
Hiemstra, D. and W. Kraaij, Twenty-One at TREC-7: Ad-hoc and Cross-language track, In: Proceedings of the seventh Text Retrieval Conference TREC-7, Nist Special Publications, 1999.
Hiemstra, D., Multilingual domain modeling in Twenty-One: automatic creation of a bidirectional translation lexicon from a parallel corpus, In: Peter-Arno Coppen, Hans van Halteren and Lisanne Teunissen (eds.), Proceedings of the eighth CLIN meeting, pages 41–58, 1998.
Jong, F. de, Twenty-One: a baseline for multimedia retrieval, In: Proceedings of the 14th Twente Workshop on Language Technology TWLT-14, pp 189–195,1998.
Kraaij, W., J. van Gent, R. Ekkelenkamp, and D. van Leeuwen, Phoneme-based Spoken Document Retrieval, In: Proceedings of the 14th Twente Workshop on Language Technology TWLT-14, pp 141–153, 1998.
Marsicoi, M., Clinque, L. and Levialdi, S., Indexing pictorial documents by their content: a survey of current techniques, In: Image and vision computing 15, pages 119–141, 1997
Netter, K. and F. de Jong, Olive: speech based video retrieval, In: Proceedings of the 14th Twente Workshop on Language Technology TWLT-14, pp 187–189, 1998.
Oard, D.W. and Don, B.J., A Survey of Multilingual Text Retrieval, Technical report TR96–19, University of Maryland, http://www.ee.umd.edu/medlab/mlir/mlir.html
Smeulders, A.W.M., T. Gevers and M.L. Kersten, Computer vision and image search engines. In: Proceedings of the 14th Twente Workshop on Language Technology TWLT14, pp 107–116, 1998.
Yang, Y. Carbonell, J.G., Brown, R.D., Frederking, R.E., Translingual Information Retrieval: Learning from Bilingual Corpora, Artificial Intelligence 103 (1–2), pp 323–345, 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag/Wien
About this paper
Cite this paper
Westerveld, T., Hiemstra, D., de Jong, F. (2000). Extracting Bimodal Representations for Language-Based Image Retrieval. In: Correia, N., Chambel, T., Davenport, G. (eds) Multimedia ’99. Eurographics. Springer, Vienna. https://doi.org/10.1007/978-3-7091-6771-7_5
Download citation
DOI: https://doi.org/10.1007/978-3-7091-6771-7_5
Publisher Name: Springer, Vienna
Print ISBN: 978-3-211-83437-4
Online ISBN: 978-3-7091-6771-7
eBook Packages: Springer Book Archive