Extracting Bimodal Representations for Language-Based Image Retrieval

Westerveld, Thijs; Hiemstra, Djoerd; de Jong, Franciska

doi:10.1007/978-3-7091-6771-7_5

Thijs Westerveld⁴,
Djoerd Hiemstra⁴ &
Franciska de Jong⁴

Part of the book series: Eurographics ((EUROGRAPH))

152 Accesses
1 Citations

Abstract

This paper explores two approaches to multimedia indexing that might contribute to the advancement of text-based conceptual search for pictorial information. Insights from relatively mature retrieval areas (spoken document retrieval and cross-language retrieval) are taken as a starting point. for an investigation of the usefulness of the concept of bimodal dictionaries and of clustering features from multi-modal documents into one semantic space. One of the advantages of the presented techniques is that they are domain independent.

Part of the research for this paper has been funded by the Dutch organisation for Scientific Research NWO.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Brachsler, M. and Schäuble, P. Multilingual Information Retrieval Based on Document Alignment Techniques, Proceedings of the second European Digital Libraries Conference, 1998.
Google Scholar
Cascia, M. La, Sethi, S., Sclaroff, S., Combining textual and visual cues for content-based image retrieval on the world wide web, In: IEEE Workshop on content-based access of image and video libraries, 1998
Google Scholar
Deerwester, S., Dumais, S.T., Harshman, R., Indexing by Latent Semantic Analysis, In: Journal of the American Society for Information Science, 41 (6), pp391–407, 1990.
Article Google Scholar
Dumais, S.T., Landauer, T.K., Littman, M.L., Automatic Cross-Linguistic Information Retrieval using Latent Semantic Indexing, In: Proceedings SIGIR96, Workshop On Cross-Linguistic Information Retrieval, 1996.
Google Scholar
Flickner, M., Sawhney, H., Niblack, W. Ashley, J., Huang, Q., Dom, B., Gorkani, M., Hafner, J., Lee, D., Petkovic, D. Steele, D. and Yanker, P. Query by image and video content: the QBIC system, In: Maybury, M.T. (ed.) Intelligent multimedia information retrieval, pages 7–22, 1997.
Google Scholar
Forsythe, G.E., Malcolm, M.A. and Moler, C.B., Least squares and the singular value decomposition In: Computer Methods for Mathematical Computations, (Chapter 9 ), Englewood Cliffs, NJ: Prentice Hall, 1977.
Google Scholar
Gevers, T. and Smeulders, A.W.M., PicToSeek: A content-based image search engine for the WWW, In: Proceedings of VISUAL’97, 1997
Google Scholar
Hiemstra, D. and W. Kraaij, Twenty-One at TREC-7: Ad-hoc and Cross-language track, In: Proceedings of the seventh Text Retrieval Conference TREC-7, Nist Special Publications, 1999.
Google Scholar
Hiemstra, D., Multilingual domain modeling in Twenty-One: automatic creation of a bidirectional translation lexicon from a parallel corpus, In: Peter-Arno Coppen, Hans van Halteren and Lisanne Teunissen (eds.), Proceedings of the eighth CLIN meeting, pages 41–58, 1998.
Google Scholar
Jong, F. de, Twenty-One: a baseline for multimedia retrieval, In: Proceedings of the 14th Twente Workshop on Language Technology TWLT-14, pp 189–195,1998.
Google Scholar
Kraaij, W., J. van Gent, R. Ekkelenkamp, and D. van Leeuwen, Phoneme-based Spoken Document Retrieval, In: Proceedings of the 14th Twente Workshop on Language Technology TWLT-14, pp 141–153, 1998.
Google Scholar
Marsicoi, M., Clinque, L. and Levialdi, S., Indexing pictorial documents by their content: a survey of current techniques, In: Image and vision computing 15, pages 119–141, 1997
Google Scholar
Netter, K. and F. de Jong, Olive: speech based video retrieval, In: Proceedings of the 14th Twente Workshop on Language Technology TWLT-14, pp 187–189, 1998.
Google Scholar
Oard, D.W. and Don, B.J., A Survey of Multilingual Text Retrieval, Technical report TR96–19, University of Maryland, http://www.ee.umd.edu/medlab/mlir/mlir.html
Google Scholar
Smeulders, A.W.M., T. Gevers and M.L. Kersten, Computer vision and image search engines. In: Proceedings of the 14th Twente Workshop on Language Technology TWLT14, pp 107–116, 1998.
Google Scholar
Yang, Y. Carbonell, J.G., Brown, R.D., Frederking, R.E., Translingual Information Retrieval: Learning from Bilingual Corpora, Artificial Intelligence 103 (1–2), pp 323–345, 1998.
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Centre for Telematics and Information Technology, University of Twente, PO Box 217, 7500 AE, Enschede, The Netherlands
Thijs Westerveld, Djoerd Hiemstra & Franciska de Jong

Authors

Thijs Westerveld
View author publications
You can also search for this author in PubMed Google Scholar
Djoerd Hiemstra
View author publications
You can also search for this author in PubMed Google Scholar
Franciska de Jong
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

New University of Lisbon, Caparica, Portugal
Nuno Correia
University of Lisbon, Lisbon, Portugal
Teresa Chambel
MIT Media Laboratory, Cambridge, MA, USA
Glorianna Davenport

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Westerveld, T., Hiemstra, D., de Jong, F. (2000). Extracting Bimodal Representations for Language-Based Image Retrieval. In: Correia, N., Chambel, T., Davenport, G. (eds) Multimedia ’99. Eurographics. Springer, Vienna. https://doi.org/10.1007/978-3-7091-6771-7_5

Download citation

DOI: https://doi.org/10.1007/978-3-7091-6771-7_5
Publisher Name: Springer, Vienna
Print ISBN: 978-3-211-83437-4
Online ISBN: 978-3-7091-6771-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics