Skip to main content

Extracting Bimodal Representations for Language-Based Image Retrieval

  • Conference paper
Multimedia ’99

Part of the book series: Eurographics ((EUROGRAPH))

Abstract

This paper explores two approaches to multimedia indexing that might contribute to the advancement of text-based conceptual search for pictorial information. Insights from relatively mature retrieval areas (spoken document retrieval and cross-language retrieval) are taken as a starting point. for an investigation of the usefulness of the concept of bimodal dictionaries and of clustering features from multi-modal documents into one semantic space. One of the advantages of the presented techniques is that they are domain independent.

Part of the research for this paper has been funded by the Dutch organisation for Scientific Research NWO.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Brachsler, M. and Schäuble, P. Multilingual Information Retrieval Based on Document Alignment Techniques, Proceedings of the second European Digital Libraries Conference, 1998.

    Google Scholar 

  2. Cascia, M. La, Sethi, S., Sclaroff, S., Combining textual and visual cues for content-based image retrieval on the world wide web, In: IEEE Workshop on content-based access of image and video libraries, 1998

    Google Scholar 

  3. Deerwester, S., Dumais, S.T., Harshman, R., Indexing by Latent Semantic Analysis, In: Journal of the American Society for Information Science, 41 (6), pp391–407, 1990.

    Article  Google Scholar 

  4. Dumais, S.T., Landauer, T.K., Littman, M.L., Automatic Cross-Linguistic Information Retrieval using Latent Semantic Indexing, In: Proceedings SIGIR96, Workshop On Cross-Linguistic Information Retrieval, 1996.

    Google Scholar 

  5. Flickner, M., Sawhney, H., Niblack, W. Ashley, J., Huang, Q., Dom, B., Gorkani, M., Hafner, J., Lee, D., Petkovic, D. Steele, D. and Yanker, P. Query by image and video content: the QBIC system, In: Maybury, M.T. (ed.) Intelligent multimedia information retrieval, pages 7–22, 1997.

    Google Scholar 

  6. Forsythe, G.E., Malcolm, M.A. and Moler, C.B., Least squares and the singular value decomposition In: Computer Methods for Mathematical Computations, (Chapter 9 ), Englewood Cliffs, NJ: Prentice Hall, 1977.

    Google Scholar 

  7. Gevers, T. and Smeulders, A.W.M., PicToSeek: A content-based image search engine for the WWW, In: Proceedings of VISUAL’97, 1997

    Google Scholar 

  8. Hiemstra, D. and W. Kraaij, Twenty-One at TREC-7: Ad-hoc and Cross-language track, In: Proceedings of the seventh Text Retrieval Conference TREC-7, Nist Special Publications, 1999.

    Google Scholar 

  9. Hiemstra, D., Multilingual domain modeling in Twenty-One: automatic creation of a bidirectional translation lexicon from a parallel corpus, In: Peter-Arno Coppen, Hans van Halteren and Lisanne Teunissen (eds.), Proceedings of the eighth CLIN meeting, pages 41–58, 1998.

    Google Scholar 

  10. Jong, F. de, Twenty-One: a baseline for multimedia retrieval, In: Proceedings of the 14th Twente Workshop on Language Technology TWLT-14, pp 189–195,1998.

    Google Scholar 

  11. Kraaij, W., J. van Gent, R. Ekkelenkamp, and D. van Leeuwen, Phoneme-based Spoken Document Retrieval, In: Proceedings of the 14th Twente Workshop on Language Technology TWLT-14, pp 141–153, 1998.

    Google Scholar 

  12. Marsicoi, M., Clinque, L. and Levialdi, S., Indexing pictorial documents by their content: a survey of current techniques, In: Image and vision computing 15, pages 119–141, 1997

    Google Scholar 

  13. Netter, K. and F. de Jong, Olive: speech based video retrieval, In: Proceedings of the 14th Twente Workshop on Language Technology TWLT-14, pp 187–189, 1998.

    Google Scholar 

  14. Oard, D.W. and Don, B.J., A Survey of Multilingual Text Retrieval, Technical report TR96–19, University of Maryland, http://www.ee.umd.edu/medlab/mlir/mlir.html

    Google Scholar 

  15. Smeulders, A.W.M., T. Gevers and M.L. Kersten, Computer vision and image search engines. In: Proceedings of the 14th Twente Workshop on Language Technology TWLT14, pp 107–116, 1998.

    Google Scholar 

  16. Yang, Y. Carbonell, J.G., Brown, R.D., Frederking, R.E., Translingual Information Retrieval: Learning from Bilingual Corpora, Artificial Intelligence 103 (1–2), pp 323–345, 1998.

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2000 Springer-Verlag/Wien

About this paper

Cite this paper

Westerveld, T., Hiemstra, D., de Jong, F. (2000). Extracting Bimodal Representations for Language-Based Image Retrieval. In: Correia, N., Chambel, T., Davenport, G. (eds) Multimedia ’99. Eurographics. Springer, Vienna. https://doi.org/10.1007/978-3-7091-6771-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-7091-6771-7_5

  • Publisher Name: Springer, Vienna

  • Print ISBN: 978-3-211-83437-4

  • Online ISBN: 978-3-7091-6771-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics