Natural-Language-Based Conversion of Images to Mobile Multimedia Experiences

  • Conference paper
User Centric Media (UCMEDIA 2009)

Abstract

We describe an approach for viewing any large, detail-rich picture on a small display by generating a video from the image, as taken by a virtual camera moving across it at varying distances. Our main innovation is the ability to build the virtual camera’s motion from a textual description of a picture, e.g., a museum caption, so that the relevance and ordering of image regions are determined by co-analyzing image annotations and natural language text. Furthermore, our system arranges the resulting presentation so that it is synchronized with an audio track generated from the text by a text-to-speech system.
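The idea in the abstract can be illustrated with a small sketch. It is not the authors' system: the names `Region`, `Keyframe`, and `plan_camera_path`, the exact-word matching of labels, and the fixed speaking rate `words_per_second` are all assumptions made for illustration. A real pipeline would presumably use proper linguistic analysis and take timings from the text-to-speech engine instead of estimating them.

```python
from dataclasses import dataclass


@dataclass
class Region:
    """An annotated rectangle in the image (hypothetical annotation format)."""
    label: str
    x: int
    y: int
    w: int
    h: int


@dataclass
class Keyframe:
    """Where the virtual camera should be, and when."""
    time_s: float  # estimated moment the label is spoken
    label: str
    cx: float      # camera centre, image coordinates
    cy: float
    zoom: float    # 1.0 shows the whole image; larger values move closer


def plan_camera_path(caption, regions, image_w, image_h, words_per_second=2.5):
    """Order annotated regions by their first mention in the caption.

    Regions never mentioned in the text are treated as irrelevant and skipped.
    Timing is a rough estimate from a constant speaking rate (an assumption).
    """
    words = [w.strip(".,;:!?\"'()").lower() for w in caption.split()]
    by_label = {r.label.lower(): r for r in regions}
    keyframes = []
    seen = set()
    for index, word in enumerate(words):
        region = by_label.get(word)
        if region is None or word in seen:
            continue
        seen.add(word)
        # Zoom so the region just fits the viewport in both dimensions.
        zoom = min(image_w / region.w, image_h / region.h)
        keyframes.append(Keyframe(
            time_s=index / words_per_second,
            label=region.label,
            cx=region.x + region.w / 2,
            cy=region.y + region.h / 2,
            zoom=zoom,
        ))
    return keyframes
```

Calling `plan_camera_path` with a caption and a list of `Region` objects yields keyframes in the order the text mentions them, which a renderer could interpolate between to produce the pan-and-zoom video alongside the spoken audio.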

Copyright information

© 2010 ICST Institute for Computer Science, Social Informatics and Telecommunications Engineering

About this paper

Cite this paper

Reiterer, B., Concolato, C., Hellwagner, H. (2010). Natural-Language-Based Conversion of Images to Mobile Multimedia Experiences. In: Daras, P., Ibarra, O.M. (eds) User Centric Media. UCMEDIA 2009. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 40. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12630-7_10

  • DOI: https://doi.org/10.1007/978-3-642-12630-7_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12629-1

  • Online ISBN: 978-3-642-12630-7

  • eBook Packages: Computer Science, Computer Science (R0)
