Multimodal Interaction in Image and Video Applications

  • Angel D. Sappa
  • Jordi Vitrià

Part of the Intelligent Systems Reference Library book series (ISRL, volume 48)

Table of contents

  1. Front Matter
    Pages 1-5
  2. Michal Drozdzal, Santi Seguí, Petia Radeva, Carolina Malagelada, Fernando Azpiroz, Jordi Vitrià
    Pages 1-16
  3. Ernest Valveny, Oriol Ramos, Joan Mas, Marçal Rossinyol
    Pages 17-30
  4. Joost van de Weijer, Fahad Khan, Marc Masana
    Pages 31-45
  5. Abel Gonzalez, Robert Benavente, Olivier Penacchio, Javier Vazquez-Corral, Maria Vanrell, C. Alejandro Parraga
    Pages 47-66
  6. Pablo Ruiz, Javier Mateos, Gustavo Camps-Valls, Rafael Molina, Aggelos K. Katsaggelos
    Pages 67-81
  7. Mauricio Villegas, Luis A. Leiva, Roberto Paredes
    Pages 83-109
  8. M. Angeles Mendoza, Tomás Arnau, Isabel Gracia, Filiberto Pla, Nicolás Pérez de la Blanca
    Pages 111-134
  9. Marc Castelló, Jordi Gonzàlez, Ariel Amato, Pau Baiget, Carles Fernández, Josep M. Gonfaus et al.
    Pages 135-151
  10. Javier Ortells, Henry Anaya-Sánchez, Raúl Martín-Félez, Ramón A. Mollineda
    Pages 153-167
  11. David Vázquez, Antonio M. López, Daniel Ponsa, David Gerónimo
    Pages 169-184
  12. Gonzalo Ferrer, Anaís Garrell, Michael Villamizar, Iván Huerta, Alberto Sanfeliu
    Pages 185-203

About this book


Traditional Pattern Recognition (PR) and Computer Vision (CV) technologies have mainly focused on full automation, even though full automation often proves elusive or unnatural in many applications, where the technology is expected to assist rather than replace the human agents. Indeed, not all problems can be solved automatically, and for those applications human interaction is the only way to tackle them.

Recently, multimodal human interaction has become a field of increasing interest in the research community. Advanced man-machine interfaces with high cognitive capabilities are an active research topic aimed at solving challenging problems in image and video applications. The idea of interactive computer systems was in fact already proposed in the early stages of computer science. Nowadays, the ubiquity of image sensors, together with ever-increasing computing performance, has opened new and challenging opportunities for research in multimodal human interaction.

This book aims to show how existing PR and CV technologies can naturally evolve using this new paradigm. Its chapters present successful case studies of multimodal interactive technologies for both image and video applications. They cover a wide spectrum of applications, ranging from interactive handwriting transcription to human-robot interaction in real environments.


Computer Vision, Image Processing, Intelligent Systems, Machine Learning, Multimodal Human Interaction, Pattern Recognition

Authors and affiliations

  • Angel D. Sappa (1)
  • Jordi Vitrià (2)

  1. Computer Vision Center, Barcelona, Spain
  2. Departament de Matemàtica, Universitat de Barcelona, Barcelona, Spain

Bibliographic information

  • Copyright Information: Springer-Verlag Berlin Heidelberg 2013
  • Publisher Name: Springer, Berlin, Heidelberg
  • eBook Packages: Engineering
  • Print ISBN: 978-3-642-35931-6
  • Online ISBN: 978-3-642-35932-3
  • Series Print ISSN: 1868-4394
  • Series Online ISSN: 1868-4408