Skip to main content

Image representations for visual learning

  • Face-based Authentication
  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1206))

Abstract

The last few years have seen new successful approaches to object recognition and to computer graphics based directly on images without the use of intermediate 3D models. I will argue that most of these techniques depend on a representation of images that induces a linear vector space structure and in principle requires dense correspondence. This image representation allows the use of learning techniques for the analysis and for the synthesis of images, that is for both computer vision and computer graphics.

The key assumption hidden in most view-based approaches to object recognition and object detection is that the relevant images are vectors. This is not true unless they are set in correspondence. The correspondence step associates a shape vector and a texture vector to each image. I will focus on the domain of face images and review how this representation can be used to learn

  • to estimate parameters such as expression and pose from images

  • to interpolate in multiple dimensions new images and images sequences from examples

  • to extrapolate from single images and generate virtual examples

In particular, I will describe a new example-based correspondence technique based on a flexible template represented as the linear combination of prototypes [1]–[4]. The approach is hierarchical and may allow to exploit context in vision tasks. It has interesting biological implications and may applied to other domains beyond vision and graphics.

This is a preview of subscription content, log in via an institution.

References

  1. Thomas Vetter and Tomaso Poggio. Linear object classes and image synthesis from a single example image. A.I. Memo No. 1531, Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 1995.

    Google Scholar 

  2. M. Jones and T. Poggio. Model-based matching of line drawings by linear combination of prototypes. Proceedings of the International Conference on Computer Vision, pages 531–536. IEEE, June 1995.

    Google Scholar 

  3. David Beymer and Tomaso Poggio. Image Representations for Visual Learning. Science272 1905–1909 1996

    Google Scholar 

  4. Mike Jones and Tomaso Poggio. Model-Based Matching by Linear Combinations of Prototypes. A.I. Memo No. 1583, Artificial Intelligence Laboratory, Massachusetts Institute of Technology, 1996.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Josef Bigün Gérard Chollet Gunilla Borgefors

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Poggio, T. (1997). Image representations for visual learning. In: Bigün, J., Chollet, G., Borgefors, G. (eds) Audio- and Video-based Biometric Person Authentication. AVBPA 1997. Lecture Notes in Computer Science, vol 1206. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0015989

Download citation

  • DOI: https://doi.org/10.1007/BFb0015989

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-62660-2

  • Online ISBN: 978-3-540-68425-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics