Tracking of Multi-state Hand Models Using Particle Filtering and a Hierarchy of Multi-scale Image Features⋆

  • Ivan Laptev
  • Tony Lindeberg
Conference paper
Part of the Lecture Notes in Computer Science 2106 book series (LNCS, volume 2106)


This paper presents an approach for simultaneous tracking and recognition of hierarchical object representations in terms of multiscale image features. A scale-invariant dissimilarity measure is proposed for comparing scale-space features at different positions and scales. Based on this measure, the likelihood of hierarchical, parameterized models can be evaluated in such a way that maximization of the measure over different models and their parameters allows for both model selection and parameter estimation. Then, within the framework of particle filtering, we consider the area of hand gesture analysis, and present a method for simultaneous tracking and recognition of hand models under variations in the position, orientation, size and posture of the hand. In this way, qualitative hand states and quantitative hand motions can be captured, and be used for controlling different types of computerised equipment.


Feature Detection Hand Posture Hand Gesture Dissimilarity Measure Hand Tracking 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Black, M. and Jepson, A. (1998). A probabilistic framework for matching temporal trajectories: Condensation-based recognition of gestures and expressions, ECCV’98, Freiburg, Germany, 909–924.Google Scholar
  2. Bretzner, L. and Lindeberg, T. (1998). Use your hand as a 3-D mouse or relative orientation from extended sequences of sparse point and line correspondences using the afine trifocal tensor, ECCV’96, Freiburg, Germany, 141–157.Google Scholar
  3. Bretzner, L. and Lindeberg, T. (1999). Qualitative multi-scale feature hierarchies for object tracking, Scale-Space’99, Corfu, Greece, 117–128.Google Scholar
  4. Bretzner, L., Laptev, I. and Lindeberg, T. (2001). Tracking and recognition of hand postures for visually guided control. In preparation.Google Scholar
  5. Cipolla, R., Okamoto, Y. and Kuno, Y. (1993). Robust structure from motion using motion parallax, ICCV’93, Berlin, Germany, 374–382.Google Scholar
  6. Crowley, J. and Sanderson, A. (1987). Multiple resolution representation and probabilistic matching of 2-d gray-scale shape, IEEE-PAMI, 9(1): 113–121.CrossRefGoogle Scholar
  7. Cui, Y. and Weng, J. (1996). View-based hand segmentation and hand-sequence recognition with complex backgrounds, ICPR’96, Vienna, Austria, 617–621.Google Scholar
  8. Deutscher, J., Blake, A. and Reid, I. (2000). Articulated body motion capture by annealed particle filtering, CVPR’00, Hilton Head, SC, II:126–133.Google Scholar
  9. Freeman, W.T. and Weissman, C.D. (1995), Television control by hand gestures, Face and Gesture’95, Zurich, Switzerland.Google Scholar
  10. Heap, T. and Hogg, D. (1998). Wormholes in shape space: Tracking through discontinuous changes in shape, ICCV’98, Bombay, India, 344–349.Google Scholar
  11. Isard, M. and Blake, A. (1996). Contour tracking by stochastic propagation of conditional density, ECCV’96, Cambridge, UK, I:343–356.Google Scholar
  12. Laptev, I and Lindeberg, T. (2000) Tracking of multi-state hand models using particle filtering and a hierarchy of multi-scale image features, Technical report ISRN KTH/NA/P00/12-SE.
  13. Lindeberg, T. (1993). Detecting salient blob-like image structures and their scales with a scale-space primal sketch: A method for focus-of-attention, IJCV, 11: 283–318.CrossRefGoogle Scholar
  14. Lindeberg, T. (1998). Feature detection with automatic scale selection, IJCV, 30(2): 77–116.Google Scholar
  15. Lindeberg, T. and Niemenmaa, J. (2001). Scale selection in hybrid multi-scale representations, in preparation.Google Scholar
  16. MacCormick, J. and Isard, M. (2000). Partitioned sampling, articulated objects, and interface-quality hand tracking, ECCV’00, Dublin, Ireland, II:3–19.Google Scholar
  17. Pizer, S.M., Burbeck, C.A., Coggins, J.M., Fritsch, D.S. and Morse, B.S. (1994). Object shape before boundary shape: Scale-space medial axis, JMIV 4: 303–313.CrossRefGoogle Scholar
  18. Regh, J.M. and Kanade, T. (1995). Model-based tracking of self-occluding articulated objects, ICCV’95, Cambridge, MA, 612–617.Google Scholar
  19. Shokoufandeh, A., Marsic, I. and Dickinson, S. (1999). View-based object recognition using saliency maps, IVC, 17(5/6): 445–460.CrossRefGoogle Scholar
  20. Sidenbladh, H., Black, M. and Fleet, D. (2000). Stochastic tracking of 3-D human figures using 2-D image motion, ECCV’00, Dublin, Ireland, II:702–718.Google Scholar
  21. Sjöbergh, J. and Lindeberg, T. (2001). In preparation.Google Scholar
  22. Sullivan, J., Blake, A., Isard, M. and MacCormick, J. (1999). Object localization by Bayesian correlation, ICCV’99, Corfu, Greece, 1068–1075.Google Scholar
  23. Triesch, J. and von der Malsburg, C. (1996). Robust classification of hand postures against complex background, Face and Gesture’96, Killington, Vermont, 170–175.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2001

Authors and Affiliations

  • Ivan Laptev
    • 1
  • Tony Lindeberg
    • 1
  1. 1.Department of Numerical Analysis and Computer ScienceComputational Vision and Active Perception Laboratory (CVAP)StockholmSweden

Personalised recommendations