A two-stage probabilistic approach for object recognition

  • Stan Z. Li
  • Joachim Hornegger
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1407)


Assume that some objects are present in an image but can be seen only partially and are overlapping each other. To recognize the objects, we have to firstly separate the objects from one another, and then match them against the modeled objects using partial observation. This paper presents a probabilistic approach for solving this problem. Firstly, the task is formulated as a two-stage optimal estimation process. The first stage, matching, separates different objects and finds feature correspondences between the scene and each potential model object. The second stage, recognition, resolves inconsistencies among the results of matching to different objects and identifies object categories. Both the matching and recognition are formulated in terms of the maximum a posteriori (MAP) principle. Secondly, contextual constraints, which play an important role in solving the problem, are incorporated in the probabilistic formulation. Specifically, between-object constraints are encoded in the prior distribution modeled as a Markov random field, and within-object constraints are encoded in the likelihood distribution modeled as a Gaussian. They are combined into the posterior distribution which defines the MAP solution. Experimental results are presented for matching and recognizing jigsaw objects under partial occlusion, rotation, translation and scaling.


Object Recognition Model Object Markov Random Field Curve Segment Gibbs Distribution 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    N. Ayache and O. D. Faugeras. “HYPER: A new approach for the representation and positioning of two-dimensional objects”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 8(1):44–54, January 1986.Google Scholar
  2. 2.
    J. Besag. “On the statistical analysis of dirty pictures” (with discussions). Journal of the Royal Statistical Society, Series B, 48:259–302, 1986.zbMATHMathSciNetGoogle Scholar
  3. 3.
    P. J. Besl and R. C. Jain. “Three-Dimensional object recognition”. Computing Surveys, 17(1):75–145, March 1985.CrossRefGoogle Scholar
  4. 4.
    R. Chellappa and A. Jain, editors. Markov Random Fields: Theory and Applications. Academic Press, 1993.Google Scholar
  5. 5.
    P. R. Cooper. “Parallel structure recognition with uncertainty: Coupled segmentation and matching”. In Proceedings of IEEE International Conference on Computer Vision, pages 287–290, 1990.Google Scholar
  6. 6.
    O. Faugeras. Three-Dimensional Computer Vision — A Geometric Viewpoint. MIT Press, Cambridge, MA, 1993.Google Scholar
  7. 7.
    S. Geman and D. Geman. “Stochastic relaxation, Gibbs distribution and the Bayesian restoration of images”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6(6):721–741, November 1984.zbMATHCrossRefGoogle Scholar
  8. 8.
    W. E. L. Grimson. Object Recognition by Computer — The Role of Geometric Constraints. MIT Press, Cambridge, MA, 1990.Google Scholar
  9. 9.
    J. Hornegger and H. Niemann. “Statistical learning, localization and identification of objects”. In Proceedings of IEEE International Conference on Computer Vision, pages 914–919, MIT, MA, 1995.Google Scholar
  10. 10.
    R. A. Hummel and S. W. Zucker. “On the foundations of relaxation labeling process”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 5(3):267–286, May 1983.zbMATHGoogle Scholar
  11. 11.
    S. Kirkpatrick, C. D. Gellatt, and M. P. Vecchi. “Optimization by simulated annealing”. Science, 220:671–680, 1983.MathSciNetGoogle Scholar
  12. 12.
    S. Z. Li. Markov Random Field Modeling in Computer Vision. Springer-Verlag, New York, 1995.Google Scholar
  13. 13.
    S. Z. Li, H. Wang, K. L. Chan, and M. Petrou. “Minimization of MRP energy with relaxation labeling”. Journal of Mathematical Imaging and Vision, 7:149–161, 1997.MathSciNetCrossRefGoogle Scholar
  14. 14.
    K. V. Mardia and G. K. Kanji, editors. Statistics and Images: 1. Advances in Applied Statistics. Carfax, 1993.Google Scholar
  15. 15.
    J. W. Modestino and J. Zhang. “A Markov random field model-based approach to image interpretation”. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(6):606–615, 1992.CrossRefGoogle Scholar
  16. 16.
    C. Peterson and B. Soderberg. “A new method for mapping optimization problems onto neural networks”. International Journal of Neural Systems, 1(1):3–22, 1989.zbMATHCrossRefGoogle Scholar
  17. 17.
    Ullman. High-Level Vision: Object Recognition and Visual Cognition. MIT Press, 1996.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 1998

Authors and Affiliations

  • Stan Z. Li
    • 1
  • Joachim Hornegger
    • 2
  1. 1.School of EEENanyang Technological UniversitySingapore
  2. 2.Robotics LaboratoryStanford UniversityStanfordUSA

Personalised recommendations