Skip to main content

Object recognition: The search for representation

  • Conference paper
  • First Online:
Object Representation in Computer Vision (ORCV 1994)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 994))

Included in the following conference series:

Abstract

Object recognition in visual scenes by computer has proven to be more difficult than anyone would have thought three decades ago, at the beginning of the research program to achieve this goal. A central issue for further progress is the design and effective implementation of an object representation which captures all of the requirements for description and perceptual organization. In this paper, the major issues surrounding the development of such a representation are established and placed in the setting of relatively recent discoveries in the philosophy of recognition and object classification. From this viewpoint, definitions for representation, recognition, identification and classification are established and related to standard approaches to object recognition in visual scenes.

The use of biological models and introspection as a source of design ideas for representation is discussed. It is argued that the most profitable source of ideas will emerge from an engineering approach, based on principles from geometric reasoning, photogrammetry. and signal processing. The role of context in object recognition is outlined with emphasis on its use throughout all of stages of recognition. The paper concludes with a description of a object recognition system, called MORSE, which embodies many of the principles derived from these philosophical considerations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Roberts, L. G., Machine Perception of 3D Solids in Optical and Electro-Optical Information Processing, Tippet, J. T., et al eds., MIT Press, 1965.

    Google Scholar 

  2. Ayre, A.J., Language Truth and Logic, Dover Books, 1935, with a new introduction, 1946.

    Google Scholar 

  3. Hume, D., An Enquiry Concerning Human Understanding, 1748, in The English Philosophers From Bacon to Mill, Burtt, E. ed., The Modern Library, 1939.

    Google Scholar 

  4. Jowett, B., The Dialogs of Plato, vol. 3, Scribner, Armstrong and Co., 1874.

    Google Scholar 

  5. Davis, R., Shrobe, H. and Szolovits, P., The AI Magazine, Spring 1993, p. 17.

    Google Scholar 

  6. Dennett, D., Brainstorms, MIT Press, 1981

    Google Scholar 

  7. Weiskrantz, L., Thought Without Language, Oxford:Clarenden, 1988.

    Google Scholar 

  8. Raichle, M., Visualizing the Mind, Scientific American, April, 1994.

    Google Scholar 

  9. Freedman, D., Brainmakers, Simon and Schuster, 1994.

    Google Scholar 

  10. Edelman, G., Bright Air, Brilliant Fire, Basic Books, 1992. p. 194.

    Google Scholar 

  11. Kapur, D. and Mundy, J.L., editors Geometric Reasoning, MIT Press, 1989.

    Google Scholar 

  12. McCarthy, J., Epistemological Problems of Artificial Intelligence, in Readings in Knowledge Representation, Brachman, R. and Levesque, H., eds., Morgan Kaufmann, 1985.

    Google Scholar 

  13. Dennett, D., Consciousness Explained, Little, Brown and Company, 1991.

    Google Scholar 

  14. Shepard, R. N., Metzler, Mental Rotation of Three-Dimensional Objects, Science, 171, pp. 701–703.

    Google Scholar 

  15. Lowe, D., Perceptual Organization and Visual Recognition, Kluwer Academic Publishers, 1985.

    Google Scholar 

  16. Sayre, K. M., Recognition: A Study in the Philosophy of Artificial Intelligence, University of Notre Dame Press, 1965.

    Google Scholar 

  17. Grimson, W.E.L., Object Recognition by Computer: The Role of Geometric Constraints, MIT Press, 1990.

    Google Scholar 

  18. Aristotle, Aristotle: Categories on Interpretation and Prior Analytics, Loeb Classical Library, Cooke, H., translation, 1938.

    Google Scholar 

  19. Hayes, P., The Second Naive Physics Manifesto, in Readings in Knowledge Representation, Brachman, R. and Levesque, H., eds., Morgan Kaufmann, 1985.

    Google Scholar 

  20. Kline, M., Mathematics and the Search for Knowledge, Oxford University Press, 1985.

    Google Scholar 

  21. Wittgenstein, L., Philosophical Investigations, Ancombe, G., translator, The Macmillan Co., 1953.

    Google Scholar 

  22. Wantanabe, S., Pattern Recognition, Human and Mechanical, John Wiley and Sons, 1985.

    Google Scholar 

  23. Bowyer, K. and Stark, L. Form and function; a theory of purposive, qualitative 3-D object recognition, Artificial Intelligence and Computer Vision, Proc. 7'th Israeli Conference, pp 137–146, 1990.

    Google Scholar 

  24. Mundy, J.L. and Heller, A., The Evolution and Testing of a Model-Based Object Recognition System in Computer Vision: Advances and Applications, Kasturi, R. and Jain, R. editors, IEEE Computer Society Press, 1991.

    Google Scholar 

  25. Huttenlocher, D. and Ullman, S., Object Recognition Using Alignment, Proc. First International Conference on Computer Vision, 1987.

    Google Scholar 

  26. Strat, T. and Fischler, M, CONDOR, IEEE Transactions on Machine Intelligence and Pattern Analysis, Oct. 1991.

    Google Scholar 

  27. Mundy, J. L. and Vrobel, P., The Role of IU Technology in RADIUS Phase II, Proc. ARPA Image Understanding Workshop, Morgan Kaufman, 1994.

    Google Scholar 

  28. Mundy, J.L. and Zisserman, A., Repeated Structures: Image Correspondence Constraints and 3D Structure in Proc. 2nd Workshop on Applications of Geometric Invariants in Computer Vision, Lecture Notes on Computer Science, 825, Springer Verlag, 1994.

    Google Scholar 

  29. Zerroug, M. and Nevatia, R., Using Invariance and Quasi-invariance for the Segmentation and Recovery of Curved Objects in Proc. 2nd Workshop on Applications of Geometric Invariants in Computer Vision, Lecture Notes on Computer Science, 825, Springer Verlag, 1994.

    Google Scholar 

  30. Liu J., Mundy J.L., Forsyth D.A., Zisserman A. and Rothwell C.A., Efficient Recognition of Rotationally Symmetric Surfaces and Straight Homogeneous Generalized Cylinders, Proc. Conference on Computer Vision and Pattern Recognition, 1993.

    Google Scholar 

  31. Arnheim, R. Art and Visual Perception, University of California Press, 1971.

    Google Scholar 

  32. Mundy, J.L. and Zisserman, A. editors, Geometrical Invariance in Computer Vision, MIT Press, 1992.

    Google Scholar 

  33. Sullivan, S., Sandford, L. and Ponce, J., On Useing Geometric Distance Fits to Estimate 3D Object Shape, Pose and Deformation from Range, CT and Video Images, Proc. Computer Vision and Pattern Recognition Conference, 1993.

    Google Scholar 

  34. Mundy, J.L., Forsyth, D., Zisserman, A. and Rothwell, C., MORSE: Multiple Object Recognition by Scene Entailment, Draft Report, GE Research Report, Feb. 1994.

    Google Scholar 

  35. Intille, S. and Bobick, A., Tracking Using a Local Closed-World Assumption: Tracking in the Football Domain, MIT Media Lab Report 296, August, 1994.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Martial Hebert Jean Ponce Terry Boult Ari Gross

Rights and permissions

Reprints and permissions

Copyright information

© 1995 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mundy, J.L. (1995). Object recognition: The search for representation. In: Hebert, M., Ponce, J., Boult, T., Gross, A. (eds) Object Representation in Computer Vision. ORCV 1994. Lecture Notes in Computer Science, vol 994. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60477-4_2

Download citation

  • DOI: https://doi.org/10.1007/3-540-60477-4_2

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-60477-8

  • Online ISBN: 978-3-540-47526-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics