Object recognition: The search for representation

Mundy, J. L.

doi:10.1007/3-540-60477-4_2

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 994)

Included in the following conference series:

International Workshop on Object Representation in Computer Vision

Abstract

Object recognition in visual scenes by computer has proven to be more difficult than anyone would have thought three decades ago, at the beginning of the research program to achieve this goal. A central issue for further progress is the design and effective implementation of an object representation which captures all of the requirements for description and perceptual organization. In this paper, the major issues surrounding the development of such a representation are established and placed in the setting of relatively recent discoveries in the philosophy of recognition and object classification. From this viewpoint, definitions for representation, recognition, identification and classification are established and related to standard approaches to object recognition in visual scenes.

The use of biological models and introspection as a source of design ideas for representation is discussed. It is argued that the most profitable source of ideas will emerge from an engineering approach, based on principles from geometric reasoning, photogrammetry, and signal processing. The role of context in object recognition is outlined, with emphasis on its use throughout all stages of recognition. The paper concludes with a description of an object recognition system, called MORSE, which embodies many of the principles derived from these philosophical considerations.

Author information

Authors and Affiliations

Corporate Research Center, General Electric, Schenectady, NY
J. L. Mundy

Editor information

Martial Hebert, Jean Ponce, Terry Boult, Ari Gross

Cite this paper

Mundy, J.L. (1995). Object recognition: The search for representation. In: Hebert, M., Ponce, J., Boult, T., Gross, A. (eds) Object Representation in Computer Vision. ORCV 1994. Lecture Notes in Computer Science, vol 994. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60477-4_2

DOI: https://doi.org/10.1007/3-540-60477-4_2
Published: 01 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60477-8
Online ISBN: 978-3-540-47526-2
eBook Packages: Springer Book Archive
