Skip to main content

Towards a Theory of Motion Understanding in Man and Machine

  • Chapter
Motion Understanding

Abstract

The design of a system that understands visual motion, whether biological or machine, must adhere to certain constraints. These constraints include the types and numbers of available processors, how those processors are arranged, the nature of the task, as well as the characteristics of the input itself. This chapter examines some of these constraints, and in two parts, presents a framework for research in this area. The first part, Section 11.2, involves time complexity arguments demonstrating that the common attack to this problem, namely, an approach that is spatially parallel (as least conceptually), with temporal considerations strictly subsequent to the spatial ones, cannot possibly succeed. The essence of this claim is not a new one, and was motivated by similar comments by Neisser (1967), among others. What Neisser and others did not do, however, is provide a framework that is plausible. Expanding on the time complexity argument, we show that in addition to spatial parallelism, the basic system elements include hierarchical organization through abstraction of both prototypical visual knowledge as well as early representations of the input, and the separation of input measurements into logical feature maps.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Adelson, E.H., (1982) ‘Some new illusions and some old ones, analyzed in terms of their Fourier components,’ Investigative Ophthalmology and Visual Science, vol. 22, pp. 144.

    Google Scholar 

  • Adelson, E.H., and Bergen, J.R., (1985) ‘Spatiotemporal energy models for the perception of motion,’ J. Optical Society of America, vol. 2, pp. 284–299.

    Article  Google Scholar 

  • Anstis, S.M., (1970) ‘Phi movement as a subtractive process,’ Vision Research, vol. 10, pp. 1411–1430.

    Article  Google Scholar 

  • Ballard, D., (1985) ‘Cortical connections and parallel processing: Structure and function,’ The Behavioral and Brain Sciences, pp. 1–43.

    Google Scholar 

  • Barnard, S.T. and Thompson, W.B., (1980) ‘Disparity analysis of images,’ IEEE Trans. on Pattern Analysis and Machine Intelligence,vol. PAMI-2, pp. 333–340.

    Article  Google Scholar 

  • Barrow, H., and Tenenbaum, J.M., (1978) ‘Recovering intrinsic scene characteristics from images,’ in Computer Vision Systems, A. Hanson and E. Riseman (eds.), Academic Press, New York.

    Google Scholar 

  • Brillouin, L., (1962) Science and Information Theory, 2nd Edition, Academic Press, New York.

    MATH  Google Scholar 

  • Burt, P.J., (1981) ‘Fast filter transforms for image processing,’ Computer Graphics Image Processing, vol. 16, pp. 20–51.

    Article  Google Scholar 

  • Capek, M., (1981) ‘Time in relativity theory: Arguments for a theory of becoming,’ in The Voices of Time, J.T. Fraser (ed.), The University of Massachusetts Press, Amherst, MA.

    Google Scholar 

  • Cowey, A., (1979) ‘Cortical maps and visual perception,’ Quart. J. of Experimental Psychology, vol. 31, pp. 1–17.

    Article  Google Scholar 

  • Crowley, J.L., (1982) ‘Representation for Visual Information,’ Carnegie Mellon University, PhD. Dissertation. Available as CMU Robotics Institute Tec. Rep. CMU-RI-TR-82–7.

    Google Scholar 

  • Daugman, J.D., (1983) ‘Six formal properties of two-dimensional anisotropic visual filters: Structural principles and frequency/orientation selectivity,’ IEEE Trans. Systems, Man and Cybernetics, vol. SMC 13, pp. 882–887.

    Google Scholar 

  • Dreschler, L.H., and Nagel, H.-H., (1982) ‘Volumetric model and 3D trajectory of a moving car derived from monocular TV frames of a street scene,’ Computer Graphics Image Processing, vol. 20, pp. 199–228.

    Article  Google Scholar 

  • Estes, W., (1978) ‘Perceptual processing in letter recognition and reading,’ in The Handbook of Perception: Perceptual Processing, vol. IX, E. Carterette and M. Friedman (eds.), Academic Press, New York.

    Google Scholar 

  • Fahle, M., and Poggio, T., (1981) ‘Visual hyperacuity: Spatio-temporal interpolation in human vision,’ Proc. Royal Society London, vol. B 213, pp. 451–477.

    Google Scholar 

  • Fleet, D.J., Hallett, P.E., and Jepson, A.D., (1985) ‘Spatio-temporal inseparability in early visual processing,’ Biological Cybernetics, vol. 52, pp. 153–164.

    Google Scholar 

  • Fleet, D.J., and Jepson, A.D., (1984) ‘A cascaded filter approach to the construction of velocity selective mechanisms,’ Tech. Rep. RBCVTR-84–6, Dept. of Computer Science, University of Toronto.

    Google Scholar 

  • Fleet, D.J., and Jepson, A.D., (1985a) ‘On the hierarchical construction of orientation and velocity selective filters,’ Tech. Rep. RBCV-TR-85–8, Dept. of Computer Science, University of Toronto.

    Google Scholar 

  • Fleet, D.J., and Jepson, A.D., (1985b) ‘Spatiotemporal inseparability in early vision: Center-surround models and velocity selectivity,’ Computational Intelligence, vol. 1, pp. 89–102.

    Article  Google Scholar 

  • Fleet, D.J., and Jepson, A.D., (1985c) ‘Velocity extraction without form interpretation,’ Proc. 3rd IEEE Workshop on Computer Vision: Representation and Control, Bellaire, MI, pp. 179–185.

    Google Scholar 

  • Ganz, L., (1975) ‘Temporal factors in visual perception,’ in The Handbook of Perception: Seeing, vol. V, E.C. Carterette and M.P. Friedman (eds.), Academic Press, New York.

    Google Scholar 

  • Goodwin, A.W., Henry, G.H., and Bishop, P., (1975) ‘Direction selectivity of simple striate cells: Properties and mechanisms,’ J. Neurophysiology, vol. 38, pp. 1500–1523.

    Google Scholar 

  • Gross, C.G., (1973) ‘Visual functions of inferotemporal cortex,’ in Handbook of Sensory Physiology, vol. VII, 3B, R. Jung (ed.), Springer-Verlag, New York, pp. 451–482.

    Google Scholar 

  • Horn, B.K.P, and Schunck, B.G., (1981) ‘Determining optic flow,’ Artificial Intelligence, vol. 17, pp. 185–204.

    Article  Google Scholar 

  • Hubel, D.H., and Weisel, T.N., (1962) ‘Receptive fields, binocular interaction, and functional architecture in the cat’s visual cortex,’ J. Physiology, vol. 160, pp. 106–154.

    Google Scholar 

  • Jepson, A.D. and Fleet, D.J., (1987) ‘The measurement of visual primitives,’ in preparation.

    Google Scholar 

  • Julesz, B., (1971) Foundations of Cyclopean Perception, University of Chicago Press, Chicago.

    Google Scholar 

  • Julesz, B., (1971) Foundations of Cyclopean Perception, University of Chicago Press, Chicago.

    Google Scholar 

  • Kiroussis, L., and Papadimitriou, C., (1985) ‘On the intractability of polyhedral labelling algorithms,’ Proc. Foundations of Computer Science.

    Google Scholar 

  • Kulikowski, J.J., Marcelja, S., and Bishop, P.O., (1982) ‘Theory of spatial properties and spatial frequency relations in the receptive fields of simple cells in the visual cortex,’ Biological Cybernetics, vol. 43, pp. 187–198.

    Article  MATH  Google Scholar 

  • Limb, J.O., and Murphy, J.A., (1975) ‘Estimating velocity of moving images in television signals,’ Computer Graphics Image Processing, vol. 4, pp. 311–327.

    Article  Google Scholar 

  • Marcelja, S., (1980) ‘Mathematical descriptions of responses of simple cells,’ J. Optical Society America, vol. 70, pp. 1297–1300.

    Article  MathSciNet  Google Scholar 

  • Marr, D. and Hildreth, E.C., (1980) ‘Theory of edge detection,’ Proc. Royal Society London, vol. B 207, pp. 187–217.

    Article  Google Scholar 

  • Marr, D., and Ullman, S., (1981) ‘Directional selectivity and its use in early visual processing,’ Proc. Royal Society London, vol. B 211, pp. 151–180.

    Article  Google Scholar 

  • McKee, S.P., (1981) ‘A Local Mechanism for Differential Velocity Detection,’ Vision Research, vol. 21, pp. 491–500.

    Article  MathSciNet  Google Scholar 

  • Morgan, M.J., (1980) ‘Analogue models of motion perception,’ Phil. Trans. Royal Society London, vol. B 290, pp. 117–135.

    Article  Google Scholar 

  • Movshon, J.A., Adelson, E.H., Gizzi, M.S., and Newsome, W.T., (1985) ‘The analysis of moving visual patterns,’ in Pattern Recognition Mechanisms, C. Chages, R. Gattass and C. Gross (eds.), Vatican Press, Rome, pp. 117–151.

    Google Scholar 

  • Nagel, H.-H., (1983) ‘Displacement vectors derived from second-order intensity variations in image sequences,’ Computer Graphics and Image Processing,vol. 21, pp. 85–117.

    Article  Google Scholar 

  • Nakayama, K., (1985) ‘Biological image motion processing: A review,’ Vision Research, vol. 25, pp. 625–659.

    Article  Google Scholar 

  • Neisser, U., (1967) Cognitive Psychology, Appleton-Century-Crofts, New York.

    Google Scholar 

  • Poggio, G., and Poggio, T., (1984) ‘The analysis of stereopsis,’ Annual Rev. Neuroscience, vol. 7, pp. 379–412.

    Article  Google Scholar 

  • Pollen, D.A., and Ronner, S.F., (1983) ‘Visual cortical neurons and localized spatial frequency filters,’ IEEE Trans. on Systems, Man and Cybernetics, vol. SMC-13, pp. 907–916.

    Google Scholar 

  • Potter, M.C., (1975) ‘Meaning in visual search,’ Science, vol. 187, pp. 965–966.

    Article  Google Scholar 

  • Rabiner, L.R., and Gold, B., (1975) Digital Signal Processing. Prentice-Hall, Englewood Cliffs, NJ.

    Google Scholar 

  • Richter, J., and Ullman, S., (1982) ‘A model for the temporal organization of X and Y type receptive fields in the primate retina’, Biological Cybernetics, vol. 43, pp. 127–145.

    Article  Google Scholar 

  • Scholl, D., (1956) The Organization of the Cerebral Cortex, Wiley.

    Google Scholar 

  • Schunck, B.G., (1985) ‘Image flow: Fundamentals and future research,’ IEEE Conf. on Computer Vision and Pattern Recognition,San Francisco, pp. 561–571.

    Google Scholar 

  • Slepian, D., (1983) ‘Some comments on Fourier analysis, uncertainty and modelling,’ SIAM Rev., vol. 25, pp. 379–393.

    Article  MathSciNet  MATH  Google Scholar 

  • Sperling, G., (1963) ‘A model for visual memory tasks,’ Human Factors, vol. 5, pp. 19–31.

    Google Scholar 

  • Stone, J., Dreher, B., and Leventhal, A., (1979) ‘Hierarchical and parallel mechanisms in the organization of the visual cortex,’ Brain Research Reviews, vol. 1, pp. 345–394.

    Article  Google Scholar 

  • Thompson, P., (1984) ‘The coding of the velocity of movement in the human visual system,’ Vision Research, vol. 24, pp. 41–45.

    Article  Google Scholar 

  • Treisman, A.M., and Schmidt, H., (1982) ‘Illusory conjunctions in the perception of objects,’ Cognitive Psychology, vol. 14, pp. 107–141.

    Article  Google Scholar 

  • Tsotsos, J.K., (1988) ‘How does human vision beat the time complexity of visual perception?,’ to appear in Computational Processes in Human Vision: An Interdiciplinary Approach, Z. Pylyshyn (ed.), Ablex Press.

    Google Scholar 

  • Ullman, S., (1979) The Interpretation of Visual Motion, MIT Press, Cambridge, MA.

    Google Scholar 

  • Uttal, W., (1981) A Taxonomy of Visual Processes, Lawrence Erlbaum.

    Google Scholar 

  • van Essen, D., and Maunsell, J., (1983) ‘Hierarchical organization and functional streams in the visual cortex,’ Trends in Neuroscience, pp. 370–375.

    Google Scholar 

  • van Santen, J.P.H., and Sperling, G., (1985) ‘Elaborated Reichardt detectors,’ J. Optical Society America, vol. A 2, pp. 300–321.

    Article  Google Scholar 

  • Watson, B.A., and Ahumada, A.J., Jr., (1985) ‘Model of human visual- motion sensing,’ J. Optical Society America, vol. A 2, pp. 322–342.

    Article  Google Scholar 

  • Watson, B.A., and Robson, J.G., (1981) ‘Discrimination at threshold: Labelled detectors in human vision,’ Vision Research, vol. 21, pp. 1115–1122.

    Article  Google Scholar 

  • Watson, A., Thompson, P., Murphy, B., Nachmais, J., (1980) ‘Summation and discrimination of gratings moving in opposite directions,’ Vision Research, vol. 20, pp. 341–347.

    Article  Google Scholar 

  • Wilson, H.R., (1985) ‘A model for directional selectivity in threshold motion perception,’ Biological Cybernetics, vol. 51, pp. 213–222.

    Article  Google Scholar 

Download references

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1988 Kluwer Academic Publishers

About this chapter

Cite this chapter

Tsotsos, J.K., Fleet, D.J., Jepson, A.D. (1988). Towards a Theory of Motion Understanding in Man and Machine. In: Martin, W.N., Aggarwal, J.K. (eds) Motion Understanding. The Kluwer International Series in Engineering and Computer Science, vol 44. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-1071-6_11

Download citation

  • DOI: https://doi.org/10.1007/978-1-4613-1071-6_11

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4612-8413-0

  • Online ISBN: 978-1-4613-1071-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics