Abstract
The design of a system that understands visual motion, whether biological or machine, must adhere to certain constraints. These constraints include the types and numbers of available processors, how those processors are arranged, the nature of the task, as well as the characteristics of the input itself. This chapter examines some of these constraints, and in two parts, presents a framework for research in this area. The first part, Section 11.2, involves time complexity arguments demonstrating that the common attack to this problem, namely, an approach that is spatially parallel (as least conceptually), with temporal considerations strictly subsequent to the spatial ones, cannot possibly succeed. The essence of this claim is not a new one, and was motivated by similar comments by Neisser (1967), among others. What Neisser and others did not do, however, is provide a framework that is plausible. Expanding on the time complexity argument, we show that in addition to spatial parallelism, the basic system elements include hierarchical organization through abstraction of both prototypical visual knowledge as well as early representations of the input, and the separation of input measurements into logical feature maps.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Adelson, E.H., (1982) ‘Some new illusions and some old ones, analyzed in terms of their Fourier components,’ Investigative Ophthalmology and Visual Science, vol. 22, pp. 144.
Adelson, E.H., and Bergen, J.R., (1985) ‘Spatiotemporal energy models for the perception of motion,’ J. Optical Society of America, vol. 2, pp. 284–299.
Anstis, S.M., (1970) ‘Phi movement as a subtractive process,’ Vision Research, vol. 10, pp. 1411–1430.
Ballard, D., (1985) ‘Cortical connections and parallel processing: Structure and function,’ The Behavioral and Brain Sciences, pp. 1–43.
Barnard, S.T. and Thompson, W.B., (1980) ‘Disparity analysis of images,’ IEEE Trans. on Pattern Analysis and Machine Intelligence,vol. PAMI-2, pp. 333–340.
Barrow, H., and Tenenbaum, J.M., (1978) ‘Recovering intrinsic scene characteristics from images,’ in Computer Vision Systems, A. Hanson and E. Riseman (eds.), Academic Press, New York.
Brillouin, L., (1962) Science and Information Theory, 2nd Edition, Academic Press, New York.
Burt, P.J., (1981) ‘Fast filter transforms for image processing,’ Computer Graphics Image Processing, vol. 16, pp. 20–51.
Capek, M., (1981) ‘Time in relativity theory: Arguments for a theory of becoming,’ in The Voices of Time, J.T. Fraser (ed.), The University of Massachusetts Press, Amherst, MA.
Cowey, A., (1979) ‘Cortical maps and visual perception,’ Quart. J. of Experimental Psychology, vol. 31, pp. 1–17.
Crowley, J.L., (1982) ‘Representation for Visual Information,’ Carnegie Mellon University, PhD. Dissertation. Available as CMU Robotics Institute Tec. Rep. CMU-RI-TR-82–7.
Daugman, J.D., (1983) ‘Six formal properties of two-dimensional anisotropic visual filters: Structural principles and frequency/orientation selectivity,’ IEEE Trans. Systems, Man and Cybernetics, vol. SMC 13, pp. 882–887.
Dreschler, L.H., and Nagel, H.-H., (1982) ‘Volumetric model and 3D trajectory of a moving car derived from monocular TV frames of a street scene,’ Computer Graphics Image Processing, vol. 20, pp. 199–228.
Estes, W., (1978) ‘Perceptual processing in letter recognition and reading,’ in The Handbook of Perception: Perceptual Processing, vol. IX, E. Carterette and M. Friedman (eds.), Academic Press, New York.
Fahle, M., and Poggio, T., (1981) ‘Visual hyperacuity: Spatio-temporal interpolation in human vision,’ Proc. Royal Society London, vol. B 213, pp. 451–477.
Fleet, D.J., Hallett, P.E., and Jepson, A.D., (1985) ‘Spatio-temporal inseparability in early visual processing,’ Biological Cybernetics, vol. 52, pp. 153–164.
Fleet, D.J., and Jepson, A.D., (1984) ‘A cascaded filter approach to the construction of velocity selective mechanisms,’ Tech. Rep. RBCVTR-84–6, Dept. of Computer Science, University of Toronto.
Fleet, D.J., and Jepson, A.D., (1985a) ‘On the hierarchical construction of orientation and velocity selective filters,’ Tech. Rep. RBCV-TR-85–8, Dept. of Computer Science, University of Toronto.
Fleet, D.J., and Jepson, A.D., (1985b) ‘Spatiotemporal inseparability in early vision: Center-surround models and velocity selectivity,’ Computational Intelligence, vol. 1, pp. 89–102.
Fleet, D.J., and Jepson, A.D., (1985c) ‘Velocity extraction without form interpretation,’ Proc. 3rd IEEE Workshop on Computer Vision: Representation and Control, Bellaire, MI, pp. 179–185.
Ganz, L., (1975) ‘Temporal factors in visual perception,’ in The Handbook of Perception: Seeing, vol. V, E.C. Carterette and M.P. Friedman (eds.), Academic Press, New York.
Goodwin, A.W., Henry, G.H., and Bishop, P., (1975) ‘Direction selectivity of simple striate cells: Properties and mechanisms,’ J. Neurophysiology, vol. 38, pp. 1500–1523.
Gross, C.G., (1973) ‘Visual functions of inferotemporal cortex,’ in Handbook of Sensory Physiology, vol. VII, 3B, R. Jung (ed.), Springer-Verlag, New York, pp. 451–482.
Horn, B.K.P, and Schunck, B.G., (1981) ‘Determining optic flow,’ Artificial Intelligence, vol. 17, pp. 185–204.
Hubel, D.H., and Weisel, T.N., (1962) ‘Receptive fields, binocular interaction, and functional architecture in the cat’s visual cortex,’ J. Physiology, vol. 160, pp. 106–154.
Jepson, A.D. and Fleet, D.J., (1987) ‘The measurement of visual primitives,’ in preparation.
Julesz, B., (1971) Foundations of Cyclopean Perception, University of Chicago Press, Chicago.
Julesz, B., (1971) Foundations of Cyclopean Perception, University of Chicago Press, Chicago.
Kiroussis, L., and Papadimitriou, C., (1985) ‘On the intractability of polyhedral labelling algorithms,’ Proc. Foundations of Computer Science.
Kulikowski, J.J., Marcelja, S., and Bishop, P.O., (1982) ‘Theory of spatial properties and spatial frequency relations in the receptive fields of simple cells in the visual cortex,’ Biological Cybernetics, vol. 43, pp. 187–198.
Limb, J.O., and Murphy, J.A., (1975) ‘Estimating velocity of moving images in television signals,’ Computer Graphics Image Processing, vol. 4, pp. 311–327.
Marcelja, S., (1980) ‘Mathematical descriptions of responses of simple cells,’ J. Optical Society America, vol. 70, pp. 1297–1300.
Marr, D. and Hildreth, E.C., (1980) ‘Theory of edge detection,’ Proc. Royal Society London, vol. B 207, pp. 187–217.
Marr, D., and Ullman, S., (1981) ‘Directional selectivity and its use in early visual processing,’ Proc. Royal Society London, vol. B 211, pp. 151–180.
McKee, S.P., (1981) ‘A Local Mechanism for Differential Velocity Detection,’ Vision Research, vol. 21, pp. 491–500.
Morgan, M.J., (1980) ‘Analogue models of motion perception,’ Phil. Trans. Royal Society London, vol. B 290, pp. 117–135.
Movshon, J.A., Adelson, E.H., Gizzi, M.S., and Newsome, W.T., (1985) ‘The analysis of moving visual patterns,’ in Pattern Recognition Mechanisms, C. Chages, R. Gattass and C. Gross (eds.), Vatican Press, Rome, pp. 117–151.
Nagel, H.-H., (1983) ‘Displacement vectors derived from second-order intensity variations in image sequences,’ Computer Graphics and Image Processing,vol. 21, pp. 85–117.
Nakayama, K., (1985) ‘Biological image motion processing: A review,’ Vision Research, vol. 25, pp. 625–659.
Neisser, U., (1967) Cognitive Psychology, Appleton-Century-Crofts, New York.
Poggio, G., and Poggio, T., (1984) ‘The analysis of stereopsis,’ Annual Rev. Neuroscience, vol. 7, pp. 379–412.
Pollen, D.A., and Ronner, S.F., (1983) ‘Visual cortical neurons and localized spatial frequency filters,’ IEEE Trans. on Systems, Man and Cybernetics, vol. SMC-13, pp. 907–916.
Potter, M.C., (1975) ‘Meaning in visual search,’ Science, vol. 187, pp. 965–966.
Rabiner, L.R., and Gold, B., (1975) Digital Signal Processing. Prentice-Hall, Englewood Cliffs, NJ.
Richter, J., and Ullman, S., (1982) ‘A model for the temporal organization of X and Y type receptive fields in the primate retina’, Biological Cybernetics, vol. 43, pp. 127–145.
Scholl, D., (1956) The Organization of the Cerebral Cortex, Wiley.
Schunck, B.G., (1985) ‘Image flow: Fundamentals and future research,’ IEEE Conf. on Computer Vision and Pattern Recognition,San Francisco, pp. 561–571.
Slepian, D., (1983) ‘Some comments on Fourier analysis, uncertainty and modelling,’ SIAM Rev., vol. 25, pp. 379–393.
Sperling, G., (1963) ‘A model for visual memory tasks,’ Human Factors, vol. 5, pp. 19–31.
Stone, J., Dreher, B., and Leventhal, A., (1979) ‘Hierarchical and parallel mechanisms in the organization of the visual cortex,’ Brain Research Reviews, vol. 1, pp. 345–394.
Thompson, P., (1984) ‘The coding of the velocity of movement in the human visual system,’ Vision Research, vol. 24, pp. 41–45.
Treisman, A.M., and Schmidt, H., (1982) ‘Illusory conjunctions in the perception of objects,’ Cognitive Psychology, vol. 14, pp. 107–141.
Tsotsos, J.K., (1988) ‘How does human vision beat the time complexity of visual perception?,’ to appear in Computational Processes in Human Vision: An Interdiciplinary Approach, Z. Pylyshyn (ed.), Ablex Press.
Ullman, S., (1979) The Interpretation of Visual Motion, MIT Press, Cambridge, MA.
Uttal, W., (1981) A Taxonomy of Visual Processes, Lawrence Erlbaum.
van Essen, D., and Maunsell, J., (1983) ‘Hierarchical organization and functional streams in the visual cortex,’ Trends in Neuroscience, pp. 370–375.
van Santen, J.P.H., and Sperling, G., (1985) ‘Elaborated Reichardt detectors,’ J. Optical Society America, vol. A 2, pp. 300–321.
Watson, B.A., and Ahumada, A.J., Jr., (1985) ‘Model of human visual- motion sensing,’ J. Optical Society America, vol. A 2, pp. 322–342.
Watson, B.A., and Robson, J.G., (1981) ‘Discrimination at threshold: Labelled detectors in human vision,’ Vision Research, vol. 21, pp. 1115–1122.
Watson, A., Thompson, P., Murphy, B., Nachmais, J., (1980) ‘Summation and discrimination of gratings moving in opposite directions,’ Vision Research, vol. 20, pp. 341–347.
Wilson, H.R., (1985) ‘A model for directional selectivity in threshold motion perception,’ Biological Cybernetics, vol. 51, pp. 213–222.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1988 Kluwer Academic Publishers
About this chapter
Cite this chapter
Tsotsos, J.K., Fleet, D.J., Jepson, A.D. (1988). Towards a Theory of Motion Understanding in Man and Machine. In: Martin, W.N., Aggarwal, J.K. (eds) Motion Understanding. The Kluwer International Series in Engineering and Computer Science, vol 44. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-1071-6_11
Download citation
DOI: https://doi.org/10.1007/978-1-4613-1071-6_11
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4612-8413-0
Online ISBN: 978-1-4613-1071-6
eBook Packages: Springer Book Archive