Spatial Reasoning as a Tool for Scene Generation and Recognition

  • Giovanni Adorni


Although a great deal of effort has been put into the research and development of artificial intelligence reasoning systems, spatial reasoning is a relatively new independent research area. Up to now spatial reasoning problems have been considered in a variety of areas, including computer graphics, computer vision, robotics, geographical information systems, man-machine interaction, autonomous systems, and expert systems. Spatial reasoning involves spatial task planning, navigation planning for robots, representing and indexing large spatial databases, the integration of symbolic reasoning with geometrical constraints, and multisensor data fusion.

In this paper I focus on the aspects of spatial reasoning that are more closely related to high-level computer vision. More precisely, after a brief review of studies performed by psychologists of perception related to the field, I investigate the problems of: i) the description of objects and space modelling, ii) the representation of spatial relationships, iii) functional aspects of objects and naive reasoning.


Spatial Relation Recognition Activity Living Room Spatial Reasoning Content Field 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    I.E. Sigel, The development of pictorial comprehension, in Visual Learning, Thinking, and Communication, B.S. Randhawa and W.E. Coffman eds., Academic Press, New York, NY (1978).Google Scholar
  2. 2.
    A.L. Yarbus, Eyes movements and vision, Plenum Press, New York, NY (1967).Google Scholar
  3. 3.
    R.M. Cooper, The control of the eye fixation by the meaning of spoken language: a new methodology for the real-time investigation of speech perception, memory, and language processing, Cognitive Psychology, Vol.6, pp. 84–107 (1974).CrossRefGoogle Scholar
  4. 4.
    N.H. Mackworth and A.J. Morandi, The gaze selects informative details within pictures, Perception and Psychophysics, Vol.2, pp. 547–551 (1967).CrossRefGoogle Scholar
  5. 5.
    J.R. Antes, The time course of picture viewing, Journal of Experimental Psychology, Vol. 103, pp. 62–70 (1974).PubMedCrossRefGoogle Scholar
  6. 6.
    D.E. Berlyne, The influence of complexity and novelty in visual figures on orienting responses, Journal of Experimental Psychology, Vol.55, pp. 289–296 (1958).PubMedCrossRefGoogle Scholar
  7. 7.
    G.R. Loftus and N.H. Mackworth, Cognitive determinants of fixation location during picture viewing, Journal of Experimental Psychology: Human Perception and Performance, Vol.4, pp. 565–572 (1978).PubMedCrossRefGoogle Scholar
  8. 8.
    I. Biederman, Perceiving real-world scenes, Science, Vol.177, pp. 77–80 (1972).PubMedCrossRefGoogle Scholar
  9. 9.
    I. Biederman, On the semantics of a glance at a scene, in Perceptual Organization, M. Kubovy and J.R. Pomerantz eds., Lawrence Erlbaum Associates, Hillsdale, NJ, p. 215 (1981).Google Scholar
  10. 10.
    I. Biederman, R.C. Teitelbaum, and R.J. Mezzanotte, Scene perception: a failure to find a benefit from prior expectancy or familiarity, Journal of Experimental Psychology: Learning, Memory, and Cognition, Vol.9, No.2, pp. 411–429 (1983).PubMedCrossRefGoogle Scholar
  11. 11.
    J.M. Mandler and R.E. Parker, Memory for descriptive and spatial information in complex pictures, Journal of Experimental Psychology: Human Learning and Memory, Vol.2, pp. 38–48 (1976).CrossRefGoogle Scholar
  12. 12.
    J.M. Mandler and N.S. Johnson, Some of the thousand words a picture is worth, Journal of Experimental Psychology: Human Learning and Memory, Vol.2, pp. 529–540 (1976).CrossRefGoogle Scholar
  13. 13.
    J.D. Bransford and J.J. Franks, The abstraction of linguistic ideas, Cognitive Psychology, Vol.2, pp. 331–350 (1971).CrossRefGoogle Scholar
  14. 14.
    K. Pezdek, Recognition memory for related pictures, Memory and Cognition, Vol.6, pp. 64–69 (1978).CrossRefGoogle Scholar
  15. 15.
    J.R. Anderson, Arguments concerning representations for mental imagery, Psychological Review, Vol.85, pp. 249–277 (1978).CrossRefGoogle Scholar
  16. 16.
    A. Pavio, Imagery and verbal processes, Holt, Rinehart and Winston, New York, NY (1971).Google Scholar
  17. 17.
    G. Atwood, An experimental study of visual imagination and memory, Cognitive Psychology, Vol.2, pp. 290–299 (1971).CrossRefGoogle Scholar
  18. 18.
    L.R. Brooks, Spatial and verbal components of the act of recall, Canadian Journal of Psychology, Vol.22, pp. 349–368 (1968).CrossRefGoogle Scholar
  19. 19.
    R.N. Shepard, The mental image, American Psychologist, Vol.33, pp. 125–137 (1978).CrossRefGoogle Scholar
  20. 20.
    G. Adorni, A. Boccalatte, and M. DiManzo, Object representation and spatial knowledge: an insight into the problem of men-robot communication, Proc. 7th. Conference of the Canadian Man-Computer Communication Society, Waterloo, CDN (1981).Google Scholar
  21. 21.
    M. Minsky, A framework for representing knowledge, in The psychology of computer vision, P.H. Winston ed., McGraw-Hill, New York, NY (1975).Google Scholar
  22. 22.
    R.C. Schänk and R.P. Abelson, Scripts, Plans, Goals, and Understanding, Lawrence Erlbaum, Hillsdale, NJ (1977).Google Scholar
  23. 23.
    G. Adorni and M. DiManzo, Top-down approach to scene interpretation, Proc. Convencion de Informatica Latina, Barcelona, E, pp. 591-605 (1983).Google Scholar
  24. 24.
    D. Marr and H.K. Nishihara, Representation and recognition of the spatial organization of 3-D shapes, Proc. Royal Soc. Lond.B., pp. 269-294 (1978).Google Scholar
  25. 25.
    R.C. Schänk ed., Conceptual information processing, North Holland, Amsterdam, NL (1975).Google Scholar
  26. 26.
    G. Adorni, Some notes on a cognitive model for scene description, Technical Report, Istituto di Elettrotecnica, Università di Genova (1982).Google Scholar
  27. 27.
    G. Adorni, A. Boccalatte, and M. DiManzo, Cognitive models for computer vision, Proc. 9th. COLING, Prague, pp. 7-12 (1982).Google Scholar
  28. 28.
    B. Kuipers, Modeling spatial knowledge, Cognitive Science, Vol.2, pp. 129–153 (1978).CrossRefGoogle Scholar
  29. 29.
    H. Clark, Space, time, semantics, and the child, in Cognitive Development and the Acquisition of Language, T.E. Moore ed., Academic Press, New York, NY (1973).Google Scholar
  30. 30.
    N.K. Sondheimer, Spatial reference and natural language machine control, Int. Journal of Man-Machine Studies, Vol.8, pp. 329–336 (1976).CrossRefGoogle Scholar
  31. 31.
    L.C. Boggess, Computational interpretation of English spatial prepositions, Coordinated Science Lab., Tech.Rep. T-75, Urbana, IL (1979).Google Scholar
  32. 32.
    M. Bierwisch, Some semantic universal of german adjectivals, Foundations of Language, Vol.3, pp. 1–16 (1967).Google Scholar
  33. 33.
    G.S. Cooper, A semantic analysis of English locative expressions, BBN Tech. Rep., No. 1587, Cambridge, MA (1968).Google Scholar
  34. 34.
    N. Goguen, A procedural description of spatial prepositions, M.S. Thesis, Univ. of Pennsylvania, Moore School of Electrical Engineering, Philadelphia, PN (1973).Google Scholar
  35. 35.
    D. Waltz and L.C. Boggess, Visual analog representations for natural language understanding, Proc. 6th. IJCAI, Tokyo, J, pp. 926-934 (1979).Google Scholar
  36. 36.
    D. Waltz, Understanding scene descriptions as event simulations, Proc. 18th. Annual Meeting of ACL, Philadelphia, PN, pp. 7-12 (1980).Google Scholar
  37. 37.
    D. Waltz, Toward a detailed model of processing for natural language describing the physical world, Proc. 7th. IJCAI, Vancouver, CDN, pp. 1-6 (1981).Google Scholar
  38. 38.
    G. Adorni, M. DiManzo, and F. Giunchiglia, Some basic mechanisms for common sense reasoning about stories environments, Proc. 8th. IJCAI, Karlsruhe, D, pp. 72-74 (1983).Google Scholar
  39. 39.
    G. Adorni, M. DiManzo, and F. Giunchiglia, From descriptions to images: what reasoning in between?, Proc. 6th. ECAI, Pisa, I, pp. 359-368 (1984).Google Scholar
  40. 40.
    M. DiManzo, G. Adorni, and F. Giunchiglia, Reasoning about scene descriptions, Proceedings of the IEEE, Vol.74, No.7, pp. 1013–1025 (1986).CrossRefGoogle Scholar
  41. 41.
    A. Herskovits, Language and Spatial Cognition: an Interdisciplinary Study of the Prepositions in English, Cambridge University Press, Cambridge, UK (1986).Google Scholar
  42. 42.
    P.J. Hayes, Naive physics I: ontology for liquids, Working Paper, No.35, Univ. of Geneve, ISSCO, Geneve, CH (1978).Google Scholar
  43. 43.
    R.A. Brooks, Symbolic reasoning among 3-D models and 2-D images, Artificial Intelligence, Vol.17, pp. 285–348 (1981).CrossRefGoogle Scholar
  44. 44.
    R.B. Fisher, Using surfaces and object models to recognize partially observed objects, Proc. 8th. IJCAI, Karlsruhe, D, pp. 231-234 (1983).Google Scholar
  45. 45.
    M. DiManzo, E. Trucco, F. Giunchiglia, and F. Ricci, FUR: understanding functional reasoning, International Journal of Intelligent Systems, Vol.4, pp. 431–457 (1989).CrossRefGoogle Scholar
  46. 46.
    M. DiManzo, G. Adorni, F. Ricci, A. Batistoni, and C. Ferrari, Qualitative theories for functional description of objects, Esprit Project P419, Report TK4-WP2-DI1 (1986).Google Scholar
  47. 47.
    G. Adorni, Causal analysis: a case study in a vectorial domain, Proc. 1st. Conf. of AI*IA, Trento, I, pp. 158-164 (1989).Google Scholar
  48. 48.
    J. De Kleer and J.S. Brown, A qualitative physics based on confluences, Artificial Intelligence, Vol.24, pp. 7–83 (1984).CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media New York 1994

Authors and Affiliations

  • Giovanni Adorni
    • 1
  1. 1.Dipartimento di Ingegneria dell’InformazioneUniversità di ParmaParmaItaly

Personalised recommendations