Perceptual Sketch Interpretation

  • Markus Wuersch
  • Max J. Egenhofer
Part of the Lecture Notes in Geoinformation and Cartography book series (LNGC)


An automated extraction of regions from sketches can be of great value for multi-modal user interfaces and for interpreting spatial data. This paper develops the Perceptual Sketch Interpretation algorithm, which employs the theory of topological relations from spatial reasoning as well as good continuity from gestalt theory in order to model people’s perception. The Perceptual Sketch Interpretation algorithm extracts regions iteratively, removing one region at each a time, thus making the remaining sketch simpler and easier to interpret. The evaluation of the algorithm shows that the use of gestalt theory empowers the algorithm to correctly identify regions and saves processing time over other approaches.


Perceptual Organization Topological Relation Spatial Reasoning Spatial Query Good Continuity 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Bennamoun M, Mamic G (2002). Object recognition. Springer-Verlag, LondonGoogle Scholar
  2. Blake A, Isard M (1998). Active contours. Springer-Verlag, LondonGoogle Scholar
  3. Blaser A (2000). Sketching spatial queries. Ph.D. Thesis, University of Maine, Orono, MEGoogle Scholar
  4. Blaser A, Egenhofer M (2000). A visual tool for querying geographic databases. In Di Gesù V, Levialdi S, Tarantini L (eds), AVI 2000—Advanced visual databases, Salerno, Italy, pp 211-216Google Scholar
  5. Chrisman N (2001) Exploring geographic information systems. John Wiley, New YorkGoogle Scholar
  6. Cohen P, Johnston M, McGee D, Oviatt S, Pittman J, Smith I, Chen L, Clow J (1997). Quickset: multimodal interaction for distributed applications. Proceedings of the fifth ACM international multimedia conference, pp 31-40Google Scholar
  7. Egenhofer M (1993). A model for detailed binary topological relationships, Geomatica, 47(3&4), 261-273Google Scholar
  8. Egenhofer M (1993). Definitions of line-line relations for geographic databases. IEEE data engineering bulletin 16(3), 40-45Google Scholar
  9. Egenhofer M (1996). Spatial-Query-by Sketch. In Burnett M and Citrin W (eds) VL ‘96: IEEE symposium on visual languages, Boulder, CO, 60-67Google Scholar
  10. Egenhofer M (1997). Query processing in Spatial-Query-by-Sketch. Journal of visual languages and computing 8(4): 403-424CrossRefGoogle Scholar
  11. Egenhofer M, Herring J (1991). Categorizing binary topological relationships between regions, lines, and points in geographic databases. Technical Report, Department of Surveying Engineering, University of Maine, Orono, ME, (∼ max/9intreport.pdf)Google Scholar
  12. Egenhofer M, Mark D (1995). Naive geography, In: Frank A, Kuhn W (eds), COSIT ‘95, Spatial information theory. Lecture Notes in Computer Science vol 988, pp 1-16Google Scholar
  13. Egenhofer M, Shariff AR (1998). Metric details for natural-language spatial relations, ACM transactions on information systems 16(4): 295-321CrossRefGoogle Scholar
  14. Koffka, K (1935). Principles of gestalt psychology, Harcourt, Brace and Company, New YorkGoogle Scholar
  15. Kuipers B (1979) Modeling spatial knowledge. Cognitive science 2(2): 129-153CrossRefGoogle Scholar
  16. Lowe D (1990) Visual recognition as probabilistic inference from spatial relations, In Blake A, Troscianko T (eds), AI and eye, John Wiley, New YorkGoogle Scholar
  17. Mohan R, Nevatia R (1989). Using perceptual organization to extract 3-d structures, IEEE transactions on pattern analysis and machine intelligence 11(11): 1121-1139CrossRefGoogle Scholar
  18. Mohan R, Nevatia R (1992) Perceptual organization for scene segmentation and description, IEEE transactions on pattern analysis and machine intelligence 14(6): 616-635CrossRefGoogle Scholar
  19. Oviatt S, DeAngeli A, Kuhn K (1997) Integration and synchronization of input modes during multimodal human-computer interaction. Proceedings of the conference on human factors in computing systems (CHI ‘97), pp 415-422Google Scholar
  20. Park S-H, Gero J (1999) Qualitative representation and reasoning about shapes, In Gero J, Tversky B (eds.), Visual and spatial reasoning in design, Key Centre of Design Computing and Cognition, University of Sydney, Sydney, Australia, pp 55-68Google Scholar
  21. Sarkar S, Boyer K (1993) Integration, inference, and management of spatial information using Bayesian networks: perceptual organization. IEEE transactions on pattern analysis and machine intelligence 15(3): 256-274CrossRefGoogle Scholar
  22. Saund E (2003) Finding perceptually closed paths in sketches and drawings. IEEE transactions on pattern analysis and machine intelligence 25(4): 475-491CrossRefGoogle Scholar
  23. Saund E, Moran T (1995) Perceptual organization in an interactive sketch editing application. International conference on computer vision (ICCV ‘95), IEEE Computer Society Press, pp 597-604Google Scholar
  24. Shariff AR, Egenhofer M, Mark D (1998) Natural-language spatial relations between linear and areal objects: the topology and metric of English-language terms, International journal of geographical information science 12(3): 215-246Google Scholar
  25. Vieu L (1997) Spatial representation and reasoning in artificial intelligence, in Stock, O. (ed.) Spatial and Temporal Reasoning, Kluwer, Dordrecht, pp 5-41CrossRefGoogle Scholar
  26. Waranusast R (2007) Perceptual-based region extraction from hand drawn sketches. Proceedings of the third IASTED international conference, advances in computer science and technology, Phuket, Thailand, pp 222-227Google Scholar
  27. Wertheimer M (1923) Laws of organization in perceptual forms, In Ellis W (ed.), A source book of gestalt psychology, Routledge & Kegan Paul, London, pp 71-88Google Scholar
  28. Wuersch M (2003) Perceptual sketch interpretation. M.S. thesis, University of MaineGoogle Scholar
  29. Zabrodsky H, Algom D (1994) Continuous symmetry: a model for human figural perception. Spatial vision, 8(4): 455-467CrossRefGoogle Scholar
  30. Zhu S-C (1999) Embedding gestalt laws in Markov random fields—a theory for shape modeling and perceptual organization. IEEE transactions on pattern analysis and machine intelligence 21(11): 1170-1187CrossRefGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2008

Authors and Affiliations

  • Markus Wuersch
    • 1
  • Max J. Egenhofer
    • 2
  1. 1.uLocate Communications IncBostonUSA
  2. 2.National Center for Geographic Information and Analysis Department of Spatial Information Science and Engineering Department of Computer ScienceUniversity of MaineOronoUSA

Personalised recommendations