Skip to main content
  • 3026 Accesses

Abstract

In this chapter, we use the metaphor of a mathematical function to identify the types of tasks (questions) involved in exploratory data analysis. A task is viewed as consisting of two parts: the target, i.e. what information needs to be obtained, and the constraints, i.e. what conditions this information needs to fulfil. The target and constraints can be understood as unknown and known (specified) information, respectively; the goal is to find the initially unknown information corresponding to the specified information.

Our task typology has its origin in the ideas expressed by Jacques Bertin in his Semiology of Graphics (Bertin 1967/1983). Like Bertin, we distinguish tasks according to the level of data analysis (“reading level”, in Bertin’s terms) but additionally take into account the division of data components into referrers and attributes:

  • Elementary tasks refer to individual elements of the reference set; for example, “what is the proportion of children in the enumeration district X?”

  • Synoptic tasks involve the whole reference set or its subsets; for example, “describe the variation of the proportions of children over the whole country” (or “over the southern part of the country”).

The tasks are further divided according to the target (“question type”, in Bertin’s terms), i.e. what is the unknown information that needs to be found. At the elementary level, the target may be one or more characteristics (attribute values) or one or more references (referrer values). For example:

  • What is the proportion of children in the enumeration district X? (That is, find the characteristic corresponding to the reference “district X”.)

  • In what enumeration districts are the proportions of children 20% or more? (That is, find the references corresponding to the characteristics “20% and more”.)

It is important that, when a task involves several references, each of them is dealt with individually.

We have extended Bertin’s ideas by explicitly considering the possible relations between references and between characteristics. Relations may also appear in a task target or may be used in task constraints. For example:

  • Compare the proportions of children in district X and district Y (i.e. find the relation between the characteristics corresponding to the references “district X” and “district Y”).

  • Find districts where the proportion of children is higher than in district X (i.e. find references such that the corresponding characteristics are linked by the relation “higher than” to the characteristic of “district X”).

At the synoptic level of analysis, we introduce the notion of a “behaviour” — the set of all characteristics corresponding to a given reference (sub)set, considered in its entirety and its particular organisation with respect to the reference sub(set). The behaviour is a generalisation of such notions as distributions, variations, and trends; for example, the variation of the proportions of children over the whole country or the trend in a stock price over a week.

Synoptic tasks involve reference (sub)sets, behaviours, and relations between (sub)sets or between behaviours. Here are a few examples:

  • Describe the variation of the proportion of children over the whole country (target: the behaviour of the proportion of children; constraint: the whole country as the reference set).

  • Find spatial clusters of districts with a high proportion of children (target: the reference subset(s); constraint: the behaviour specified as “spatial cluster of high values”).

  • Compare the distributions of the proportion of children in the north and in the south of the country (target: two behaviours and the relation between them; constraint: two reference subsets, the north and the south of the country).

Elementary tasks play a marginal role in exploratory data analysis, as compared with synoptic tasks. Among synoptic tasks, the most challenging are tasks of finding significant connections between phenomena, such as cause-effect relations or structural links, and of finding the principles of the internal organisation, functioning, and development of a single phenomenon. We call such tasks “connectional”.

The main purpose of our task typology is to evaluate the existing tools and techniques for EDA in terms of their suitability for different tasks and to try to derive operational general principles for tool selection and tool design.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Albrecht, J., Jung, S., Mann, S.: VGIS: a GIS shell for the conceptual design of environmental models. In: Innovations in GIS 4: Selected Papers from the 4th National Conference on GIS Research UK, GISRUK, ed. by Kemp, Z. (Taylor & Francis, London 1997) pp. 154–165

    Google Scholar 

  2. Andrienko, N., Andrienko, G., Gatalsky, P.: Exploratory spatio-temporal visualization: an analytical review. Journal of Visual Languages and Computing 14(6), 503–541 (2003)

    Google Scholar 

  3. Andrienko, G., Andrienko, N., Dykes, J., Gahegan, M., Mountain, D., Noy, P., Roberts, J., Rodgers, P. Theus, M.: Creating instruments for ideation: software approaches to geovisualization. In: Exploring Geovisualization, ed. by Dykes, J., MacEachren, A., Kraak, M.-J. (Elsevier, Oxford 2005) pp. 103–125

    Google Scholar 

  4. Arnheim, R.: Visual Thinking (University of California Press, Berkeley 1969, renewed 1997)

    Google Scholar 

  5. Bertin, J.: Semiology of Graphics. Diagrams, Networks, Maps (University of Wisconsin Press, Madison 1983). Translated from Bertin, J.: Sémiologie graphique (Gauthier-Villars, Paris 1967)

    Google Scholar 

  6. Blok, C.: Monitoring change: characteristics of dynamic geo-spatial phenomena for visual exploration. In: Spatial Cognition II, ed. by Freksa, C., Brauer, W., Habel, C., Wender, K.F., Lecture Notes in Artificial Intelligence, Vol. 1849 (Springer, Berlin, Heidelberg 2000) pp. 16–30

    Google Scholar 

  7. Casner, S.M.: A task-analytic approach to the automated design of graphic presentations. ACM Transactions on Graphics 10, 111–151 (1991)

    Article  Google Scholar 

  8. Perception (psychology). In: Microsoft® Encarta® Online Encyclopedia, http://encarta.msn.com (Microsoft, Redmond, WA 2004). Accessed 28 Mar 2005

    Google Scholar 

  9. Fayyad, U.M., Piatetsky-Shapiro, G., Smyth., P.: From data mining to knowledge discovery: an overview. In: Advances in Knowledge Discovery and Data Mining, ed. by Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R. (AAAI Press/MIT Press, Menlo Park 1996) pp. 1–36

    Google Scholar 

  10. Gahegan, M., O’Brien, D.: A strategy and architecture for the visualisation of complex geographical datasets. International Journal of Pattern Recognition and Artificial Intelligence, 11(2), 239–261 (1997)

    Article  Google Scholar 

  11. Heylighen, F.: Occam’s razor. In: Principia Cybernetica Web, ed. by Heylighen, F., Joslyn, C., Turchin, V. (Principia Cybernetica, Brussels 1997), http://pespmc1.vub.ac.be/REFERPCP.html. Accessed 28 Mar 2005

    Google Scholar 

  12. Jung, V.: Knowledge-based visualization design for geographic information systems. In Proceedings of the 3rd ACM International Workshop on Advances in GIS, Baltimore, 1995, ed. by Bergougnoux, P., Makki, K., Pissinou, N. (ACM Press, New York 1995) pp. 101–108

    Google Scholar 

  13. Klir, G.J.: Architecture of Systems Problem Solving (Plenum, New York 1985)

    Google Scholar 

  14. Knapp, L.: A task analysis approach to the visualization of geographic data. In: Cognitive Aspects of Human—Computer Interaction for Geographic Information Systems, ed. by Nyerges, T.L., Mark, D.M., Laurini, R., Egenhofer, M.J. (Kluwer, Dordrecht 1995) pp. 355–371

    Google Scholar 

  15. Koussoulakou, A., Kraak, M.J.: Spatio-temporal maps and cartographic communication. The Cartographic Journal 29, 101–108 (1992)

    Google Scholar 

  16. Kraak, M.-J., Edsall, R., MacEachren, A.M.: Cartographic animation and legends for temporal maps: exploration and/or interaction. In Proceedings of the 18th International Cartographic Conference, Vol. 1 (1997) pp. 253–261

    Google Scholar 

  17. Miller, H.J., Han, J.: Geographic data mining and knowledge discovery: an overview. In: Geographic Data Mining and Knowledge Discovery, ed. by Miller, H.J., Han, J. (Taylor & Francis, London 2001) pp. 3–32

    Google Scholar 

  18. MacEachren, A.M.: How Maps Work: Representation, Visualization, and Design (Guilford, New York 1995)

    Google Scholar 

  19. Qian, L., Wachowicz, M., Peuquet, D., MacEachren, A.: Delineating operations for visualization and analysis of space—time data in GIS. In: Proceedings of the GIS/LIS’ 97, Cincinnati, (1997) pp. 872–877

    Google Scholar 

  20. Peuquet, D.J.: It’s about time: a conceptual framework for the representation of temporal dynamics in geographic information systems. Annals of the Association of American Geographers 84(3), 441–461 (1994)

    Article  Google Scholar 

  21. Random House Webster’s Unabridged Electronic Dictionary (Random House, Broadway, NY 1996)

    Google Scholar 

  22. Robertson, P.K.: A methodology for choosing data representations. IEEE Computer Graphics and Applications 11(3), 56–67 (1991)

    Article  Google Scholar 

  23. Roth, S.M., Mattis, J.: Data characterization for intelligent graphics presentation. In: Proceedings of SIGCHI’90: Human Factors in Computing Systems, Seattle, 1990 (ACM Press, New York 1990) pp. 193–200

    Google Scholar 

  24. Shneiderman, B.: The eyes have it: a task by data type taxonomy for information visualizations. In: Proceedings of the 1996 IEEE Symposium on Visual Languages (IEEE Computer Society Press, Piscataway 1996) pp. 336–343

    Google Scholar 

  25. Tukey, J.W.: Exploratory Data Analysis (Addison-Wesley, Reading, MA 1977)

    Google Scholar 

  26. Wehrend, S., Lewis, C.: A problem-oriented classification of visualization techniques. In: Proceedings of the First IEEE Conference on Visualization: Visualization 90 (IEEE, Los Alamitos 1990) pp. 139–146

    Google Scholar 

  27. Yuan, M., Albrecht, J.: Structural analysis of geographic information and GIS operations from a user’s perspective. In: Spatial Information Theory: a Theoretical Basis for GIS: International Conference COSIT’95, Proceedings, ed. by Frank, A.U., Kuhn, W., Lecture Notes in Computer Science, Vol. 988, (Springer, Berlin, Heidelberg 1995) pp. 107–122

    Google Scholar 

  28. Zhou, M., Feiner, S.: Visual task characterization for automated visual discourse synthesis. In: Proceedings of the SIGCHI conference on Human factors in computing systems (ACM Press, New York 1998) pp. 392–399

    Google Scholar 

Download references

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

(2006). Tasks. In: Exploratory Analysis of Spatial and Temporal Data. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31190-4_3

Download citation

  • DOI: https://doi.org/10.1007/3-540-31190-4_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-25994-7

  • Online ISBN: 978-3-540-31190-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics