Data

doi:10.1007/3-540-31190-4_2

2990 Accesses

Abstract

Data represent results of the observation or measurement of phenomena. By means of data analysis, people can study these phenomena. Data analysis can be regarded as seeking answers to various questions regarding the phenomena. These questions, or, in other words, data analysis tasks, are the focus of our attention. In this chapter, we attempt to develop a general view of data, which will help us to understand what data analysis tasks are potentially possible.

We distinguish two types of components of data, referrers and attributes, which can also be called independent and dependent variables. A dataset can be viewed on an abstract level as a correspondence between references, i.e. values of the referrers, and characteristics, i.e. values of the attributes. Here are a few examples:

In a dataset containing daily prices of a stock on a stock market, the referrer is time and the attribute is the stock price. The moments of time (i.e. days) are references, and the price on each day is the characteristic corresponding to this reference.
In a dataset containing census data of a country, the set of enumeration districts is the referrer, and various counts (e.g. the total population or the numbers of females and males in the population) are the attributes. Each district is a reference, and the corresponding counts are its characteristics.
In a dataset containing marks received by schoolchildren in tests in various subjects (mathematics, physics, history, etc.), the set of pupils and the set of school subjects are the referrers, and the test result is the attribute. References in this case are pairs consisting of a pupil and a subject, and the respective mark is the characteristic of this reference.

As may be seen from the last example, a dataset may contain several referrers. The second example shows that a dataset may contain any number of attributes.

The examples demonstrate the three most important types of referrers:

time (e.g. days);
space (e.g. enumeration districts);
population (e.g. pupils or school subjects).

The term “population” is used in an abstract sense to mean a group of any items, irrespective of their nature.

We introduce a general view of a dataset structure as a function (in the mathematical sense) defining the correspondence between the references and the characteristics.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bertin, J.: Semiology of Graphics. Diagrams, Networks, Maps (University of Wisconsin Press, Madison 1983). Translated from Bertin, J.: Sémiologie graphique (Gauthier-Villars, Paris 1967)
Google Scholar
Blok, C.: Monitoring change: characteristics of dynamic geo-spatial phenomena for visual exploration. In: Spatial Cognition II, ed. by Freksa, Ch., Brauer, W., Habel., C., Wender, K.F., Lecture Notes in Artificial Intelligence, Vol.1849 (Springer, Berlin, Heidelberg 2000), pp.16–30
Google Scholar
Chrisman, N.R.: Exploring Geographic Information Systems (Wiley, New York 1997)
Google Scholar
Gray, J., Chaudhuri, S., Bosworth, A., Layman, A., Reichart, D., Venkatrao, M., Pellow, F., Pirahesh, H.: Data cube: a relational aggregation operator generalizing group-by, cross-tab, and sub-totals. In: Readings in Database Systems, ed. by Stonebraker, M., Hellerstein, J.M., 3rd edn (Morgan Kaufmann, San Francisco 1998) pp.555–567
Google Scholar
Jung, V.: Knowledge-based visualization design for geographic information systems. In: Proceedings of the 3rd ACM International Workshop on Advances in GIS, ed. by Bergougnoux, P., Makki, K., Pissinou, N., Baltimore 1995 (ACM Press, New York 1995) pp.101–108
Google Scholar
Klir, G.J.: Architecture of Systems Problem Solving (Plenum, New York 1985)
Google Scholar
Langran, G.: Time in Geographic Information Systems (Taylor & Francis, London 1992)
Google Scholar
MacEachren, A.M.: How Maps Work: Representation, Visualization, and Design (Guilford, New York 1995)
Google Scholar
Merriam-Webster’s Collegiate® Dictionary, 10th edn, (Merriam-Webster, Springfield, MA 1999)
Google Scholar
Peuquet, D.J.: It’s about time: a conceptual framework for the representation of temporal dynamics in geographic information systems. Annals of the Association of American Geographers 84(3), 441–461 (1994)
Article Google Scholar
Peuquet, D.J.: Making space for time: issues in space—time data representation. Geoinformatica 5(1), 11–32 (2001)
Article MATH Google Scholar
Peuquet, D.J.: Representations of Space and Time (Guilford, New York 2002)
Google Scholar
Roth, S.M., Mattis, J.: Data characterization for intelligent graphics presentation. In: Proceedings SIGCHI’90: Human Factors in Computing Systems, ed. by Carrasco, J., Whiteside, J., Seattle, 1990 (ACM Press, New York 1990) pp.193–200
Google Scholar
Slocum, T.A.: Thematic Cartography and Visualization (Prentice Hall, Upper Saddle River 1999)
Google Scholar
Stevens, S.S.: On the theory of scales of measurement. Science 103, 677–680 (1946)
Google Scholar
Stolte, C., Tang, D., Hanrahan, P.: Multiscale visualization using data cubes. In: Proceedings of the IEEE Symposium on Information Visualization 2002 InfoVis’02, ed. by Wong. P.C., Andrews, K., Boston, October 2002 (IEEE Computer Society, Piscataway 2002) pp.7–14
Google Scholar
Verbyla, D.L.: Practical GIS Analysis (Taylor & Francis, London 2002)
Google Scholar
Yuan, M., Albrecht, J.: Structural analysis of geographic information and GIS operations from a user’s perspective. In: Spatial Information Theory: a Theoretical Basis for GIS: International Conference COSIT’95, Proceedings, ed. by Frank, A.U., Kuhn, W., Lecture Notes in Computer Science, Vol.988 (Springer, Berlin, Heidelberg 1995) pp.107–122
Google Scholar

Download references

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

(2006). Data. In: Exploratory Analysis of Spatial and Temporal Data. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31190-4_2

Download citation

DOI: https://doi.org/10.1007/3-540-31190-4_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25994-7
Online ISBN: 978-3-540-31190-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics