Abstract
Data represent results of the observation or measurement of phenomena. By means of data analysis, people can study these phenomena. Data analysis can be regarded as seeking answers to various questions regarding the phenomena. These questions, or, in other words, data analysis tasks, are the focus of our attention. In this chapter, we attempt to develop a general view of data, which will help us to understand what data analysis tasks are potentially possible.
We distinguish two types of components of data, referrers and attributes, which can also be called independent and dependent variables. A dataset can be viewed on an abstract level as a correspondence between references, i.e. values of the referrers, and characteristics, i.e. values of the attributes. Here are a few examples:
-
In a dataset containing daily prices of a stock on a stock market, the referrer is time and the attribute is the stock price. The moments of time (i.e. days) are references, and the price on each day is the characteristic corresponding to this reference.
-
In a dataset containing census data of a country, the set of enumeration districts is the referrer, and various counts (e.g. the total population or the numbers of females and males in the population) are the attributes. Each district is a reference, and the corresponding counts are its characteristics.
-
In a dataset containing marks received by schoolchildren in tests in various subjects (mathematics, physics, history, etc.), the set of pupils and the set of school subjects are the referrers, and the test result is the attribute. References in this case are pairs consisting of a pupil and a subject, and the respective mark is the characteristic of this reference.
As may be seen from the last example, a dataset may contain several referrers. The second example shows that a dataset may contain any number of attributes.
The examples demonstrate the three most important types of referrers:
-
time (e.g. days);
-
space (e.g. enumeration districts);
-
population (e.g. pupils or school subjects).
The term “population” is used in an abstract sense to mean a group of any items, irrespective of their nature.
We introduce a general view of a dataset structure as a function (in the mathematical sense) defining the correspondence between the references and the characteristics.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bertin, J.: Semiology of Graphics. Diagrams, Networks, Maps (University of Wisconsin Press, Madison 1983). Translated from Bertin, J.: Sémiologie graphique (Gauthier-Villars, Paris 1967)
Blok, C.: Monitoring change: characteristics of dynamic geo-spatial phenomena for visual exploration. In: Spatial Cognition II, ed. by Freksa, Ch., Brauer, W., Habel., C., Wender, K.F., Lecture Notes in Artificial Intelligence, Vol.1849 (Springer, Berlin, Heidelberg 2000), pp.16–30
Chrisman, N.R.: Exploring Geographic Information Systems (Wiley, New York 1997)
Gray, J., Chaudhuri, S., Bosworth, A., Layman, A., Reichart, D., Venkatrao, M., Pellow, F., Pirahesh, H.: Data cube: a relational aggregation operator generalizing group-by, cross-tab, and sub-totals. In: Readings in Database Systems, ed. by Stonebraker, M., Hellerstein, J.M., 3rd edn (Morgan Kaufmann, San Francisco 1998) pp.555–567
Jung, V.: Knowledge-based visualization design for geographic information systems. In: Proceedings of the 3rd ACM International Workshop on Advances in GIS, ed. by Bergougnoux, P., Makki, K., Pissinou, N., Baltimore 1995 (ACM Press, New York 1995) pp.101–108
Klir, G.J.: Architecture of Systems Problem Solving (Plenum, New York 1985)
Langran, G.: Time in Geographic Information Systems (Taylor & Francis, London 1992)
MacEachren, A.M.: How Maps Work: Representation, Visualization, and Design (Guilford, New York 1995)
Merriam-Webster’s Collegiate® Dictionary, 10th edn, (Merriam-Webster, Springfield, MA 1999)
Peuquet, D.J.: It’s about time: a conceptual framework for the representation of temporal dynamics in geographic information systems. Annals of the Association of American Geographers 84(3), 441–461 (1994)
Peuquet, D.J.: Making space for time: issues in space—time data representation. Geoinformatica 5(1), 11–32 (2001)
Peuquet, D.J.: Representations of Space and Time (Guilford, New York 2002)
Roth, S.M., Mattis, J.: Data characterization for intelligent graphics presentation. In: Proceedings SIGCHI’90: Human Factors in Computing Systems, ed. by Carrasco, J., Whiteside, J., Seattle, 1990 (ACM Press, New York 1990) pp.193–200
Slocum, T.A.: Thematic Cartography and Visualization (Prentice Hall, Upper Saddle River 1999)
Stevens, S.S.: On the theory of scales of measurement. Science 103, 677–680 (1946)
Stolte, C., Tang, D., Hanrahan, P.: Multiscale visualization using data cubes. In: Proceedings of the IEEE Symposium on Information Visualization 2002 InfoVis’02, ed. by Wong. P.C., Andrews, K., Boston, October 2002 (IEEE Computer Society, Piscataway 2002) pp.7–14
Verbyla, D.L.: Practical GIS Analysis (Taylor & Francis, London 2002)
Yuan, M., Albrecht, J.: Structural analysis of geographic information and GIS operations from a user’s perspective. In: Spatial Information Theory: a Theoretical Basis for GIS: International Conference COSIT’95, Proceedings, ed. by Frank, A.U., Kuhn, W., Lecture Notes in Computer Science, Vol.988 (Springer, Berlin, Heidelberg 1995) pp.107–122
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
(2006). Data. In: Exploratory Analysis of Spatial and Temporal Data. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-31190-4_2
Download citation
DOI: https://doi.org/10.1007/3-540-31190-4_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-25994-7
Online ISBN: 978-3-540-31190-4
eBook Packages: Computer ScienceComputer Science (R0)