Abstract
All data contain errors, and large spatial data sets are especially prone because they contain data from multiple sources, and use different assumptions about structure and semantics. The general issue is one of data quality assurance, defined in terms of lineage, completeness, logical consistency, attribute accuracy, and positional accuracy. We review a series of quality metrics suitable for empirical description of data quality, and consider some of the special issues of quality related to spatial data, especially the need to incorporate visualizations of data quality into graphics and maps. We conclude that data quality is an essential component of software for spatial data handling, including geographic information systems.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Bibliography
M.K. Beard, B.P. Buttenfield, and S.B. Clapham. NCGIA Research Initiative 7: Visualization of spatial data quality. Technical Report 9126, National Center for Geographic Information and Analysis, 1991.
P.A. Burrough and A.U. Frank. Geographic objects with indeterminate boundaries. Taylor and Francis, 1996.
N.R. Chrisman. Exploring geographic information systems. Wiley, 1997.
N.R. Chrisman and B. Yandell. Effects of point error on area calculations. Surveying and Mapping, 48: 241–246, 1989.
K. Clarke, P.D. Teague, and H.G. Smith. Virtual depth-based representation of cartographic uncertainty. In W. Shi, M.F. Goodchild, and P.F. Fisher, editors, Proceedings of the International Symposium on Spatial Data Quality ‘89, pages 253–259, 1999.
S.C. Guptill and J.L. Morrison. Elements of spatial data quality. Elsevier, 1995.
K.C. Clarke and P.D. Teague. Cartographic symbolization of uncertainty. In Proceedings, ACSM Annual Conference, 1998. CD-ROM.
T.J. Davis and C.P. Keller. Modelling and visualizing multiple spatial uncertainties. Computers and Geosciences, 23: 397–408, 1997.
C.R. Ehlschlaeger, A.M. Shortridge, and M.F. Goodchild. Visualizing spatial data uncertainty using animation. Computers and Geosciences, 23: 387–395, 1997.
P.F. Fisher. Visualizing uncertainty in soil maps by animation. Cartographica, 30: 20–27, 1993.
A.R. Gillespie. Spectral mixture analysis of multispectral thermal infrared images. Remote Sensing of Environment, 42: 137–145, 1992.
M.F. Goodchild, A.M. Shortridge, and P. Fohl. Encapsulating simulation models with geospatial data sets. In K. Lowell and A. Jaton, editors, Spatial accurary assessment: Land information uncertainty in natural resources, pages 131–138. Ann Arbor Press, 1999.
G.B.M. Heuvelink. Error propagation in environmental modelling with GIS. Taylor and Francis, 1998.
G.B.M. Heuvelink, P.A. Burrough, and A. Stein. Propagation of errors in spatial modelling with GIS. International Journal of Geographical Information Systems, 3: 303–322, 1989.
G.J. Hunter and M.F. Goodchild. Modeling the uncertainty of slope and aspect estimates obtained from spatial databases. Geographical Analysis, 29: 35–47, 1997.
A.G. Journel. Modelling uncertainty and spatial dependence: Stochastic imaging. International Journal of Geographical Information Systems, 10: 517–522, 1996.
A.M. MacEachren. Visualizing uncertain information. Cartographic Perspectives, 13: 10–19, 1992.
M. McGranaghan. A cartographic view of spatial data quality. Cartographica, 30: 8–19, 1993.
J.R. Taylor. An introduction to error analysis: The study of uncertainties in physical measurements. University Science Books, 1982.
C.M. Wittenbrink, A.T. Pang, and S. Lodha. Glyphs for visualizing uncertainty in vector fields. IEEE Transactions on Visualization and Computer Graphics, 2: 266–279, 1996.
A.X. Zhu, L.E. Band, B. Dutton, and T.J. Nimlos. Automated soil inference under fuzzy logic. Ecological Modelling, 90: 123–145, 1996.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Goodchild, M.F., Clarke, K.C. (2002). Data Quality in Massive Data Sets. In: Abello, J., Pardalos, P.M., Resende, M.G.C. (eds) Handbook of Massive Data Sets. Massive Computing, vol 4. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-0005-6_18
Download citation
DOI: https://doi.org/10.1007/978-1-4615-0005-6_18
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-4882-5
Online ISBN: 978-1-4615-0005-6
eBook Packages: Springer Book Archive