Data Quality in Massive Data Sets

  • Chapter
Handbook of Massive Data Sets

Part of the book series: Massive Computing (MACO, volume 4)

Abstract

All data contain errors, and large spatial data sets are especially prone to them because they combine data from multiple sources that rest on different assumptions about structure and semantics. The general issue is one of data quality assurance, defined in terms of lineage, completeness, logical consistency, attribute accuracy, and positional accuracy. We review a series of quality metrics suitable for the empirical description of data quality and consider some of the special quality issues that arise with spatial data, especially the need to incorporate visualizations of data quality into graphics and maps. We conclude that data quality is an essential component of software for spatial data handling, including geographic information systems.

Copyright information

© 2002 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Goodchild, M.F., Clarke, K.C. (2002). Data Quality in Massive Data Sets. In: Abello, J., Pardalos, P.M., Resende, M.G.C. (eds) Handbook of Massive Data Sets. Massive Computing, vol 4. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-0005-6_18

  • DOI: https://doi.org/10.1007/978-1-4615-0005-6_18

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-4882-5

  • Online ISBN: 978-1-4615-0005-6

  • eBook Packages: Springer Book Archive
