Skip to main content

What’s Spatial About Spatial Data Mining: Three Case Studies

  • Chapter
Data Mining for Scientific and Engineering Applications

Part of the book series: Massive Computing ((MACO,volume 2))

Abstract

Spatial data mining is the process of discovering interesting and previously unknown, but potentially useful, patterns from large spatial datasets. Extracting interesting and useful patterns from spatial datasets is more difficult than extracting the corresponding patterns from traditional numeric and categorical data due to the complexity of spatial data types, spatial relationships, and spatial autocorrelation. A popular approach is to apply classical data mining techniques after transforming spatial components into non-spatial components via feature selection. An alternative is to explore new models, new objective functions, and new patterns which are more suitable for spatial data and their unique properties. This chapter investigates techniques in the literature to incorporate spatial components via feature selection, new models, new objective functions, and new patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. M. Ankerst, M.M. Breunig, H.P. Kriegel, and J. Sander. Optics: Ordering points to identify the clustering structure. In Proceedings of the 1999 ACM SIGMOD International Conference on Management of Data, Philadelphia, Pennsylvania, USA, pages 49–60, 1999.

    Google Scholar 

  2. R. Agrawal. Tutorial on database mining. In Thirteenth ACM Symposium on Principles of Databases Systems, pages 75–76, Minneapolis, MN, 1994.

    Google Scholar 

  3. P.S. Albert and L.M. McShane. A Generalized Estimating Equations Approach for Spatially Correlated Binary Data: Applications to the Analysis of Neuroimaging Data. Biometrics (Publisher: Washington, Biometric Society, Etc.), 1:627–638, 1995.

    Article  MATH  Google Scholar 

  4. L Anselin. Spatial Econometrics: methods and models. Kluwer, Dordrecht, Netherlands, 1988.

    Book  MATH  Google Scholar 

  5. R. Agrawal and R. Srikant. Fast algorithms for Mining Association Rules. In Proc. of Very Large Databases, may 1994.

    Google Scholar 

  6. M.M. Breunig, H.R Kriegel, R. T. Ng, and J. Sander. Opticsof: Identifying local outliers. In Proc. of PKDD’99, Prague, Czech Republic, Lecture Notes in Computer Science (LNAI 1704), pp. 262–270, Springer Verlag, 1999.

    Google Scholar 

  7. V. Barnett and T. Lewis. Outliers in Statistical Data. John Wiley, New York, 3rd edition, 1994.

    MATH  Google Scholar 

  8. Y. Boykov, O. Veksler, and R. Zabih. Fast Approximate Energy Minimization via Graph Cuts. Proc. of International Conference on Computer Vision, September 1999.

    Book  Google Scholar 

  9. N.A. Cressie. Statistics for Spatial Data (Revised Edition). Wiley, New York, 1993.

    MATH  Google Scholar 

  10. M. Ester, H.-P. Kriegel, and J. Sander. Spatial Data Mining: A Database Approach. In Proc. Fifth Symposium on Rules in Geographic Information Databases, 1997.

    Google Scholar 

  11. G. Greenman. Turning a map into a cake layer of information. New York Times, http://www.nytimes.com/library/tech /00/01/circuits/articles/20giss.html, Feb 12 2000.

    Google Scholar 

  12. R.H. Guting. An Introduction to Spatial Database Systems. In Very Large Data Bases Journal(Publisher: Springer Verlag), October 1994.

    Google Scholar 

  13. R.J. Haining. Spatial Data Analysis in the Social and Environmental Sciences. In Cambridge University Press, Cambridge, U.K, 1989.

    Google Scholar 

  14. D. Hawkins. Identification of Outliers. Chapman and Hall, 1980.

    Book  MATH  Google Scholar 

  15. M. Hohn, L. Gribki, and A.E. Liebhold. A Geostatistical Model for Forecasting the Spatial Dynamics of Defoliation Caused by the Beypsy MothLymantria dispar (Lepidoptera: Lymantriidae). Environmental Entomology (Publisher: Entomological Society of America, 22:1066–1075, 1993.

    Article  Google Scholar 

  16. J. Hipp, U. Guntzer, and G. Nakaeizadeh. Algorithms for Association Rule Mining - A General Survey and Comparison. In Proc. A CM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2000.

    Google Scholar 

  17. Issaks, Edward, and M. Svivastava. Applied Geostatistics. In Oxford University Press, Oxford, 1989.

    Google Scholar 

  18. R. Johnson. Applied Multivariate Statistical Analysis. Prentice Hall, 1992.

    MATH  Google Scholar 

  19. K. Koperski, J. Adhikary, and J. Han. Knowledge Discovery in Spatial Databases: Progress and Challenges. In Proc. ACM SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery, Montreal, Canada. 55–70, 1996.

    Google Scholar 

  20. K. Koperski and J. Han. Discovery of Spatial Association Rules in Geographic Information Databases. In Proc. Fourth International Symposium on Large Spatial Databases, Maine. 47–66, 1995.

    Google Scholar 

  21. K. Koperski, J. Han, and N. Stefanovic. An Efficient Two-Step Method for Classification of Spatial Data. 1998.

    Google Scholar 

  22. E.M. Knorr and R.T. Ng. Extraction of Spatial Proximity Patterns by Concept Generalization. In Proc. Second International Conference on Knowledge Discovery and Data Mining, Portland, Oregon. AAAI Press. 347–350, 1996.

    Google Scholar 

  23. E. Knorr and R. Ng. A unified notion of outliers: Properties and computation. In Proc. of the International Conference on Knowledge Discovery and Data Mining, pages 219–222, 1997.

    Google Scholar 

  24. E. Knorr and R. Ng. Algorithms for mining distance-based outliers in large datasets. In Proc. 24th VLDB Conference, 1998.

    Google Scholar 

  25. E.M. Knorr, R.T. Ng, and D.L. Shilvock. Finding boundary shape matching relationships in spatial data. In Proc. 5th International Symposium, SSD 97. Springer-Verlag, Berlin, 29–46, 1997.

    Google Scholar 

  26. E.M. Knorr and Ng R.T. Finding aggregate proximity relationships and commonalities in spatial data mining. IEEE Trans. Knowl. and Data Eng. 8(6):884–897, 1996.

    Article  Google Scholar 

  27. P. Krugman. Development, Geography, and Economic theory. In MIT Press, Cambridge, MA, 1995.

    Google Scholar 

  28. Z. Li, J. Cihlar, L. Moreau, F. Huang, and B. Lee. Monitoring Fire Activities in the Boreal Ecosystem. Journal Geophys. Res,. 102(29):611–629, 1997.

    Google Scholar 

  29. J.P. LeSage. Bayesian estimation of spatial autoregressive models. International Regional Science Review, (20):113–129, 1997.

    Article  Google Scholar 

  30. D. Mark. Geographical Information Science: Critical Issues in an Emerging Cross-disciplinary Research Domain. In NSF Workshop, February 1999.

    Google Scholar 

  31. D.C. Nepstad, A. Verissimo, A. Alencar, C. Nobre, E. Lima, P. Lefebvre, P. Schlesinger, C. Potter, P. Moutinho, E. Mendoza, M. Cochrane, and V. Brooks. Large-scale Impoverishment of Amazonian Forests by Logging and Fire. Nature, 398:505–508, 1999.

    Article  Google Scholar 

  32. U. Ozesmi and W. Mitsch. A spatial habitat model for the Marsh-breeding red-winged black-bird(agelaius phoeniceus 1.) In coastal lake Erie wetlands. Ecological Modelling (Publisher: Elsevier Science B. V.), (101):139–152, 1997.

    Article  Google Scholar 

  33. S. Ozesmi and U. Ozesmi. An Artificial neural network approach to spatial habitat modeling with interspecific interaction. Ecological Modelling (Publisher: Elsevier Science B. V.), (116):15–31, 1999.

    Article  Google Scholar 

  34. F. Preparata and M. Shamos. Computatinal Geometry: An Introduction. Springer Verlag, 1988.

    Google Scholar 

  35. J.R. Quinlan. Induction of Decision Trees. Machine Learning, 1986.

    Google Scholar 

  36. I. Ruts and P. Rousseeuw. Computing depth contours of bivariate point clouds. In Computational Statistics and Data Analysis, 23:153–168, 1996.

    Google Scholar 

  37. S. Ramaswamy, R. Rastongi, and K. Shim. Efficient algorithms for mining outliers from large data sets. In Bell Laboratories, Murray Hill, NJ.

    Google Scholar 

  38. J.-F. Roddick and M. Spiliopoulou. A Bibliography of Temporal, Spatial and Spatio-Temporal Data Mining Research. SIGKDD Explorations 1(1): 34–38 (1999), 1999.

    Article  MATH  Google Scholar 

  39. S. Shekhar and S. Chawla. Spatial Databases: Issues, Implementation and Trends. Prentice Hall (under contract), 2001.

    Google Scholar 

  40. S. Shekhar, S. Chawla, S. Ravada, A. Fetterer, X. Liu, and C.-T. Lu. Spatial Databases - Accomplishments and Research Needs. Trans. on Knowledge and Data Engineering 11(1): 45–55 (1999), 1999.

    Article  Google Scholar 

  41. S. Shekhar and Y. Huang. Co-location Rules Mining: A Summary of Results. Proc. Spatio-temporal Symposium on Databases, 2001.

    Google Scholar 

  42. S. Shekhar, C.T. Lu, and P Zhang. Detecting Graph-based Spatial Outliers: Algorithms and Applications. In Department of Computer Science Techinical Report TR 01–014, University of Minnesota: http://tiberius.cs.umn.edu/tech-reports/listing/, 2001.

    Google Scholar 

  43. P. Stolorz, H. Nakamura, E. Mesrobian, R.R. Muntz, E.C. Shek, J.R. Santos, J. Yi, K. Ng, S.Y. Chien, R. Mechoso, and J.D. Farrara. Fast Spatio-Temporal Data Mining of Large Geophysical Datasets. In Proceedings of the First International Conference on Knowledge Discovery and Data Mining, AAAI Press, 300–305, 1995.

    Google Scholar 

  44. S. Shekhar, T.A. Yang, and P. Hancock. An Intelligent Vehicle Highway Information Management System. Intl Jr. on Microcomputers in Civil Engineering (Publisher: Blackwell Publishers), 8(3), 1993.

    Google Scholar 

  45. W.R. Tobler. Cellular Geography, Philosophy in Geography. Gale and Olsson, Eds., Dordrecht, Reidel, 1979.

    Google Scholar 

  46. M.F. Worboys. GIS A Computing Perspective. Taylor and Francis, 1995.

    Google Scholar 

  47. Y. Yasui and S.R. Lele. A Regression Method for Spatial Disease Rates: An Estimating Function Approach. Journal of the American Statistical Association, 94:21–32, 1997.

    Article  MATH  Google Scholar 

  48. D. Yu, G. Sheikholeslami, and A. Zhang. Findout: Finding outliers in very large datasets. In Departmerit of Computer Science and Engineering State University of New York at Buffalo, Technical report 99–03, http://www.cse. buffalo.edu/tech-reports/, 1999.

    Google Scholar 

Download references

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Shekhar, S., Huang, Y., Wu, W., Lu, C.T., Chawla, S. (2001). What’s Spatial About Spatial Data Mining: Three Case Studies. In: Grossman, R.L., Kamath, C., Kegelmeyer, P., Kumar, V., Namburu, R.R. (eds) Data Mining for Scientific and Engineering Applications. Massive Computing, vol 2. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-1733-7_26

Download citation

  • DOI: https://doi.org/10.1007/978-1-4615-1733-7_26

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4020-0114-7

  • Online ISBN: 978-1-4615-1733-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics