Abstract
Spatial data mining is the process of discovering interesting and previously unknown, but potentially useful, patterns from large spatial datasets. Extracting interesting and useful patterns from spatial datasets is more difficult than extracting the corresponding patterns from traditional numeric and categorical data due to the complexity of spatial data types, spatial relationships, and spatial autocorrelation. A popular approach is to apply classical data mining techniques after transforming spatial components into non-spatial components via feature selection. An alternative is to explore new models, new objective functions, and new patterns which are more suitable for spatial data and their unique properties. This chapter investigates techniques in the literature to incorporate spatial components via feature selection, new models, new objective functions, and new patterns.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
M. Ankerst, M.M. Breunig, H.P. Kriegel, and J. Sander. Optics: Ordering points to identify the clustering structure. In Proceedings of the 1999 ACM SIGMOD International Conference on Management of Data, Philadelphia, Pennsylvania, USA, pages 49–60, 1999.
R. Agrawal. Tutorial on database mining. In Thirteenth ACM Symposium on Principles of Databases Systems, pages 75–76, Minneapolis, MN, 1994.
P.S. Albert and L.M. McShane. A Generalized Estimating Equations Approach for Spatially Correlated Binary Data: Applications to the Analysis of Neuroimaging Data. Biometrics (Publisher: Washington, Biometric Society, Etc.), 1:627–638, 1995.
L Anselin. Spatial Econometrics: methods and models. Kluwer, Dordrecht, Netherlands, 1988.
R. Agrawal and R. Srikant. Fast algorithms for Mining Association Rules. In Proc. of Very Large Databases, may 1994.
M.M. Breunig, H.R Kriegel, R. T. Ng, and J. Sander. Opticsof: Identifying local outliers. In Proc. of PKDD’99, Prague, Czech Republic, Lecture Notes in Computer Science (LNAI 1704), pp. 262–270, Springer Verlag, 1999.
V. Barnett and T. Lewis. Outliers in Statistical Data. John Wiley, New York, 3rd edition, 1994.
Y. Boykov, O. Veksler, and R. Zabih. Fast Approximate Energy Minimization via Graph Cuts. Proc. of International Conference on Computer Vision, September 1999.
N.A. Cressie. Statistics for Spatial Data (Revised Edition). Wiley, New York, 1993.
M. Ester, H.-P. Kriegel, and J. Sander. Spatial Data Mining: A Database Approach. In Proc. Fifth Symposium on Rules in Geographic Information Databases, 1997.
G. Greenman. Turning a map into a cake layer of information. New York Times, http://www.nytimes.com/library/tech /00/01/circuits/articles/20giss.html, Feb 12 2000.
R.H. Guting. An Introduction to Spatial Database Systems. In Very Large Data Bases Journal(Publisher: Springer Verlag), October 1994.
R.J. Haining. Spatial Data Analysis in the Social and Environmental Sciences. In Cambridge University Press, Cambridge, U.K, 1989.
D. Hawkins. Identification of Outliers. Chapman and Hall, 1980.
M. Hohn, L. Gribki, and A.E. Liebhold. A Geostatistical Model for Forecasting the Spatial Dynamics of Defoliation Caused by the Beypsy MothLymantria dispar (Lepidoptera: Lymantriidae). Environmental Entomology (Publisher: Entomological Society of America, 22:1066–1075, 1993.
J. Hipp, U. Guntzer, and G. Nakaeizadeh. Algorithms for Association Rule Mining - A General Survey and Comparison. In Proc. A CM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2000.
Issaks, Edward, and M. Svivastava. Applied Geostatistics. In Oxford University Press, Oxford, 1989.
R. Johnson. Applied Multivariate Statistical Analysis. Prentice Hall, 1992.
K. Koperski, J. Adhikary, and J. Han. Knowledge Discovery in Spatial Databases: Progress and Challenges. In Proc. ACM SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery, Montreal, Canada. 55–70, 1996.
K. Koperski and J. Han. Discovery of Spatial Association Rules in Geographic Information Databases. In Proc. Fourth International Symposium on Large Spatial Databases, Maine. 47–66, 1995.
K. Koperski, J. Han, and N. Stefanovic. An Efficient Two-Step Method for Classification of Spatial Data. 1998.
E.M. Knorr and R.T. Ng. Extraction of Spatial Proximity Patterns by Concept Generalization. In Proc. Second International Conference on Knowledge Discovery and Data Mining, Portland, Oregon. AAAI Press. 347–350, 1996.
E. Knorr and R. Ng. A unified notion of outliers: Properties and computation. In Proc. of the International Conference on Knowledge Discovery and Data Mining, pages 219–222, 1997.
E. Knorr and R. Ng. Algorithms for mining distance-based outliers in large datasets. In Proc. 24th VLDB Conference, 1998.
E.M. Knorr, R.T. Ng, and D.L. Shilvock. Finding boundary shape matching relationships in spatial data. In Proc. 5th International Symposium, SSD 97. Springer-Verlag, Berlin, 29–46, 1997.
E.M. Knorr and Ng R.T. Finding aggregate proximity relationships and commonalities in spatial data mining. IEEE Trans. Knowl. and Data Eng. 8(6):884–897, 1996.
P. Krugman. Development, Geography, and Economic theory. In MIT Press, Cambridge, MA, 1995.
Z. Li, J. Cihlar, L. Moreau, F. Huang, and B. Lee. Monitoring Fire Activities in the Boreal Ecosystem. Journal Geophys. Res,. 102(29):611–629, 1997.
J.P. LeSage. Bayesian estimation of spatial autoregressive models. International Regional Science Review, (20):113–129, 1997.
D. Mark. Geographical Information Science: Critical Issues in an Emerging Cross-disciplinary Research Domain. In NSF Workshop, February 1999.
D.C. Nepstad, A. Verissimo, A. Alencar, C. Nobre, E. Lima, P. Lefebvre, P. Schlesinger, C. Potter, P. Moutinho, E. Mendoza, M. Cochrane, and V. Brooks. Large-scale Impoverishment of Amazonian Forests by Logging and Fire. Nature, 398:505–508, 1999.
U. Ozesmi and W. Mitsch. A spatial habitat model for the Marsh-breeding red-winged black-bird(agelaius phoeniceus 1.) In coastal lake Erie wetlands. Ecological Modelling (Publisher: Elsevier Science B. V.), (101):139–152, 1997.
S. Ozesmi and U. Ozesmi. An Artificial neural network approach to spatial habitat modeling with interspecific interaction. Ecological Modelling (Publisher: Elsevier Science B. V.), (116):15–31, 1999.
F. Preparata and M. Shamos. Computatinal Geometry: An Introduction. Springer Verlag, 1988.
J.R. Quinlan. Induction of Decision Trees. Machine Learning, 1986.
I. Ruts and P. Rousseeuw. Computing depth contours of bivariate point clouds. In Computational Statistics and Data Analysis, 23:153–168, 1996.
S. Ramaswamy, R. Rastongi, and K. Shim. Efficient algorithms for mining outliers from large data sets. In Bell Laboratories, Murray Hill, NJ.
J.-F. Roddick and M. Spiliopoulou. A Bibliography of Temporal, Spatial and Spatio-Temporal Data Mining Research. SIGKDD Explorations 1(1): 34–38 (1999), 1999.
S. Shekhar and S. Chawla. Spatial Databases: Issues, Implementation and Trends. Prentice Hall (under contract), 2001.
S. Shekhar, S. Chawla, S. Ravada, A. Fetterer, X. Liu, and C.-T. Lu. Spatial Databases - Accomplishments and Research Needs. Trans. on Knowledge and Data Engineering 11(1): 45–55 (1999), 1999.
S. Shekhar and Y. Huang. Co-location Rules Mining: A Summary of Results. Proc. Spatio-temporal Symposium on Databases, 2001.
S. Shekhar, C.T. Lu, and P Zhang. Detecting Graph-based Spatial Outliers: Algorithms and Applications. In Department of Computer Science Techinical Report TR 01–014, University of Minnesota: http://tiberius.cs.umn.edu/tech-reports/listing/, 2001.
P. Stolorz, H. Nakamura, E. Mesrobian, R.R. Muntz, E.C. Shek, J.R. Santos, J. Yi, K. Ng, S.Y. Chien, R. Mechoso, and J.D. Farrara. Fast Spatio-Temporal Data Mining of Large Geophysical Datasets. In Proceedings of the First International Conference on Knowledge Discovery and Data Mining, AAAI Press, 300–305, 1995.
S. Shekhar, T.A. Yang, and P. Hancock. An Intelligent Vehicle Highway Information Management System. Intl Jr. on Microcomputers in Civil Engineering (Publisher: Blackwell Publishers), 8(3), 1993.
W.R. Tobler. Cellular Geography, Philosophy in Geography. Gale and Olsson, Eds., Dordrecht, Reidel, 1979.
M.F. Worboys. GIS A Computing Perspective. Taylor and Francis, 1995.
Y. Yasui and S.R. Lele. A Regression Method for Spatial Disease Rates: An Estimating Function Approach. Journal of the American Statistical Association, 94:21–32, 1997.
D. Yu, G. Sheikholeslami, and A. Zhang. Findout: Finding outliers in very large datasets. In Departmerit of Computer Science and Engineering State University of New York at Buffalo, Technical report 99–03, http://www.cse. buffalo.edu/tech-reports/, 1999.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Shekhar, S., Huang, Y., Wu, W., Lu, C.T., Chawla, S. (2001). What’s Spatial About Spatial Data Mining: Three Case Studies. In: Grossman, R.L., Kamath, C., Kegelmeyer, P., Kumar, V., Namburu, R.R. (eds) Data Mining for Scientific and Engineering Applications. Massive Computing, vol 2. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-1733-7_26
Download citation
DOI: https://doi.org/10.1007/978-1-4615-1733-7_26
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4020-0114-7
Online ISBN: 978-1-4615-1733-7
eBook Packages: Springer Book Archive