Skip to main content

3D Shape Histograms for Similarity Search and Classification in Spatial Databases

  • Conference paper
  • First Online:
Advances in Spatial Databases (SSD 1999)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1651))

Included in the following conference series:

Abstract

Classification is one of the basic tasks of data mining in modern database applications including molecular biology, astronomy, mechanical engineering, medical imaging or meteorology. The underlying models have to consider spatial properties such as shape or extension as well as thematic attributes. We introduce 3D shape histograms as an intuitive and powerful similarity model for 3D objects. Particular flexibility is provided by using quadratic form distance functions in order to account for errors of measurement, sampling, and numerical rounding that all may result in small displacements and rotations of shapes. For query processing, a general filter-refinement architecture is employed that efficiently supports similarity search based on quadratic forms. An experimental evaluation in the context of molecular biology demonstrates both, the high classification accuracy of more than 90% and the good performance of the approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ankerst M., Braunmüller B., Kriegel H.-P., Seidl T.: Improving Adaptable Similarity Query Processing by Using Approximatios. Proc. 24th Int. Conf. on Very Large Databases (VLDB’98), New York, USA. Morgan Kaufmann (1998) 206–217

    Google Scholar 

  2. Ankerst M., Kriegel H.-P., Seidl T.: A Multi-Step Approach for Shape Similarity Search in Image Databases.IEEE Transactions on Knowledge and Data Engineering, Vol. 10, No. 6 (1998) 996–1004

    Article  Google Scholar 

  3. Berchtold S., Böhm C., Braunmüller B., Keim D., Kriegel H.-P.: Fast Parallel Similarity Search in Multimedia Databases. Proc. ACM SIGMOD Int. Conf. on Management of Data, Tucson, AZ. ACM Press (1997) 1–12, Best Paper Award

    Chapter  Google Scholar 

  4. Berchtold S., Böhm C., Keim D., Kriegel H.-P.: A Cost Model for Nearest Neighbor Search in High-Dimensional Data Spaces. Proc. 16th ACM SIGACT-SIGMODSIGART Symp. on Principles of Database Systems (PODS), Tucson, AZ (1997) 78–86

    Google Scholar 

  5. Berchtold S.: Geometry Based Search of Similar Mechanical Parts. Ph.D. Thesis, Institute for Computer Science, University of Munich.Shaker Verlag, Aachen (1997) in German

    Google Scholar 

  6. Bernstein F.C., Koetzle T.F., Williams G.J., Meyer E.F., Brice M.D., Rodgers J.R., Kennard O., Shimanovichi T., Tasumi M.: The Protein Data Bank: a Computer-based Archival File for Macromolecular Structures. Journal of Molecular Biology, Vol. 112 (1977) 535–542

    Article  Google Scholar 

  7. Berchtold S., Keim D., Kriegel H.-P.: The X-tree: An Index Structure for High-Dimensional Data.Proc. 22nd Int. Conf. on Very Large Data Bases (VLDB‘96), Mumbai, India. Morgan Kaufmann (1996) 28–39

    Google Scholar 

  8. Berchtold S., Kriegel H.-P.: S3: Similarity Search in CAD Database Systems. Proc. ACM SIGMOD Int. Conf. on Management of Data. ACM Press (1997) 564–567

    Google Scholar 

  9. Berchtold S., Keim D.A., Kriegel H.-P.: Using Extended Feature Objects for Partial Similarity Retrieval. VLDB Journal, Vol. 6, No. 4. Springer Verlag, Berlin Heidelberg New York (1997) 333–348

    Google Scholar 

  10. Berchtold S., Keim D.A., Kriegel H.-P.: Section Coding: A Method for Similarity Search in CAD Databases. Proc. German Conf. on Databases for Office Automation, Technology, and Science (BTW). Series Informatik Aktuell. Springer Verlag, Berlin Heidelberg New York (1997) 152–171; in German

    Google Scholar 

  11. Beckmann N., Kriegel H.-P., Schneider R., Seeger B.: The R*-tree: An Efficient and Robust Access Method for Points and Rectangles.Proc. ACM SIGMOD Int. Conf. on Management of Data, Atlantic City, NJ. ACM Press (1990) 322–331

    Google Scholar 

  12. Chen M.-S., Han J. and Yu P.S.: Data Mining: An Overview from a Database Perspective.IEEE Transactions on Knowledge and Data Engineering, Vol. 8, No. 6 (1996) 866–883

    Article  Google Scholar 

  13. Faloutsos C., Barber R., Flickner M., Hafner J., Niblack W., Petkovic D., Equitz W.: Efficient and Effective Querying by Image Content.Journal of Intelligent Information Systems, Vol. 3 (1994) 231–262

    Article  Google Scholar 

  14. Gaede V., Günther O.: Multidimensional Access Methods.ACM Computing Surveys, Vol. 30, No. 2 (1998) 170–231

    Article  Google Scholar 

  15. Gary J.E., Mehrotra R.: Similar Shape Retrieval Using a Structural Feature Index. Information Systems, Vol. 18, No. 7 (1993) 525–537

    Article  Google Scholar 

  16. Guttman A.: R-trees: A Dynamic Index Structure for Spatial Searching.Proc. ACM SIGMOD Int. Conf. on Management of Data, Boston, MA. ACM Press (1984) 47–57

    Google Scholar 

  17. Holm L., Sander C.: The FSSP Database of Structurally Aligned Protein Fold Families. Nucleic Acids Research, Vol. 22 (1994) 3600–3609

    Google Scholar 

  18. Hjaltason G.R., Samet H.: Ranking in Spatial Databases. Proc. 4th Int. Symposium on Large Spatial Databases (SSD’95). Lecture Notes in Computer Science, Vol. 951. Springer Verlag, Berlin Heidelberg New York (1995) 83–95

    Google Scholar 

  19. Holm L., Sander C.: Touring Protein Fold Space with Dali/FSSP.Nucleic Acids Research, Vol. 26 (1998) 316–319

    Article  Google Scholar 

  20. Hafner J., Sawhney H.S., Equitz W., Flickner M., Niblack W.: Efficient Color Histogram Indexing for Quadratic Form Distance Functions. IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 17, No. 7. IEEE Press (1995) 729–736

    Article  Google Scholar 

  21. Jagadish H.V.: A Retrieval Technique for Similar Shapes.Proc. ACM SIGMOD Int. Conf. on Management of Data. ACM Press (1991) 208–217

    Google Scholar 

  22. Kastenmüller G.: Shape-oriented Similarity Search in 3D Protein Database Systems. Diploma Thesis, Institute for Computer Science, University of Munich (1998) in German

    Google Scholar 

  23. Kriegel H.-P., Seidl T.: Approximation-Based Similarity Search for 3-D Surface Segments. GeoInformatica Journal, Vol. 2, No. 2. Kluwer Academic Publishers (1998) 113–147

    Article  Google Scholar 

  24. Korn F., Sidiropoulos N., Faloutsos C., Siegel E., Protopapas Z.: Fast Nearest Neighbor Search in Medical Image Databases. Proc. 22nd VLDB Conference, Mumbai, India. Morgan Kaufmann (1996) 215–226

    Google Scholar 

  25. Kriegel H.-P., Schmidt T., Seidl T.: 3D Similarity Search by Shape Approximation. Proc. Fifth Int. Symposium on Large Spatial Databases (SSD’97), Berlin, Germany. Lecture Notes in Computer Science, Vol. 1262. Springer Verlag, Berlin Heidelberg New York (1997) 11–28

    Google Scholar 

  26. Lamdan Y., Wolfson H.J.: Geometric Hashing: A General and Efficient Model-Based Recognition Scheme. Proc. IEEE Int. Conf. on Computer Vision, Tampa, Florida, 1988 238–249

    Google Scholar 

  27. Mitchell T.M.: Machine Learning. McCraw-Hill, (1997)

    Google Scholar 

  28. Michie D., Spiegelhalter D.J., Taylor C.C.: Machine Learning, Neural and Statistical Classification. Ellis Horwood (1994)

    Google Scholar 

  29. Orengo C.A., Michie A.D., Jones S., Jones D.T. Swindells M.B., Thornton, J.M.: CATH-A Hierarchic Classification of Protein Domain Structures. Structure, Vol. 5, No. 8 (1997)1093–1108

    Article  Google Scholar 

  30. Samet H.: The Design and Analysis of Spatial Data Structures. Addison Wesley (1990)

    Google Scholar 

  31. Seidl T.: Adaptable Similarity Search in 3-D Spatial Database Systems. Ph.D. Thesis, Institute for Computer Science, University of Munich (1997). Herbert Utz Verlag, Munich, http://utzverlag.com, ISBN: 3-89675-327-4

  32. Seidl T., Kriegel H.-P.: A 3D Molecular Surface Representation Supporting Neighborhood Queries. Proc. 4th Int. Symposium on Large Spatial Databases (SSD’95), Portland, Maine, USA. Lecture Notes in Computer Science, Vol. 951. Springer Verlag, Berlin Heidelberg New York (1995)240–258

    Google Scholar 

  33. Seidl T., Kriegel H.-P.: Efficient User-Adaptable Similarity Search in Large Multimedia Databases. Proc. 23rd Int. Conf. on Very Large Databases (VLDB’97), Athens, Greece. Morgan Kaufmann (1997) 506–515

    Google Scholar 

  34. Seidl T., Kriegel H.-P.: Optimal Multi-Step k-Nearest Neighbor Search. Proc. ACM SIGMOD Int. Conf. on Management of Data, Seattle, Washington (1998)154–165

    Google Scholar 

  35. Sellis T., Roussopoulos N., Faloutsos C.: The R+-Tree: A Dynamic Index for Multi-Dimensional Objects. Proc. 13th Int. Conf. on Very Large Databases, Brighton, England (1987) 507–518

    Google Scholar 

  36. Taubin G., Cooper D.B.: Recognition and Positioning of Rigid Objects Using Algebraic Moment Invariants. in Geometric Methods in Computer Vision, Vol. 1570, SPIE (1991) 175–186

    Google Scholar 

  37. Weiss S.M., Kulikowski C.A.: Computer Systems that Learn: Classification and Prediction Methods from Statistics, Neural Nets, Machine Learning, and Expert Systems. Morgan Kaufmann, San Francisco (1991)

    Google Scholar 

  38. Weber R., Schek H.-J., Blott S.: A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces. Proc. 24th Int. Conf. on Very Large Databases (VLDB’98), New York, USA. Morgan Kaufmann (1998) 194–205

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ankerst, M., Kastenmüller, G., Kriegel, HP., Seidl, T. (1999). 3D Shape Histograms for Similarity Search and Classification in Spatial Databases. In: Güting, R.H., Papadias, D., Lochovsky, F. (eds) Advances in Spatial Databases. SSD 1999. Lecture Notes in Computer Science, vol 1651. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48482-5_14

Download citation

  • DOI: https://doi.org/10.1007/3-540-48482-5_14

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66247-1

  • Online ISBN: 978-3-540-48482-0

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics