Skip to main content

P-CBF: A Parallel Cell-Based Filtering Scheme Using a Horizontal Partitioning Technique

  • Conference paper
High Performance Computing and Communications (HPCC 2005)

Part of the book series: Lecture Notes in Computer Science ((LNCCN,volume 3726))

  • 803 Accesses

Abstract

To efficiently retrieve high-dimensional data in data warehousing and multimedia database applications, many high-dimensional index structures have been proposed, but they suffer from the so called ’dimensional curse’ problem, i.e., the retrieval performance becomes increasingly degraded as the dimensionality is increased. To solve this problem, the cell-based filtering (CBF) scheme has been proposed, but it shows a linear decrease in performance as the dimensionality is increased. In this paper, we propose a parallel CBF scheme using a horizontal partitioning technique, which is called P-CBF, so as to cope with the linear decrease in retrieval performance. To achieve it, we construct our P-CBF scheme under an SN(Shared Nothing) cluster-based parallel architecture. In addition, we present data insertion, range query processing and k-NN query processing algorithms which are suitable for the SN architecture. Finally, we show that our P-CBF scheme achieves good retrieval performance in proportion to the number of servers in the SN architecture and that it outperforms a parallel version of the VA-File when the dimensionality is over 10.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Robinson, J.T.: The K-D-B-tree: A Search Structure for Large Multidimensional Dynamic Indexes. In: Proc. ACM SIGMOD Int. Conf. on Management of Data, pp. 10–18 (1981)

    Google Scholar 

  2. White, D.A., Jain, R.: Similarity Indexing: Algorithms and Performance. In: Proc. of the SPIE: Storage and Retrieval for Image and Video Databases IV, vol. 2670, pp. 62–75 (1996)

    Google Scholar 

  3. Lin, H.I., Jagadish, H., Faloutsos, C.: The TV-tree: An Index Structure for High Dimensional Data. VLDB Journal 3, 517–542 (1995)

    Article  Google Scholar 

  4. Berchtold, S., Keim, D.A., Kriegel, H.-P.: The X-tree: An Index Structure for High-Dimensional Data. In: Proceedings of the 22nd VLDB Conference, pp. 28–39 (1996)

    Google Scholar 

  5. Arya, S., Mount, D.M., Narayan, O.: Accounting for Boundary Effects in Nearest Neighbor Searching. In: Proc. 11th Annaual Symp. on Computational Geometry, Vancouver, Canada, pp. 336–344 (1995)

    Google Scholar 

  6. Berchtold, S., Bohm, C., Keim, D., Kriegel, H.-P: A Cost Model for Nearest Neighbor Search in High-Dimensional Data Space. In: ACM PODS Symposium on Principles of Databases Systems, Tucson, Arizona (1997)

    Google Scholar 

  7. Weber, R., Schek, H.-J., Blott, S.: A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces. In: Proceedings of 24th International Conference on Very Large Data Bases, pp. 24–27 (1998)

    Google Scholar 

  8. Han, S.-G., Chang, J.-W.: A new high-dimensional index structure using a cell-based filtering technique. In: Masunaga, Y., Thalheim, B., Å tuller, J., PokornĂ½, J. (eds.) ADBIS 2000 and DASFAA 2000. LNCS, vol. 1884, pp. 79–92. Springer, Heidelberg (2000)

    Chapter  Google Scholar 

  9. Faloutsos, C.: Design of a Signature File Method that Accounts for Non-Uniform Occurrence and Query Frequencies. ACM SIGMOD, 165–170 (1985)

    Google Scholar 

  10. Kim, J.-K., Chang, J.-W.: Horizontally-divided Signature File on a Parallel Machine Architecture. Journal of Systems Architecture 44(9-10), 723–735 (1998)

    Article  Google Scholar 

  11. Kim, J.-K., Chang, J.-W.: Vertically-partitioned Parallel Signature File Method. Journal of Systems Architecture 46(8), 655–673 (2000)

    Article  Google Scholar 

  12. Roussopoulos, N., Kelley, S., Vincent, F.: Nearest Neighbor Queries. In: Proc. ACM Int. Conf. on Management of Data(SIGMOD), pp. 71–79 (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chang, JW., Kim, YC. (2005). P-CBF: A Parallel Cell-Based Filtering Scheme Using a Horizontal Partitioning Technique. In: Yang, L.T., Rana, O.F., Di Martino, B., Dongarra, J. (eds) High Performance Computing and Communications. HPCC 2005. Lecture Notes in Computer Science, vol 3726. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11557654_30

Download citation

  • DOI: https://doi.org/10.1007/11557654_30

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29031-5

  • Online ISBN: 978-3-540-32079-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics