Parallel relational database systems: Why, how and beyond

  • Invited Talk
  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1134)

Abstract

This paper gives a synthetic overview of how parallelism is integrated and used in relational databases on MIMD parallel architecture models. More precisely, after presenting the main goals of parallel relational databases, we stress that exploiting recent parallel architectures is essential to obtaining high performance. Parallelizing database programs requires data placement approaches and data partitioning strategies, from which the levels, forms, and types of parallelism can be extracted. For the inter-operation parallelization phase, which is the key optimization problem, we describe one-phase and two-phase parallelization strategies. This leads to unsolved problems that constitute a challenge for future parallel relational database systems.
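
To make the notion of data partitioning concrete, the following is a minimal sketch and is not taken from the paper. The abstract does not name specific strategies; round-robin, hash, and range partitioning are assumed here because they are common in the parallel database literature. The row layout, the function names, and the thread-based `parallel_select` helper are all hypothetical and serve only to show how partitioned data allows one operation to run on every partition independently.

```python
import bisect
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence, Tuple

# Hypothetical row layout: (integer key, payload).
Row = Tuple[int, str]


def round_robin(rows: Sequence[Row], n: int) -> List[List[Row]]:
    """Assign the i-th row to partition i mod n (assumed strategy)."""
    parts: List[List[Row]] = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        parts[i % n].append(row)
    return parts


def hash_partition(rows: Sequence[Row], n: int) -> List[List[Row]]:
    """Assign each row by its key modulo n, a simple deterministic hash."""
    parts: List[List[Row]] = [[] for _ in range(n)]
    for row in rows:
        parts[row[0] % n].append(row)
    return parts


def range_partition(rows: Sequence[Row], bounds: Sequence[int]) -> List[List[Row]]:
    """Assign each row by key range.

    `bounds` must be sorted; partition i holds keys k with
    bounds[i-1] <= k < bounds[i], so there are len(bounds) + 1 partitions.
    """
    parts: List[List[Row]] = [[] for _ in range(len(bounds) + 1)]
    for row in rows:
        parts[bisect.bisect_right(bounds, row[0])].append(row)
    return parts


def parallel_select(parts: List[List[Row]],
                    pred: Callable[[Row], bool]) -> List[Row]:
    """Apply the same selection to every partition concurrently.

    Threads stand in for the processors of a parallel machine; results
    are concatenated in partition order.
    """
    def scan(part: List[Row]) -> List[Row]:
        return [r for r in part if pred(r)]

    with ThreadPoolExecutor(max_workers=max(1, len(parts))) as pool:
        results = list(pool.map(scan, parts))
    return [row for part in results for row in part]
```

With range partitioning, a selection on the partitioning key could in principle be sent only to the partitions whose range can match, whereas round-robin spreads rows evenly but forces every partition to be scanned. This trade-off is one reason the choice of partitioning strategy matters for the parallelism that can be extracted.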

Editor information

Roland R. Wagner, Helmut Thoma

Copyright information

© 1996 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Hameurlain, A., Morvan, F. (1996). Parallel relational database systems: Why, how and beyond. In: Wagner, R.R., Thoma, H. (eds) Database and Expert Systems Applications. DEXA 1996. Lecture Notes in Computer Science, vol 1134. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0034690

  • DOI: https://doi.org/10.1007/BFb0034690

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-61656-6

  • Online ISBN: 978-3-540-70651-9

  • eBook Packages: Springer Book Archive
