Skip to main content
  • 181 Accesses

Abstract

In the previous chapter, parallel and vectorized Choleski algorithms which were based on the skyline (column-by-column) storage scheme for shared memory computers (such as the Cray-2, Cray-YMP, Cray-C90, etc..) had been discussed. The factorized algorithms discussed in Chapter 4 have been based upon the “dot product” operations. For certain types of shared memory computers (such as Cray-YMP, Cray-C90, etc.), “Saxpy” operations (to be explained in more detail, later on in this chapter) are known to be faster than “dot product” operations[5.1]. The skyline storage scheme and its associated parallel and vectorized algorithms has been found to prohibit the traditional “loop unrolling” technique used to optimize vector performance, so a less powerful “vector unrolling” strategy was used. This chapter describes a different algorithm that overcomes the deficiency of skyline storage by using a variable bandwidth storage scheme. The objective of this chapter is to describe this new algorithm for solving matrix equations and to demonstrate its accuracy and speed by solving large-scale structural analysis applications on shared memory (such as Cray) supercomputers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

5.15 References

  1. Agarwal, T.K., O.O. Storaasli and D.T. Nguyen, “A Parallel-Vector Algorithm for Rapid Structural Analysis on High-Performance Computers”,Proceedings of the AIAA/ASME/ASCE/AHS 31st SDM Conference, Long Beach, CA, AIAA paper No. 90-1149, April 2-4, 1990.

    Google Scholar 

  2. Bathe, K.J.,Finite Element Procedures Prentice-Hall, Inc., New York, (1996).

    Google Scholar 

  3. George, A. and J. W-H Liu,Computer Solution of Large Sparse Positive Definite Systems, Prentice-Hall, Inc., Englewood Cliffs, NJ, 1981.

    Google Scholar 

  4. Jordan, H.F., M.S. Benton, N.S. Arenstorf, and A.V. Ramann, “Force User’s Manual: A Portable Parallel FORTRAN”, NASA CR 4265, January, 1990.

    Google Scholar 

  5. Storaasli, O.O., D.T. Nguyen, and T.K. Agarwal, “The Parallel Solution of Large-Scale Structural Analysis Problems on Supercomputers”, Proceedings of the A1AA/ASME/ASCE/ HAS 30th Structures, Structural Dynamics and Materials Conference Mobile, AL, April 3-5, 1989, pp.859-867, Paper No. 89-1259 (also appeared in AIAA Journal, September, 1990)

    Google Scholar 

  6. Robins, W.A. et al., “Concept Development of a Mach 3.0 High-Speed Civil Transport”, NASA TM 4058, September, 1988.

    Google Scholar 

  7. Stewart, C.B. (compiler), “The Computational Structural Mechanics Testbed User’s Manual” , NASA TM-100644, October, 1989.

    Google Scholar 

  8. Knight, N.F., S.L. McCleary, S.C. Macy, and M.A. Aminpour, “Large Scale Structural Analysis: The Structural Analyst, The CSM Testbed, and the NAS System”, NASA TM-100643, March, 1989.

    Google Scholar 

  9. Knight, N.F., R.E. Gillian, and M.P. Nemeth, “Preliminary 2-D Shell Analysis of the Space Shuttle Solid Rocket Booster”, NASA TM-100515, 1987.

    Google Scholar 

  10. Ashcraft, C.C., R.G. Grimes, J.G. Lewis, B.W. Peyton, and H.D. Simon, “Progress in Sparse Matrix Methods for Large Linear Systems on Vector Supercomputers”, The International Journal of Supercomputer Applications, Vol. 1, No. 4, Winter 1987, pp. 10–30.

    Article  Google Scholar 

  11. Simon, H., P. Vu, and C. Yang, “Performance of a Supernodal General Sparse Solver on the Cray Y-MP: 1.68 GFLOPS with Autotasking”, Scientific and Computing Analysis Division Report SCA-TR-117, Boeing Computer Services, Seattle, WA, March, 1989.

    Google Scholar 

  12. Storaasli, O.O., D.T. Nguyen, and T.K. Agarwal, “Force on the Cray Y-MP”,/u/nas/news The Numerical Aerodynamic Simulation Program Newsletter, NASA Ames Research Center, Vol. 4, No. 7, July, 1989, pp. 1–4.

    Google Scholar 

  13. Storaasli, O.O., “New Equation Solver for Supercomputers”,/u/nas/news The Numerical Aerodynamic Simulation Program NewsletterNASA Ames Research Center, Vol. 5, No. 1, January, 1990, pp. 1–3.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer Science+Business Media New York

About this chapter

Cite this chapter

Nguyen, D.T. (2002). Parallel — Vector Variable Bandwidth Equation Solver on Shared Memory Computers. In: Parallel-Vector Equation Solvers for Finite Element Engineering Applications. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-1337-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-1-4615-1337-7_5

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-5504-5

  • Online ISBN: 978-1-4615-1337-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics