Parallel — Vector Variable Bandwidth Equation Solver on Shared Memory Computers

Nguyen, Due Thai

doi:10.1007/978-1-4615-1337-7_5

Due Thai Nguyen²

181 Accesses

Abstract

In the previous chapter, parallel and vectorized Choleski algorithms which were based on the skyline (column-by-column) storage scheme for shared memory computers (such as the Cray-2, Cray-YMP, Cray-C90, etc..) had been discussed. The factorized algorithms discussed in Chapter 4 have been based upon the “dot product” operations. For certain types of shared memory computers (such as Cray-YMP, Cray-C90, etc.), “Saxpy” operations (to be explained in more detail, later on in this chapter) are known to be faster than “dot product” operations[5.1]. The skyline storage scheme and its associated parallel and vectorized algorithms has been found to prohibit the traditional “loop unrolling” technique used to optimize vector performance, so a less powerful “vector unrolling” strategy was used. This chapter describes a different algorithm that overcomes the deficiency of skyline storage by using a variable bandwidth storage scheme. The objective of this chapter is to describe this new algorithm for solving matrix equations and to demonstrate its accuracy and speed by solving large-scale structural analysis applications on shared memory (such as Cray) supercomputers.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

5.15 References

Agarwal, T.K., O.O. Storaasli and D.T. Nguyen, “A Parallel-Vector Algorithm for Rapid Structural Analysis on High-Performance Computers”,Proceedings of the AIAA/ASME/ASCE/AHS 31st SDM Conference, Long Beach, CA, AIAA paper No. 90-1149, April 2-4, 1990.
Google Scholar
Bathe, K.J.,Finite Element Procedures Prentice-Hall, Inc., New York, (1996).
Google Scholar
George, A. and J. W-H Liu,Computer Solution of Large Sparse Positive Definite Systems, Prentice-Hall, Inc., Englewood Cliffs, NJ, 1981.
Google Scholar
Jordan, H.F., M.S. Benton, N.S. Arenstorf, and A.V. Ramann, “Force User’s Manual: A Portable Parallel FORTRAN”, NASA CR 4265, January, 1990.
Google Scholar
Storaasli, O.O., D.T. Nguyen, and T.K. Agarwal, “The Parallel Solution of Large-Scale Structural Analysis Problems on Supercomputers”, Proceedings of the A1AA/ASME/ASCE/ HAS 30th Structures, Structural Dynamics and Materials Conference Mobile, AL, April 3-5, 1989, pp.859-867, Paper No. 89-1259 (also appeared in AIAA Journal, September, 1990)
Google Scholar
Robins, W.A. et al., “Concept Development of a Mach 3.0 High-Speed Civil Transport”, NASA TM 4058, September, 1988.
Google Scholar
Stewart, C.B. (compiler), “The Computational Structural Mechanics Testbed User’s Manual” , NASA TM-100644, October, 1989.
Google Scholar
Knight, N.F., S.L. McCleary, S.C. Macy, and M.A. Aminpour, “Large Scale Structural Analysis: The Structural Analyst, The CSM Testbed, and the NAS System”, NASA TM-100643, March, 1989.
Google Scholar
Knight, N.F., R.E. Gillian, and M.P. Nemeth, “Preliminary 2-D Shell Analysis of the Space Shuttle Solid Rocket Booster”, NASA TM-100515, 1987.
Google Scholar
Ashcraft, C.C., R.G. Grimes, J.G. Lewis, B.W. Peyton, and H.D. Simon, “Progress in Sparse Matrix Methods for Large Linear Systems on Vector Supercomputers”, The International Journal of Supercomputer Applications, Vol. 1, No. 4, Winter 1987, pp. 10–30.
Article Google Scholar
Simon, H., P. Vu, and C. Yang, “Performance of a Supernodal General Sparse Solver on the Cray Y-MP: 1.68 GFLOPS with Autotasking”, Scientific and Computing Analysis Division Report SCA-TR-117, Boeing Computer Services, Seattle, WA, March, 1989.
Google Scholar
Storaasli, O.O., D.T. Nguyen, and T.K. Agarwal, “Force on the Cray Y-MP”,/u/nas/news The Numerical Aerodynamic Simulation Program Newsletter, NASA Ames Research Center, Vol. 4, No. 7, July, 1989, pp. 1–4.
Google Scholar
Storaasli, O.O., “New Equation Solver for Supercomputers”,/u/nas/news The Numerical Aerodynamic Simulation Program NewsletterNASA Ames Research Center, Vol. 5, No. 1, January, 1990, pp. 1–3.
Google Scholar

Download references

Author information

Authors and Affiliations

Old Dominion University, Norfolk, Virginia, USA
Due Thai Nguyen

Authors

Due Thai Nguyen
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Nguyen, D.T. (2002). Parallel — Vector Variable Bandwidth Equation Solver on Shared Memory Computers. In: Parallel-Vector Equation Solvers for Finite Element Engineering Applications. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-1337-7_5

Download citation

DOI: https://doi.org/10.1007/978-1-4615-1337-7_5
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5504-5
Online ISBN: 978-1-4615-1337-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics