Multifrontal Computations on GPUs and Their Multi-core Hosts

Lucas, Robert F.; Wagenbreth, Gene; Davis, Dan M.; Grimes, Roger

doi:10.1007/978-3-642-19328-6_9

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 6449)

Included in the following conference series:

International Conference on High Performance Computing for Computational Science

Abstract

Using GPUs to accelerate the factorization of large sparse symmetric matrices could yield important benefits for a large group of widely used applications. This paper examines how a multifrontal sparse solver performs when it exploits both the GPU and its multi-core host. It demonstrates that the GPU can dramatically accelerate the solver relative to one host CPU. Furthermore, the solver can profitably use the GPU to factor its larger frontal matrices while multiple threads on the host handle the smaller frontal matrices.
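The division of labor described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the `Front` record, the `assign_fronts` helper, and the `gpu_min_order` cutoff are hypothetical names, and the abstract does not say how the solver decides which frontal matrices count as "larger", nor does it give any cutoff value.

```python
from dataclasses import dataclass


@dataclass
class Front:
    """Hypothetical record for one dense frontal matrix."""
    front_id: int
    order: int  # dimension of the dense frontal matrix


def assign_fronts(fronts, gpu_min_order):
    """Split fronts into a GPU list and a host list by a size cutoff.

    The fixed cutoff is an illustrative assumption, not the paper's rule.
    """
    gpu, host = [], []
    for front in fronts:
        # Larger fronts go to the GPU; smaller ones stay on host threads.
        (gpu if front.order >= gpu_min_order else host).append(front)
    return gpu, host


fronts = [Front(0, 40), Front(1, 2500), Front(2, 300), Front(3, 8000)]
gpu_fronts, host_fronts = assign_fronts(fronts, gpu_min_order=1000)
```

In a real solver the host list would be handed to a pool of threads and the GPU list to a device queue; the sketch only shows the partitioning step.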

Author information

Authors and Affiliations

Information Sciences Institute, University of Southern California, 4676 Admiralty Way, Suite 1001, Marina del Rey, California, 90230, USA
Robert F. Lucas, Gene Wagenbreth & Dan M. Davis
Livermore Software Technology Corporation, 7374 Las Positas Rd, Livermore, California, 94551, USA
Roger Grimes

Editor information

Editors and Affiliations

Faculdade de Engenharia da Universidade do Porto, Rua Dr. Roberto Frias s/n, 4200-465, Porto, Portugal
José M. Laginha M. Palma
INP (ENSEEIHT) IRIT, University of Toulouse, rue Charles-Camichel, CEDEX 7, 31071, Toulouse, France
Michel Daydé
Lawrence Berkeley National Laboratory, Berkeley, USA
Osni Marques
Faculty of Engineering, University of Porto, Rua Dr. Roberto Frias, s/n, 4200-465, Porto, Portugal
João Correia Lopes

About this paper

Cite this paper

Lucas, R.F., Wagenbreth, G., Davis, D.M., Grimes, R. (2011). Multifrontal Computations on GPUs and Their Multi-core Hosts. In: Palma, J.M.L.M., Daydé, M., Marques, O., Lopes, J.C. (eds) High Performance Computing for Computational Science – VECPAR 2010. VECPAR 2010. Lecture Notes in Computer Science, vol 6449. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19328-6_9

DOI: https://doi.org/10.1007/978-3-642-19328-6_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-19327-9
Online ISBN: 978-3-642-19328-6
eBook Packages: Computer Science, Computer Science (R0)
