Skip to main content

Multifrontal Computations on GPUs and Their Multi-core Hosts

  • Conference paper
High Performance Computing for Computational Science – VECPAR 2010 (VECPAR 2010)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6449))

Abstract

The use of GPUs to accelerate the factoring of large sparse symmetric matrices shows the potential of yielding important benefits to a large group of widely used applications. This paper examines how a multifrontal sparse solver performs when exploiting both the GPU and its multi-core host. It demonstrates that the GPU can dramatically accelerate the solver relative to one host CPU. Furthermore, the solver can profitably exploit both the GPU to factor its larger frontal matrices and multiple threads on the host to handle the smaller frontal matrices.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Heath, M., Ng, E., Peyton, B.: Parallel algorithms for sparse linear systems. Society for Industrial and Applied Mathematics Review 33, 420–460 (1991)

    MathSciNet  MATH  Google Scholar 

  2. Charlesworth, A., Gustafson, J.: Introducing Replicated VLSI to Supercomputing: the FPS-164/MAX Scientific Computer. IEEE Computer 19(3), 10–23 (1986)

    Article  Google Scholar 

  3. Pham, D.C., Aipperspach, T., Boerstler, D., Bolliger, M., Chaudhry, R., Cox, D., Harvey, P., Harvey, P.M., Hofstee, H.P., Johns, C., Kahle, J., Kameyama, A., Keaty, J., Masubuchi, Y., Pham, M., Pille, J., Posluszny, S., Riley, M., Stasiak, D.L., Suzuoki, M., Takahashi, O., Warnock, J., Weitzel, S., Wendel, D., Yazawa, K.: Overview of the Architecture, Circuit Design, and Physical Implementation of a First-Generation Cell Processor. IEEE Journal of Solid State Circuits 41(1) (January 2006)

    Google Scholar 

  4. Lastra, A., Lin, M., Minocha, D.: ACM Workshop on General Purpose Computations on Graphics Processors (2004)

    Google Scholar 

  5. Duff, I., Reid, J.: The Multifrontal Solution of Indefinite Sparse Symmetric Linear Systems. ACM Transactions on Mathematical Software 9, 302–335 (1983)

    Article  MathSciNet  MATH  Google Scholar 

  6. Dongarra, J.J., Du Croz, J., Hammarling, S., Duff, I.S.: A Set of Level 3 Basic Linear Algebra Subprograms. ACM Transactions on Mathematical Software 16(1), 1–17 (1990)

    Article  MATH  Google Scholar 

  7. Scott Larson, E., McAllister, D.: Fast matrix multiplies using graphics hardware. In: Proceedings of the 2001 ACM/IEEE Conference on Supercomputing, p. 55. ACM Press, New York (2001)

    Chapter  Google Scholar 

  8. Fatahalian, K., Sugarman, J., Hanrahan, P.: Understanding the Efficiency of GPU Algorithms for Matrix-Matrix Multiplication. In: Proceedings of the ACM Sigraph/Eurographics Conference on Graphics Hardware. Eurographics Association, pp. 133–138 (2004)

    Google Scholar 

  9. Govindaraju, N., Manocha, D.: Cache-Efficient Numerical Algorithms Using Graphics Hardware, University of North Carolina Technical Report (2007)

    Google Scholar 

  10. Lucas, R.F.: GPU-Enhanced Linear Solver Results. In: The Proceedings of Parallel Processing for Scientific Computing. SIAM, Philadelphia (2008)

    Google Scholar 

  11. Private communication with Gene Poole, ANSYS Inc., at SC|2008, Austin, TX (November 2008)

    Google Scholar 

  12. cqse.ntu.edu.tw/cqse/download_file/DPierce_20090116.pdf

  13. Karypis, G., Kumar, V.: A fast and high quality multilevel scheme for partitioning irregular graphs. In: Haridi, S., Ali, K., Magnusson, P. (eds.) Euro-Par 1995. LNCS, vol. 966, pp. 113–122. Springer, Heidelberg (1995)

    Google Scholar 

  14. Ashcraft, C., Grimes, R.: The Influence of Relaxed Supernode Partitions on the Multifrontal Method. ACM Transactions in Mathematical Software 15, 291–309 (1989)

    Article  MATH  Google Scholar 

  15. Ashcraft, C., Lucas, R.: A Stackless Multifrontal Method. In: Tenth SIAM Conference on Parallel Processing for Scientific Computing (March 2001)

    Google Scholar 

  16. Arnold, M.G., Bailey, T.A., Cowles, J.R., Winkel, M.D.: Applying Features of IEEE 754 to Sign/Logarithm Arithmetic. IEEE Transactions on Computers 41(8), 1040–1050 (1992)

    Article  Google Scholar 

  17. Buck, I.: GPU Computing: Programming a Massively Parallel Processor. In: International Symposium on Code Generation and Optimization, San Jose, California

    Google Scholar 

  18. Duff, I.: Parallel Implementation of Multifrontal Schemes. Parallel Computing 3, 193–204 (1986)

    Article  MathSciNet  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lucas, R.F., Wagenbreth, G., Davis, D.M., Grimes, R. (2011). Multifrontal Computations on GPUs and Their Multi-core Hosts. In: Palma, J.M.L.M., Daydé, M., Marques, O., Lopes, J.C. (eds) High Performance Computing for Computational Science – VECPAR 2010. VECPAR 2010. Lecture Notes in Computer Science, vol 6449. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-19328-6_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-19328-6_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-19327-9

  • Online ISBN: 978-3-642-19328-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics