Using Hybrid CPU-GPU Platforms to Accelerate the Computation of the Matrix Sign Function

Benner, Peter; Ezzatti, Pablo; Quintana-Ortí, Enrique S.; Remón, Alfredo

doi:10.1007/978-3-642-14122-5_17

Peter Benner⁸,
Pablo Ezzatti⁹,
Enrique S. Quintana-Ortí¹⁰ &
…
Alfredo Remón¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 6043))

Included in the following conference series:

European Conference on Parallel Processing

1409 Accesses
13 Citations

Abstract

We investigate the numerical computation of the matrix sign function of large-scale dense matrices. This is a common task in various application areas. The main computational work in Newton’s iteration for the matrix sign function consits of matrix inversion. Therefore, we investigate the performance of two approaches for matrix inversion based on Gaussian (LU factorization) and Gauss-Jordan eliminations. The target architecture is a current general-purpose multi-core processor connected to a graphics processor. Parallelism is extracted in both processors by linking sequential versions of the codes with multi-threaded implementations of BLAS. Our results on a system with two Intel QuadCore processors and an nvidia Tesla C1060 illustrate the performance and scalability attained by the codes on this system.

Download to read the full chapter text

Chapter PDF

Computing the Sparse Matrix-Vector Product in High-Precision Arithmetic for GPU Architectures

Accelerating Numerical Dense Linear Algebra Calculations with GPUs

Revisiting the Gauss-Huard Algorithm for the Solution of Linear Systems on Graphics Accelerators

Keywords

References

Golub, G., Loan, C.V.: Matrix Computations, 3rd edn. The Johns Hopkins University Press, Baltimore (1996)
MATH Google Scholar
Petkov, P., Christov, N., Konstantinov, M.: Computational Methods for Linear Control Systems. Prentice Hall, Hertfordshire (1991)
MATH Google Scholar
Frommer, A., Lippert, T., Medeke, B., Schilling, K. (eds.): Numerical Challenges in Lattice Quantum Chromodynamics. Lecture Notes in Computational Science and Engineering, vol. 15. Springer, Heidelberg (2000)
Google Scholar
Byers, R.: Solving the algebraic Riccati equation with the matrix sign function. Linear Algebra Appl. 85, 267–279 (1987)
Article MathSciNet Google Scholar
IMTEK, Oberwolfach model reduction benchmark collection, http://www.imtek.de/simulation/benchmark/
Benner, P., Quintana-Ortí, E.: Solving stable generalized Lyapunov equations with the matrix sign function. Numer. Algorithms 20, 75–100 (1999)
Article MathSciNet Google Scholar
Benner, P., Claver, J., Quintana-Ortí, E.: Parallel distributed solvers for large stable generalized Lyapunov equations. Parallel Processing Letters 9, 147–158 (1999)
Article MathSciNet Google Scholar
Benner, P., Quintana-Ortí, E., Quintana-Ortí, G.: A portable subroutine library for solving linear control problems on distributed memory computers. In: Cooperman, G., Jessen, E., Michler, G. (eds.) Workshop on Wide Area Networks and High Performance Computing, Essen (Germany), September 1998. Lecture Notes in Control and Information, pp. 61–88. Springer, Heidelberg (1999)
Chapter Google Scholar
Benner, P., Quintana-Ortí, E., Quintana-Ortí, G.: State-space truncation methods for parallel model reduction of large-scale systems. Parallel Comput. 29, 1701–1722 (2003)
Article MathSciNet Google Scholar
Benner, P., Quintana-Ortí, E., Quintana-Ortí, G.: Solving linear-quadratic optimal control problems on parallel computers. Optimization Methods & Software 23, 879–909 (2008)
Article MathSciNet Google Scholar
Lawson, C.L., Hanson, R.J., Kincaid, D.R., Krogh, F.T.: Basic linear algebra subprograms for Fortran usage. ACM Trans. Math. Soft. 5, 308–323 (1979)
Article Google Scholar
Dongarra, J.J., Croz, J.D., Hammarling, S., Hanson, R.J.: An extended set of FORTRAN basic linear algebra subprograms. ACM Trans. Math. Soft. 14, 1–17 (1988)
Article Google Scholar
Dongarra, J.J., Croz, J.D., Hammarling, S., Duff, I.: A set of level 3 basic linear algebra subprograms. ACM Trans. Math. Soft. 16, 1–17 (1990)
Article MathSciNet Google Scholar
Anderson, E., Bai, Z., Demmel, J., Dongarra, J.E., DuCroz, J., Greenbaum, A., Hammarling, S., McKenney, A.E., Ostrouchov, S., Sorensen, D.: LAPACK Users’ Guide. SIAM, Philadelphia (1992)
MATH Google Scholar
Gerbessiotis, A.V.: Algorithmic and Practical Considerations for Dense Matrix Computations on the BSP Model. PRG-TR 32, Oxford University Computing Laboratory (1997)
Google Scholar
Gunnels, J.A., Gustavson, F.G., Henry, G.M., van de Geijn, R.A.: FLAME: Formal linear algebra methods environment. ACM Trans. Math. Soft. 27, 422–455 (2001)
Article Google Scholar
Bientinesi, P., Gunnels, J.A., Myers, M.E., Quintana-Ortí, E.S., van de Geijn, R.A.: The science of deriving dense linear algebra algorithms. ACM Trans. Math. Soft. 31, 1–26 (2005)
Article MathSciNet Google Scholar
University of Texas, http://www.cs.utexas.edu/~flame/
Quintana-Ortí, E., Quintana-Ortí, G., Sun, X., van de Geijn, R.: A note on parallel matrix inversion. SIAM J. Sci. Comput. 22, 1762–1771 (2001)
Article MathSciNet Google Scholar
Texas Advanced Computing Center, http://www.tacc.utexas.edu/~kgoto/
Intel Corporation, http://www.intel.com/
Nvidia Corporation, http://www.nvidia.com/cuda/
Nicholas, J.N.: Accuracy and stability of numerical algorithms, Philadelphia, PA, USA (1996)
Google Scholar

Download references

Author information

Authors and Affiliations

Fakultät für Mathematik, Chemnitz University of Technology, D-09107, Chemnitz, Germany
Peter Benner
Centro de Cálculo–Instituto de la Computación, Universidad de la República, 11.300, Montevideo, Uruguay
Pablo Ezzatti
Depto. de Ingeniería y Ciencia de Computadores, Universidad Jaume I, 12.071, Castellón, Spain
Enrique S. Quintana-Ortí & Alfredo Remón

Authors

Peter Benner
View author publications
You can also search for this author in PubMed Google Scholar
Pablo Ezzatti
View author publications
You can also search for this author in PubMed Google Scholar
Enrique S. Quintana-Ortí
View author publications
You can also search for this author in PubMed Google Scholar
Alfredo Remón
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Insitute for Applied Mathematics, Delft University of Technology, 2628, Delft, The Netherlands
Hai-Xiang Lin
Scaledinfra technologies GmbH, Köllnerhofgasse 3/15A, 1010, Vienna, Austria
Michael Alexander
VTT, Kaitovayla 1, 90570, Oulu, Finland
Martti Forsell
Technische Universität Dresden, 01069, Dresden, Germany
Andreas Knüpfer
Institute for Computer Science, Technical University of Innsbruck, 6020, Innsbruck, Austria
Radu Prodan
Instituto Superior Técnico/INESC-ID., Rua Alves Redol 9, 1000-029, Lisbon, Portugal
Leonel Sousa
Jülich Supercomputing Centre, 52425, Jülich, Germany
Achim Streit

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Benner, P., Ezzatti, P., Quintana-Ortí, E.S., Remón, A. (2010). Using Hybrid CPU-GPU Platforms to Accelerate the Computation of the Matrix Sign Function. In: Lin, HX., et al. Euro-Par 2009 – Parallel Processing Workshops. Euro-Par 2009. Lecture Notes in Computer Science, vol 6043. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14122-5_17

Download citation

DOI: https://doi.org/10.1007/978-3-642-14122-5_17
Published: 17 June 2010
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14121-8
Online ISBN: 978-3-642-14122-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Using Hybrid CPU-GPU Platforms to Accelerate the Computation of the Matrix Sign Function

Abstract

Chapter PDF

Similar content being viewed by others

Computing the Sparse Matrix-Vector Product in High-Precision Arithmetic for GPU Architectures

Accelerating Numerical Dense Linear Algebra Calculations with GPUs

Revisiting the Gauss-Huard Algorithm for the Solution of Linear Systems on Graphics Accelerators

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Using Hybrid CPU-GPU Platforms to Accelerate the Computation of the Matrix Sign Function

Abstract

Chapter PDF

Similar content being viewed by others

Computing the Sparse Matrix-Vector Product in High-Precision Arithmetic for GPU Architectures

Accelerating Numerical Dense Linear Algebra Calculations with GPUs

Revisiting the Gauss-Huard Algorithm for the Solution of Linear Systems on Graphics Accelerators

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation