An Evaluation of High Performance Fortran Compilers Using the HPFBench Benchmark Suite

  • Guohua Jin
  • Y. Charlie Hu
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1685)


The High Performance Fortran (HPF) benchmark suite HPFBench was designed for evaluating the HPF language and compilers on scalable architectures. The functionality of the benchmarks covers scientific software library functions and application kernels. In this paper, we report on an evaluation of two commercial HPF compilers, namely, xlhpf from IBM and pghpf from PGI, on an IBM SP2 using the linear algebra subset of the HPFBench benchmarks.

Our evaluation shows that, on a single processor, codes compiled with the two HPF compilers and their Fortran 90 companions incur significant overhead compared with the sequential versions of the codes compiled with xlf. The overhead comes mainly from the code segments that correspond to communications when running in parallel. In parallel runs, codes compiled with pghpf achieve speedups that range from slightly to significantly better than those of codes compiled with xlhpf, mainly because communications such as cshift, spread, sum, and gather/scatter perform better under pghpf.
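For readers unfamiliar with the communication operations named in the abstract, the following is a minimal serial sketch of their semantics in Python. It is not taken from the paper or from HPFBench. The function names mirror the Fortran intrinsics, but the signatures are simplified assumptions: `spread` only replicates along a new leading dimension, `gather` and `scatter` are hypothetical helpers standing in for vector-subscripted array assignments, and indices are 0-based rather than Fortran's default 1-based. On a distributed-memory machine each of these operations implies data movement between processors, which a serial sketch does not model.

```python
def cshift(a, shift):
    """Circular shift: result[i] = a[(i + shift) % n].

    A positive shift moves elements toward lower indices,
    matching the direction of Fortran's CSHIFT.
    """
    n = len(a)
    if n == 0:
        return []
    return [a[(i + shift) % n] for i in range(n)]


def spread(a, ncopies):
    """Replicate a 1-D list ncopies times along a new leading dimension
    (a simplified stand-in for SPREAD with DIM=1)."""
    return [list(a) for _ in range(ncopies)]


def gather(src, idx):
    """Gather: result[i] = src[idx[i]] (like A = B(idx) in Fortran)."""
    return [src[i] for i in idx]


def scatter(src, idx, size, fill=0):
    """Scatter: out[idx[i]] = src[i] (like A(idx) = B in Fortran).

    Positions not named in idx keep the fill value.
    """
    out = [fill] * size
    for value, i in zip(src, idx):
        out[i] = value
    return out


if __name__ == "__main__":
    v = [1, 2, 3, 4, 5]
    print(cshift(v, 1))                    # shifted one position left
    print(spread([1, 2], 3))               # three copies of the row
    print(sum(v))                          # global reduction
    print(gather([10, 20, 30, 40], [3, 0, 2]))
    print(scatter([7, 8, 9], [2, 0, 3], 4))
```

The built-in `sum` stands in for the SUM reduction. In an HPF program the same operations would apply to distributed arrays, so the compiler and runtime decide how much interprocessor traffic each one generates.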





Copyright information

© Springer-Verlag Berlin Heidelberg 1999

Authors and Affiliations

  • Guohua Jin (1)
  • Y. Charlie Hu (1)
  1. Department of Computer Science, Rice University, Houston, USA
