Skip to main content

GPU vs FPGA: A Comparative Analysis for Non-standard Precision

  • Conference paper
Reconfigurable Computing: Architectures, Tools, and Applications (ARC 2014)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8405))

Included in the following conference series:

Abstract

FPGAs and GPUs are increasingly used in a range of high performance computing applications. When implementing numerical algorithms on either platform, we can choose to represent operands with different levels of accuracy. A trade-off exists between the numerical accuracy of arithmetic operators and the resources needed to implement them. Where algorithmic requirements for numerical stability are captured in a design description, this trade-off can be exploited to optimize performance by using high-accuracy operators only where they are most required. Support for half and double-double floating point representations allows additional flexibility to achieve this. The aim of this work is to study the language and hardware support, and the achievable peak performance for non-standard precisions on a GPU and an FPGA. A compute intensive program, matrix-matrix multiply, is selected as a benchmark and implemented for various different matrix sizes. The results show that for large-enough matrices, GPUs out-perform FPGA-based implementations but for some smaller matrix sizes, specialized FPGA floating-point operators for half and double-double precision can deliver higher throughput than implementation on a GPU.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Dennard, R., Gaensslen, F., Rideout, V., Bassous, E., LeBlanc, A.: Design of Ion-Implanted MOSFET’s with Very Small Physical Dimensions. IEEE Journal of Solid-State Circuits 9(5), 256–268 (1974)

    Article  Google Scholar 

  2. NVIDIA Corporation, Santa Clara, U.: Tesla C1060 Computing Processor Board (January 2010)

    Google Scholar 

  3. Xilinx Corporation: Virtex-6 Family Overview. Technical Report DS150 (January 2012)

    Google Scholar 

  4. Xilinx Corporation: LogiCORE Floating-Point Operator v5.0. (2011)

    Google Scholar 

  5. De Dinechin, F., Pasca, B.: Designing Custom Arithmetic Data Paths with FloPoCo. IEEE Design & Test of Computers 28(4), 18–27 (2011)

    Article  Google Scholar 

  6. Volkov, V., Demmel, J.W.: Benchmarking GPUs to tune Dense Linear Algebra. In: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, p. 31. IEEE Press (2008)

    Google Scholar 

  7. NVIDIA Corporation: CUBLAS library v5.5. Technical report (2013)

    Google Scholar 

  8. NVIDIA Corporation: CUDA library documentation 4.1, http://developer.download.nvidia.com/compute/cuda/4_1/rel/toolkit/docs/online

  9. Thall, A.: Extended-Precision Floating-Point Numbers for GPU Computation. In: ACM SIGGRAPH 2006 Research posters, p. 52. ACM (2006)

    Google Scholar 

  10. Lu, M., He, B., Luo, Q.: Supporting Extended Precision on Graphics Processors. In: Proceedings of the Sixth International Workshop on Data Management on New Hardware, pp. 19–26. ACM (2010)

    Google Scholar 

  11. Minhas, U.: GPU vs FPGA: A Comparative Performance Analysis for Non-Standard Precision. Master’s thesis, Imperial College London (2013)

    Google Scholar 

  12. Whaley, R.C., Petitet, A., Dongarra, J.J.: Automated Empirical Optimizations of Software and the ATLAS project. Parallel Computing 27(12), 3–35 (2001)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Minhas, U.I., Bayliss, S., Constantinides, G.A. (2014). GPU vs FPGA: A Comparative Analysis for Non-standard Precision. In: Goehringer, D., Santambrogio, M.D., Cardoso, J.M.P., Bertels, K. (eds) Reconfigurable Computing: Architectures, Tools, and Applications. ARC 2014. Lecture Notes in Computer Science, vol 8405. Springer, Cham. https://doi.org/10.1007/978-3-319-05960-0_32

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05960-0_32

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05959-4

  • Online ISBN: 978-3-319-05960-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics