Chisel Usecase: Designing General Matrix Multiply for FPGA

Ferres, Bruno; Muller, Olivier; Rousseau, Frédéric

doi:10.1007/978-3-030-44534-8_5

Bruno Ferres¹³,
Olivier Muller¹³ &
Frédéric Rousseau¹³

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 12083))

Included in the following conference series:

International Symposium on Applied Reconfigurable Computing

1616 Accesses
1 Citations

Abstract

To ease developers work in an industry where FPGA usage is constantly growing, we propose an alternative methodology for architecture design. Targeting FPGA boards, we aim at comparing implementations on multiple criteria. We implement it as a tool flow based on Chisel, taking advantage of high level functionalities to ease circuit design, evolution and reutilization, improving designers productivity.

We target a Xilinx VC709 board and propose a case study on General Matrix Multiply implementation using this flow, which demonstrates its usability with performances comparable to the state of the art, as well as the genericity one can benefit from when designing an application-specific accelerator. We show that we were able to generate, simulate and synthesize 80 different architectures in less than 24 h, allowing different trade-offs to be quickly and easily studied, from the most performant to the less costly, to easily comply with integration constraints.

Grenoble INP—Institute of Engineering Univ. Grenoble Alpes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Resource metric is defined as the maximum usage percentage for the 4 considered resources: LUTs, Flip Flops, BRAMs and DSPs.

References

Alon, E., Asanović, K., Bachrach, J., Nikolić, B.: Invited: Open-Source EDA Tools and IP, A View from the Trenches, p. 3 (2019)
Google Scholar
Bachrach, J., et al.: Chisel: constructing hardware in a Scala embedded language. In: Proceedings of the 49th Annual Design Automation Conference on - DAC 2012, San Francisco, California, p. 1216. ACM Press (2012)
Google Scholar
Caulfield, A.M., et al.: A Cloud-Scale Acceleration Architecture, p. 13 (2016)
Google Scholar
De Matteis, T., de Fine Licht, J., Hoefler, T.: FBLAS: streaming linear algebra on FPGA. arXiv:1907.07929 [cs], August 2019
Dongarra, J.J., Du Croz, J., Hammarling, S., Duff, I.S.: A set of level 3 basic linear algebra subprograms. ACM Trans. Math. Softw. 16(1), 1–17 (1990)
Article MathSciNet Google Scholar
Garg, R., Hendren, L.: A portable and high-performance general matrix-multiply (GEMM) library for GPUs and single-chip CPU/GPU systems. In: 2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Torino, Italy, pp. 672–680. IEEE, February 2014
Google Scholar
Ouyang, J., Lin, S., Qi, W., Wang, Y., Yu, B., Jiang, S.: SDA: software-defined accelerator for large-scale DNN systems. In: 2014 IEEE Hot Chips 26 Symposium (HCS), Cupertino, CA, USA, pp. 1–23. IEEE, August 2014
Google Scholar
Koenig, J., Biancolin, D., Bachrach, J., Asanovic, K.: A hardware accelerator for computing an exact dot product. In: 2017 IEEE 24th Symposium on Computer Arithmetic (ARITH), London, United Kingdom, pp. 114–121. IEEE, July 2017
Google Scholar
Pedram, A., Gerstlauer, A., van de Geijn, R.A.: A high-performance, low-power linear algebra core. In: ASAP 2011–22nd IEEE International Conference on Application-Specific Systems, Architectures and Processors, Santa Monica, CA, USA, pp. 35–42. IEEE, September 2011
Google Scholar
Underwood, K.D., Hemmert, K.S.: Chapter 31 - The implications of floating point for FPGAs. In: Hauck, S., Dehon, A. (eds.) Reconfigurable Computing, pp. 671–695. Systems on Silicon, Morgan Kaufmann, Burlington (2008)
Chapter Google Scholar
Zhao, Z., Hoe, J.C.: Using Vivado-HLS for structural design: a NoC case study. arXiv:1710.10290 [cs], October 2017

Download references

Author information

Authors and Affiliations

Univ. Grenoble Alpes, CNRS, Grenoble INP, TIMA, 38000, Grenoble, France
Bruno Ferres, Olivier Muller & Frédéric Rousseau

Authors

Bruno Ferres
View author publications
You can also search for this author in PubMed Google Scholar
Olivier Muller
View author publications
You can also search for this author in PubMed Google Scholar
Frédéric Rousseau
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Bruno Ferres .

Editor information

Editors and Affiliations

Technology and Information Systems, University of Castilla-La Mancha, Ciudad Real, Spain
Fernando Rincón
Technology and Information Systems, University of Castilla-La Mancha, Ciudad Real, Spain
Jesús Barba
Department of Electrical and Electronic Engineering, University of Hong Kong, Hong Kong, China
Hayden K. H. So
INESC-ID, Lisbon, Portugal
Pedro Diniz
Technology and Information Systems, University of Castilla-La Mancha, Ciudad Real, Spain
Julián Caba

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ferres, B., Muller, O., Rousseau, F. (2020). Chisel Usecase: Designing General Matrix Multiply for FPGA. In: Rincón, F., Barba, J., So, H., Diniz, P., Caba, J. (eds) Applied Reconfigurable Computing. Architectures, Tools, and Applications. ARC 2020. Lecture Notes in Computer Science(), vol 12083. Springer, Cham. https://doi.org/10.1007/978-3-030-44534-8_5

Download citation

DOI: https://doi.org/10.1007/978-3-030-44534-8_5
Published: 25 March 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-44533-1
Online ISBN: 978-3-030-44534-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics