Abstract
Advances in process technology have enabled Field Programmable Gate Arrays (FPGAs) to grow in capacity and implement large heterogenous systems in a monolithic device. With the slowdown of Moore’s law, computer architects now consider domain-specific architectures as the only path left for major improvements in performance-cost-energy. FPGAs are highly adaptable devices, making them the prime candidates for a wide range of emerging domains, including compute-intensive machine learning. Upcoming programmable devices in 7 nm process technology offer a hybrid compute platform that tightly integrates traditional FPGA programmable fabric, multiple CPUs, and an array of reconfigurable vector processors. This platform combines bit-level programmable customization using FPGA-like architecture for ultimate domain optimization with traditional compute using CPUs. Moreover, highly parallelized byte-level processing using CGRA (Coarse Grained Reconfigurable Architecture) is integrated on the same platform. Looking forward, we anticipate that the rising cost of building monolithic devices will also set a disaggregation trend towards using multiple dies. Building systems in the package will provide another cost dimension for programmable devices, further expanding the possibility of new domain-specific architectures and tools. As a consequence of these hardware evolutions, the FPGA tools are also raising the abstraction of design entry with the goal of catering to software programmers and domain experts. We will end the chapter by discussing the automation required for the transition of traditional EDA-like tools to emerging domain-specific compilers for future programmable devices.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Steve Trimberger, Three ages of FPGAs: a retrospective on the first thirty years of FPGA technology, in Proceedings of the IEEE, vol. 103, issue 3 (2015 Mar)
J.W. Lockwood, N. Naufel, J.S. Turner, D.E. Taylor, Reprogrammable network packet processing on the field programmable port extender (FPX), in Proceedings of ISFPGA 2001, ACM, pp. 87–93
Xilinx white paper, Versal: The First Adaptive Compute Acceleration Platform (ACAP), WP505 (v1.0.1) (23 Sept 2019)
B. Gaide et. al., Xilinx Adaptive Compute Acceleration Platform: Versal™ Architecture, FPGA ’2019 (24–26 Feb 2019)
D. Lewis et al., The Stratix™ 10 Highly Pipelined FPGA Architecture (2016)
Xilinx advance product specification, Versal Architecture and Product Data Sheet: Overview, DS950 (v1.2) (3 July 2019)
N. Napre, J. Gray, Hoplite: building austere overlay NoCs for FPGAs, in International Conference on Field-Programmable Logic and Applications (2015)
P. Maidee, A. Kaviani, K. Zeng, LinkBlaze: efficient global data movement for FPGAs, in IEEE reconfig (2017)
M.S. Abdelfattah, V. Betz, LYNX: CAD for FPGA-based network-on-chip, in International Conference on Field-Programmable Logic and Applications (2016)
I. Swarbrick et.al., Network-on-chip programmable platform in Versal™ ACAP architecture, in ACM FPGA ’2019 (Feb 2019)
Xilinx white paper, “Xilinx AI Engines and Their Applications,” WP506 (v1.0.2) (3 Oct 2018)
Xilinx white paper, “Xilinx Stacked Silicon Interconnect Technology Delivers Breakthrough FPGA Capacity, Bandwidth, and Power Efficiency,” WP380 (v1.2) (11 Dec 2012)
G. Singh et. al., Xilinx 16 nm datacenter device family within-package HBM and CCIX interconnect, in 2017 Hot Chips
M. Wissolik et al., Virtex UltraScale+ HBM FPGA: a revolutionary increase in memory performance, WP485 (v1.1) (15 July 2019)
Intel Product announcement, Intel® Agilex™ FPGA Advanced Information Brief. www.intel.com/content/www/us/en/products/programmable/fpga/agilex.html
A. Shokrollahi et al., 10.1 A pin-efficient 20.83 Gb/s/wire 0.94pJ/bit forwarded clock CNRZ 5-coded SerDes up to 12 mm for MCM packages in 28 nm CMOS, in 2016 IEEE International Solid State Circuits Conference (ISSCC), San Francisco, CA, USA (2016)
JEDEC, Multi-wire Multi-level I/O Standard (June 2016). http://www.jedec.org/standards-documents/results/jesd247
M. Erett et al., A 126mW 56 Gb/s NRZ wireline transceiver for synchronous short-reach applications in 16 nm FinFET, in IEEE ISSCC2018
U. Cummings, CTO of DCG Connectivity Group, Intel, “From Microns to Miles—The Broad Spectrum of Intel’s Interconnect Technology Strategy”, Hot interconnect (2019)
Heterogeneous Integration Roadmap, “Interconnects for 2D and 3D architectures,” HIR version 1.0, chapter 22, eps.ieee.org/hir (2019)
Vitis Unified Software Platform, https://www.xilinx.com/products/design-tools/vitis/vitis-platform.html, Xilinx
C. Lavin et al., RapidWright: enabling custom crafted implementations for FPGAs, in IEEE FCCM (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Kaviani, A. (2020). Enabling Domain-Specific Architectures with Programmable Devices. In: Murmann, B., Hoefflinger, B. (eds) NANO-CHIPS 2030. The Frontiers Collection. Springer, Cham. https://doi.org/10.1007/978-3-030-18338-7_13
Download citation
DOI: https://doi.org/10.1007/978-3-030-18338-7_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-18337-0
Online ISBN: 978-3-030-18338-7
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)