Abstract
This paper presents a tightly-coupled FPGA cluster for custom computing of fluid dynamics simulation, and evaluates its performance with prototype implementation. For scalable and efficient computation with a lot of FPGA accelerators, we propose an accelerator-domain network (ADN) that brings low-latency and high-speed data transfer by directly connecting FPGAs. We describe implementation of a prototype cluster node with four FPGAs, and their on-chip framework for high-speed data streaming and computing. In performance evaluation, we demonstrate that our custom computing machine for fluid dynamics computation with the lattice-Boltzmann method (LBM) exploits both temporal and spatial parallelism, and scales the performance well with the number of FPGAs. As a result, we achieved 98.8 % of the peak performance of 73.0 GFlop/s with four FPGAs.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Takizawa, H., Yamada, N., Sakai, S., Kobayashi, H.: Radiative heat transfer simulation using programmable graphics hardware. In: Proceedings of the 5th IEEE/ACIS International Conference on Computer and Information Science, pp. 29–36 (2006)
TOP500. http://www.top500.org (2014)
Makino, J.: The GRAPE project. Comput. Sci. Eng. 8(1), 30–40 (2006)
Shaw David E., et al. Anton, a special-purpose machine for molecular dynamics simulation. Commun. ACM 51(7), 91–97 (2008)
Shirazi, N., Walters, A., Athanas, P.: Quantitative analysis of floating point arithmetic on FPGA based custom computing machines. In: Proceedings of the IEEE Symposium on FPGA’s for Custom Computing Machines, pp. 155–162 (1995)
deLorimier, M., DeHon, A.: Floating-point sparse matrix-vector multiply for FPGAs. In: Proceedings of the International Symposium on Field-Programmable Gate Arrays, pp. 75–85 (2005)
Zhuo, L., Prasanna, V.K.: Sparse matrix-vector multiplication on FPGAs. In: Proceedings of the International Symposium on Field-Programmable Gate Arrays, pp. 63–74 (2005)
Dou, Y., Vassiliadis, S., Kuzmanov, G.K., Gaydadjiev, G.N.: 64-bit floating-point FPGA matrix multiplication. In: Proceedings of the International Symposium on Field-Programmable Gate Arrays, pp. 86–95 (2005)
Sano, K., Pell, O., Luk, W., Yamamoto, S.: FPGA-based streaming computation for lattice boltzmann method. In: Proceedings of the International Conference on Field-Programmable Technology, pp. 233–236 (2007)
Murtaza, S., Hoekstra, A.G., Sloot P.M.: Compute bound and I/O bound cellular automata simulations on FPGA logic. ACM Trans. Reconfigurable Technol. Syst. 1(4) (2009) Article 23.
Sano, K., Luzhou, W., Hatsuda, Y., Iizuka, T., Yamamoto, S.: FPGA-array with bandwidth-reduction mechanism for scalable and power-efficient numerical simulations based on finite difference methods. ACM Trans. Reconfigurable Technol. Syst. 3(4) (2010)
Murtaza, S., Hoekstra, A.G., Sloot, P.M.: Cellular automata simulation on a FPGA cluster. Int. J. High Perform. Comput. Appl. 25(2), 193–204 (2011)
Kono, Y., Sano, K., Yamamoto, S.: Scalability analysis of tightly-coupled FPGA-cluster for lattice Boltzmann computation. In: Proceedings of the 22nd International Conference on Field-Programmable Logic and Applications, August 2012. Paper W3B1.
Sano, K., Hatsuda, Y., Yamamoto, S.: Multi-FPGA accelerator for scalable stencil computation with constant memory-bandwidth. IEEE Trans. Parallel Distrib. Syst. 25(3), 695–705 (2014)
Skordos, P.A.: Initial and boundary conditions for the lattice Boltzmann method. Phys. Rev. E 48(6), 4823–4842 (1993)
Sano, K., Nishikawa, T., Aoki, T., Yamamoto, S.: Evaluating power and energy consumption of FPGA-based custom computing machines for scientific floating-point computation. In: Proceedings of the International Conference on Field-Programmable Technology, pp. 301–304 (2008)
Terasic Technologies. http://www.terasic.com (2013)
Altera Corporation. http://www.altera.com/literature/ (2013)
Inamuro, T., Yoshino, M., Ogino, F.: A non-slip boundary condition for lattice Boltzmann simulations. Phys. Fluids 7(12), 2928–2930 (1995)
Noble, D.R., Chen, S., Georgiadis, J.G., Buckius, R.O.: A consistent hydrodynamic boundary condition for the lattice Boltzmann method. Phys. Fluids 7(1), 203–209 (1995)
Chen, S., Wang, Z., Shan, X., Doolen, G.D.: Lattice Boltzmann computational fluid dynamics in three dimensions. J. Stat. Phys. 68(3–4), 379–400 (1992)
Kobori, T., Maruyama, T., Hoshino, T.: A celluer automata system with FPGA. In: Proceedings of the IEEE Symposium on Field-Programmable Custom Computing Machines, pp. 120–129 (2001)
Sano, K., Kono, Y., Suzuki, H., Chiba, R., Ito, R., Koizumi, K., Yamamoto, S.: Efficient custom computing of fully-streamed lattice Boltzmann method on tightly-coupled FPGA cluster. ACM SIGARCH Computer Architecture News, 41(5), 47–52 (2013)
Acknowledgements
This research was partially supported by Grant-in-Aid for Scientific Research (B) No. 23300012 and Grant-in-Aid for Challenging Exploratory Research No. 23650021 from the Ministry of Education, Culture, Sports, Science and Technology, Japan. We thank the support by ALTERA university program.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Sano, K. (2015). FPGA-Based Scalable Custom Computing Accelerator for Computational Fluid Dynamics Based on Lattice BoltzmannMethod. In: Resch, M., Bez, W., Focht, E., Kobayashi, H., Patel, N. (eds) Sustained Simulation Performance 2014. Springer, Cham. https://doi.org/10.1007/978-3-319-10626-7_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-10626-7_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10625-0
Online ISBN: 978-3-319-10626-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)