Abstract
This paper proposes a fully pipelined dual-mode double precision floating-point reduction circuit on the field programming gate arrays (FPGAs), which is capable of supporting one double-precision operation and two parallel single-precision operations. Through the combination of tree-traversal structure and striding mode structure, the reduction circuit can handle multiple data sets with arbitrary combination of different lengths without stall and buffer requirements, and generate in-order results. Experimental results show that the proposed reduction circuit can support the dual-mode double precision floating-point reduction at the cost of only 7% increment in the absolute latency for the double precision vector with the same length, compared with the previous single-mode double precision reduction circuits.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bowdler, H., Martin, R., Peters, G., Wilkinson, J.: Handbook Series Linear Algebra: Solution of Real and Complex Systems of Linear Equations. Numerisc He Math. 8, 217–234 (1966)
Wang, X., Leeser, M.: VFloat: A Variable Precision Fixed- and Floating-Point Library for Reconfigurable Hardware. ACM Transactions on Reconfigurable Technology and Systems 3(3) (2010)
Neelima, K., Bharathi, M.: An algorithm for FPGA based Implementation of Variable Precision MAC unit for High Performance Digital FIR Filters. International Journal of Engineering Research and Applications 2, 670–673 (2012)
Tan, D., Danysh, A., Liebelt, M.: Multiple-Precision Fixed-Point Vector Multiply-Accumulator Using Shared Segmentation. ARITH 16, 12–19 (2003)
MoMorris, G.R., Prasanna, V.K., Anderson, R.D.: A Hybrid Approach for Mapping Conjugate Gradient onto an FPGA-Augmented Reconfigurable Supercomputer. In: Proceedings of the 14th Annual IEEE Symposium on Field-Programmable Custom Computing Machines, FCCM 2006, pp. 3–12. IEEE Computer Society, Washington, DC (2006)
Zhuo, L., Morris, G.R., Prasanna, V.K.: Designing Scalable FPGA-Based Reduction Circuits Using Pipelined Floating-Point Cores. In: Proceedings of IEEE 19th International Parallel and Distributed Processing Symposiums (IPDPS 2005) (April 2005)
Zhuo, L., Morris, G.R., Prasanna, V.K.: High-Performance Reduction Circuits Using Deeply Pipelined Operators on FPGAs. IEEE Transactions on Parallel and Distributed Systems 18(10), 1377–1392 (2007)
Tai, Y.-G., Lo, C.-T.D., Psarris, K.: Accelerating Matrix Operations with Improved Deeply Pipelined Vector Reduction. IEEE Transactions on Parallel and Distributed Systems 23(2), 202–210 (2012)
Huang, M., Andrews, D.: Modular design of fully pipelined reduction circuits on fpgas. IEEE Transactions on Parallel and Distributed Systems (2012)
Nagar, K.K., Bakos, J.D.: A Sparse Matrix Personality for the Convey HC-1. In: Proceeding of IEEE 19th Symposium on Field Programmable Custom Computing Machines (FCCM 2011), pp. 1–8 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Guo, S., Dou, Y., Lei, Y. (2015). Design of Fully Pipelined Dual-Mode Double Precision Reduction Circuit on FPGAs. In: Xu, W., Xiao, L., Li, J., Zhang, C., Zhu, Z. (eds) Computer Engineering and Technology. NCCET 2014. Communications in Computer and Information Science, vol 491. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45815-0_12
Download citation
DOI: https://doi.org/10.1007/978-3-662-45815-0_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-45814-3
Online ISBN: 978-3-662-45815-0
eBook Packages: Computer ScienceComputer Science (R0)