Abstract
This paper describes FT64 and Multi-FT64, single- and multi-coprocessor systems designed for high performance scientific computing with streams. We give a detailed case study of porting the Mersenne Prime Search problem to FT64 and Multi-FT64 systems. We discuss several special problems associated with streamizing, such as kernel processing granularity, stream organization and workload partitioning for a multi-processor, which are generally applicable to other scientific codes on FT64. Finally, we perform experiments with eight typical scientific applications on FT64. The results show that a 500MHz FT64 achieves over 50% of its peak performance and a 4.2x peak speedup over 1.6GHz Itanium2. An eight processor Multi-FT64 system achieves 6.8x peak speedup over a single FT64.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Yang, X.: A 64-bit stream processor for scientific applications. In: ISCA2007 (2007)
Dally, W.J., Hanrahan, P., Erez, M., Knight, T.J.: Merrimac: Supercomputing with Streams. In: SC 2003 (November 2003)
Erez, M., Ahn, J.H., Garg, A., Dallyet, W.J., et al.: Analysis and Performance Results of a Molecular Modeling Application on Merrimac. In: SC 2004, Pittsburg, Pennsylvania, USA (November 6-12,2004)
Intel Itantium Architecture Software Developer’s Manual, Volume 3: Instruction Set Reference. 2001.12
khailany, B., Dally, W.J., Kapasi, U.J., Mattson, P., et al.: Imagine: media processing with streams. IEEE micro 3/4 (2001)
Hoare, T.: Communicating sequential processes. Communications of the ACM 8, 21, 666–677 (1978)
Taylor, M.B., et al.: Evaluation of the Raw Microprocessor: An Exposed-Wire-Delay Architecture for ILP and Streams. In: ISCA 2004 (2004)
Kozyrakis, C.E., et al.: Scalable Processors in the Billion-Transistors Era: IRAM. IEEE Computer 30(9) (September 1997)
Sankaralingam, K., et al.: Exploiting ILP, TLP, and DLP with the Polymorphous TRIPS architecture. 30th Annual International Symposium on Computer Architecture (May 2003)
Caspiet, E., et al.: A Streaming MultiThreaded Model. The Third Workshop on Media and Stream Processors. In: Conjunction with MICRO34, Austin, Texas (2001)
Wen, M., Wu, N., Xun, C., Wu, W., Zhang, C.: Analysis and Performance Results of a Fluid Dynamics Application on MASA Stream Processor. In: ICIS 2006. Proceedings of International Conference on Information Systems, pp. 350–354 (2006)
Williams, S., Shalf, J., Oliker, L., Kamil, S., Husbands, P., Yelick, K.: The potential of the cell processor for scientific computing. In: CF 2006. Proceedings of the 3rd conference on Computing frontiers, pp. 9–20 (2006)
Guoxin, Y., et al.: Evaluating high performance computer for scientific computing (2003), www.ccw.com.cn
Tian, Z.: Numerical Simulations of Multiplex Unsteady Flow in Hypersonic Free Stream, Master Thesis, Dept. of Aerospace and Material Engineering, National University of Defense Technology (December 2003)
Cain, K.C., Torres, J.A., Williams, R.T.: RT STAP: Realtime space-time adaptive processing benchmark. Technical Report MTR 96B0000021, MITRE (February 1997)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wen, M., Wu, N., Zhang, C., Wu, W., Yang, Q., Xun, C. (2007). FT64: Scientific Computing with Streams. In: Aluru, S., Parashar, M., Badrinath, R., Prasanna, V.K. (eds) High Performance Computing – HiPC 2007. HiPC 2007. Lecture Notes in Computer Science, vol 4873. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77220-0_22
Download citation
DOI: https://doi.org/10.1007/978-3-540-77220-0_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77219-4
Online ISBN: 978-3-540-77220-0
eBook Packages: Computer ScienceComputer Science (R0)