Abstract
In this paper, we describe the design and implementation of a data-movement and control substrate (DMCS) for network-based, homogeneous communication within a single multiprocessor. DMCS is an implementation of an API for communication and computation that has been proposed by the PORTS consortium. One of the goals of this consortium is to define an API that can support heterogeneous computing without undue performance penalties for homogeneous computing. Preliminary results in our implementation suggest that this is quite feasible. The DMCS implementation seeks to minimize the assumptions made about the homogeneous nature of its target architecture. Finally, we present some extensions to the API for PORTS that will improve the performance of sparse, adaptive and irregular type of numeric computations.
This work supported by the Cornell Theory Center which receives major funding from the National Science Foundation, IBM corporation, New York State and members of the its Corporate Research Institute.
Preview
Unable to display preview. Download preview PDF.
References
Thorsten von Eicken, Davin E. Culler, Seth Cooper Goldstein, and Klaus Erik Schauser, Active Messages: a mechanism for integrated communication and computation Proceedings of the 19th International Symposium on Computer Architecture, ACM Press, May 1992.
Matthew Haines, David Cronk, and Piyush Mehrotra, On the design of Chant: A talking threads package, NASA CR-194903 ICASE Report No. 94-25, Institute for Computer Applications in Science and Engineering Mail Stop 132C, NASA Langley Research Center Hampton, VA 23681-0001, April 1994.
R.S. Nikhil, Cid: A Parallel, “Shared-Memory” C for Distributed Memory Machines. In Lecture Notes in Computer Science, vol 892.
Christopher F. Joerg. The Cilk system for Parallel Multithreaded Computing. Ph.D. Thesis, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, January, 1996.
L.V. Kale and M. Bhandarkar and N. Jagathesan and S. Krishnan and J. Yelon, CONVERSE: An Interoperability Framework for Parallel Programming, Parallel Programming Laboratory Report #95-2, Dept. of Computer Science, University of Illinois, March 1995
Nikos Chrisochoides and Nikos Pitsianis, FFT Sensitive Messages, to appear as Cornell Theory Center Technical Report, 1996.
Nikos Chrisochoides and Juan Miguel del Rosario, A Remote Service Protocol for Dynamic Load Balancing of Multithreaded Parallel Computations. Poster presentation in Frontiers'95.
MPI Forum, Message-Passing Interface Standard, April 15, 1994.
Runtime Support for Portable Distributed Data Structures C.-P. Wen, S. Chakrabarti, E. Deprit, Chih-Po Wen, A. Krishnamurthy, and K. Yelick. Workshop on Languages, Compilers, and Runtime Systems for Scalable Computers, May 1995.
N. Sundaresan and L. Lee, An object-oriented thread model for parallel numerical applications. Proceedings of the 2n Annual Object-Oriented Numerics Conference — OONSKI 94, Sunriver, Oregon, pp 291–308, April 24–27 1994.
I. Foster, Carl Kesselman, Steve Tuecke, Portable Mechanisms for Multithreaded Distributed Computations Argonne National Laboratory, MCS-P494-0195.
Ian Foster, Carl Kesselman and Steven Tuecke, The NEXUS approach to integrating multithreading and communication, Argonne National Laboratory.
Ralph M. Butler, and Ewing L. Lusk, User's Guide to p4 Parallel Programming System Oct 1992, Mathematics and Computer Science division, Argonne National Laboratory.
Nikos Chrisochoides, Florian Sukup, Task parallel implementation of the Bowyer-Watson algorithm, CTC96TR235, Technical Report, Cornell Theory Center, 1996.
Portable Runtime System (PORTS) consortium, http://www.cs.uoregon.edu/research/paracomp/ports/
PORTS Level 0 Thread Modules from Argonne/CalTech, ftp://ftp.mcs.anl.gov/pub/ports/
A Proposal for PORTS Level 1 Communication Routines, http://www.cs.uoregon.edu/research/paracomp/ports
A. Belguelin, J. Dongarra, A. Geist, R. Manchek, S. Otto, and J. Walpore, PVM: Experiences, current status and future direction. Supercomputing'93 Proceedings, pp 765–6.
David Keppel, Tools and Techniques for Building Fast Portable Threads Package, UW-CSE-93-05-06, Technical Report, University of Washington at Seattle, 1993.
Data Parallel Programming in a Multithreaded Environment, (Need authors...)to appear i a Special Issue of Scientific Programming, 1996.
Chichao Chang, Grzegorz Czajkowski, Chris Hawblitzell and Thorsten von Eicken, Low-latency communication on the IBM risc system/6000 SP. To appear in Supercomputing '96.
David E. Culler, Andrea Dusseau, Seth Copen Goldstein, Arvind Krishnamurthy, Steven Lumetta, Thorsten von Eicken and Katherine Yelick. Parallel Programming in Split-C. Supercomputing'93.
Veena Avula. SplitThreads — Split-C threads. Masters thesis, Cornell University. 1994.
Portable Clock and Timer Module from Oregon, http://www.cs.uoregon.edu/research/paracomp/ports
Pete Beckman and Dennis Gannon, Tulip: Parallel Run-time Support System for pC++, http://www.extreme.indiana.edu.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1997 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chrisochoides, N., Kodukula, I., Pingali, K. (1997). Data movement and control substrate for parallel scientific computing. In: Panda, D.K., Stunkel, C.B. (eds) Communication and Architectural Support for Network-Based Parallel Computing. CANPC 1997. Lecture Notes in Computer Science, vol 1199. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-62573-9_19
Download citation
DOI: https://doi.org/10.1007/3-540-62573-9_19
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62573-5
Online ISBN: 978-3-540-68085-7
eBook Packages: Springer Book Archive