Abstract
Main limitations of distributed memory machines involving thousands of processors, deal with network latencies for remote data and/or programs accesses. In this paper we present multithreading techniques for parallel graph reduction model that can tolerate latencies of thousands of cycles by dynamically creating a set of threads. Efficiency is achieved by introducing fast context switch, non preemptive threads and comparative long run-lenght threads (thousand of cycles). Conventional multithreaded techniques deals with statically defined threads (at compile time) and dynamically or statically scheduling of the work to be performed. Parallel graph reduction is an attractive model because of its simplicity and inherently distributed nature: parallelism is introduced by parallel evaluation of function parameters corresponding to dynamically created set of threads. The single assignment feature and the absence of side effect, since the internal representation of programs remains purely functional, overcomes the difficulties of synchronized accesses to shared data and scheduling of parallel activities.
Preview
Unable to display preview. Download preview PDF.
References
E. COUSIN, C. COUSTET, MaRS_Lisp: User Manual V1.0, Document interne ONERA-CERT-DERI, June 1991.
A. CONTESSA, E. COUSIN, C. COUSTET, M. CUBERO-CASTAN, G. DURRIEU, B. LECUSSAN, M. LEMAITRE, P. Ng, MaRS, a combinator graph reduction multi-processor, ONERA/CERT/DERI, Parallel Architectures and Languages Europe, Eindhoven, The Netherlands, Proceedings, VOL. 1, June 1989.
BENJAMIN F. GOLDBERG, Multiprocessor Execution of Functional Programs, Research report yaleu/dcs/rr-618, Department of Computer Science, Yale University, April 1988.
ROBERT H. HALSTEAD, New Ideas in Parallel Lisp: Language Design, Implementation, and Programming Tools, In T. Ito and R. H. Halstead, editors, Parallel Lisp: Language and systems, volume 441 of Lecture Notes in Computer Science, pages 2–57, Sendai, Japan, June 5–8, 1988, Springer, Berlin.
K. HAMMOND, S. PEYTON JONES, Profiling Scheduling strategies on the GRIP parallel reducer, Proc 4th Intl Workshop on Parallel Implementation of Functional Languages; Kuchen H. andLoogen R. (Eds); RWTH, Aachen 1992.
S.L. PEYTON JONES, C. CLACK, and J. SALKID, High-Performance Parallel Graph Reduction, In E. Odijk, M. Rem, and J.-C. Syre, editors, PARLE '89, Parallel Architectures and Lanquages Europe, Volume I: Parallel Architectures, volume 365 of Lecture Notes in Computer Science, pages 193–206, Eindhoven, The Netherlands, June 12–16, 1989, Springer, Berlin.
S.L. PEYTON JONES, Implemented Lazy Functional Languages on Stock Hardware: The Spineless Tagless G-machine, Journal of Functional Programming, 2(2): 127–202, April 1992.
D. E. CULLER, S. GOLDSTEIN, K. ERIK SCHAUSER, AND T. VON EICKEN, TAM — A Compiler Controlled Threaded Abstract Machine, Journal of Parallel and Distributed Computing, pages 347–370, 1993.
R.S. NIKHIL, G.M. PAPADOPOULOS, ARVIND, T: A multithreaded massively parallel architecture ISCA'92 Pages 156–167, 1992
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Caudal, F., Lecussan, B. (1995). Design and evaluation of a multi-threaded architecture for parallel graph reduction. In: Malyshkin, V. (eds) Parallel Computing Technologies. PaCT 1995. Lecture Notes in Computer Science, vol 964. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-60222-4_129
Download citation
DOI: https://doi.org/10.1007/3-540-60222-4_129
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-60222-4
Online ISBN: 978-3-540-44754-2
eBook Packages: Springer Book Archive