A framework for analyzing locality and portability issues in parallel computing

Ranade, Abhiram

doi:10.1007/3-540-56731-3_18

Abhiram Ranade¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 678))

Included in the following conference series:

Heinz Nixdorf Symposium at the University of Paderborn

116 Accesses
4 Citations

Abstract

This work potentially affect two areas: interconnection network design, and parallel programming methodology.

A key issue in designing parallel computers is the balance between computing power and communication capacity. As we have observed, there exist several problems that are inherently nonlocal, and therefore require high communication capability for efficient implementation. We also listed several problems for which fast network implementations can be designed. Some of these problems, however, possess only limited locality, and thus require relatively powerful communication networks (e.g. Butterflies). To summarize, we cannot give a clear answer to the question of how powerful communication networks we must build; but as more results become known about locality of different problems and as we develop locality exploiting algorithms for more problems, we will have a more complete answer.

Our ideas provide a methodology for developing portable parallel programs. The first step given a problem is to determine its gross locality. This determines a native architecture for the problem. The next step is to design an algorithm on the native model that fully exploits locality. This algorithm can now be simulated on different architectures, and is guaranteed to have good efficiency.

Supported in part by NSF-DARPA grant # CCR-9005448 and Air Force Office of Scientific Research, Grant # F49620-90-C-0029.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

F. Abolhassan, R. Drefenstedt, J. Keller, W. Paul, and D. Scheerer. On the physical design of PRAMS. In J. Buchmann, H. Ganziger, and W. Paul, editors, Informatik-Festschrift zum 60. Geburstag von Gunter Hotz. Teubner Verlag, 1992.
Google Scholar
F. Abolhassan, J. Keller, and W. Paul. On the cost-effectiveness of PRAMS. In IEEE Symposium on Parallel and Distributed Processing, pages 2–9, December 1991.
Google Scholar
K. Abrahamson, N. Dadoun, D. Kirkpatrick, and T. Pryztycka. A simple parallel tree contraction algorithm. Technical Report 87-30, University of British Columbia, 1987.
Google Scholar
Alok Aggarwal, Ashok Chandra, and Marc Snir. Communication Complexity of PRAMS. Theoretical Computer Science, pages 3–28, March 1990.
Google Scholar
Robert Alverson, David Callahan, Daniel Cummings, et al. The TERA Computer System. In Proceedings of Supercomputing 90, pages pp1–6, 1990.
Google Scholar
M. Atallah and M. Goodrich. Efficient parallel solutions to some geometric problems. Journal of Parallel and Distributed Computing, 3:492–507, 1986.
Google Scholar
S. N. Bhatt, F. R. K. Chung, J. W. Hong, F. T. Leighton, and A. L. Rosenberg. Optimal Simulations by Butterfly Networks. In Proceedings of STOC 88, pages 192–204, 1988.
Google Scholar
S. N. Bhatt, F. R. K. Chung, F. T. Leighton, and A. L. Rosenberg. Optimal simulations of tree machines. In Proceedings of the IEEE Annual Symposium on The Foundations of Computer Science, pages 274–282, 1986.
Google Scholar
David Blackston and Abhiram Ranade. Snakesort: A family of optimal randomized sorting algorithms, 1993. manuscript.
Google Scholar
Joseph Cheriyan, Torben Hagerup, and Kurt Mehlhorn. Can maximum flow be computed in o(nm) time? Technical Report A 90/07, Universitat des Saarlandes, May 1990.
Google Scholar
R. Cole and U. Vishkin. Approximate and exact parallel scheduling with application to list, tree and graph problems. In Proceedings of the IEEE Annual Symposium on The Foundations of Computer Science, pages 478–491, 1986.
Google Scholar
D. Culler, R. Karp, D. Patterson, A. Sahay, K. Schauser, E. Santos, R. Subramonian, and T. Eicken. LogP: Towards a realistic model of Parallel Computation. In Principles and Practice of Parallel Programming, 1992. To appear.
Google Scholar
H. Gazit. An optimal randomized parallel algorithm for finding connected components in a graph. In Proceedings of the IEEE Annual Symposium on The Foundations of Computer Science, pages 492–501, 1986.
Google Scholar
Joseph Ja'Ja'. The VLSI Complexity of Selected Graph Problems. Journal of the ACM, 31:377–391, April 1984.
Google Scholar
R. Koch, T. Leighton, B. Maggs, S. Rao, and A. Rosenberg. Work-preserving emulations of fixed-connection networks. In Proceedings of the ACM Annual Symposium on Theory of Computing, pages 227–240, May 1989.
Google Scholar
Ernst Mayr, 1992. Personal Communication.
Google Scholar
Abhiram G. Ranade. Optimal speedup for backtrack search on a butterfly network. In Proceedings of the ACM Symposium on Parallel Algorithms and Architectures, pages 40–48, July 1991.
Google Scholar
Abhiram G. Ranade. Communication efficient algorithms for some geometric problems. In preparation., 1992.
Google Scholar
Abhiram G. Ranade. Maintaining dynamic ordered sets on processor networks. In Proceedings of the ACM Symposium on Parallel Algorithms and Architectures, pages 127–137, June–July 1992.
Google Scholar
Abhiram G. Ranade, Sandeep N. Bhatt, and S. Lennart Johnsson. The Fluent Abstract Machine. In Proceedings of the Fifth MIT Conference on Advanced Research in VLSI, pages 71–94, March 1988. Also available as Yale Univ. Comp. Sc. TR-573.
Google Scholar
L. G. Valiant. A Bridging Model for Parallel Computation. Communications of the ACM, 33(8):103–111, August 1990.
Google Scholar

Download references

Author information

Authors and Affiliations

Computer Science Division, University of California, 94720, Berkeley, CA
Abhiram Ranade

Authors

Abhiram Ranade
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

F. Meyer B. Monien A. L. Rosenberg

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ranade, A. (1993). A framework for analyzing locality and portability issues in parallel computing. In: Meyer, F., Monien, B., Rosenberg, A.L. (eds) Parallel Architectures and Their Efficient Use. Nixdorf 1992. Lecture Notes in Computer Science, vol 678. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-56731-3_18

Download citation

DOI: https://doi.org/10.1007/3-540-56731-3_18
Published: 28 May 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-56731-8
Online ISBN: 978-3-540-47637-5
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics