Partitioning Regular Domains on Modern Parallel Computers

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1573))

Abstract

It has become apparent in recent years that the performance of current high-performance computers, from powerful workstations to massively parallel processors, depends strongly on the behaviour of the memory hierarchy. The memory hierarchy affects not only computation time but also the time spent on communication. This research studies the impact of memory hierarchy usage on the partitioning of multidimensional regular domain problems. As an example, we use the numerical solution of a three-dimensional partial differential equation on a regular mesh by means of a multigrid-like iterative method. Experimental results contradict the traditional regular partitioning techniques on some present parallel computers such as the Cray T3E and the SGI Origin 2000: a linear decomposition is more efficient than a three-dimensional one because it better exploits spatial data locality. For similar reasons, overlapping computation with communication also increases execution time.
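
The trade-off between linear and three-dimensional decompositions can be illustrated with a small sketch. It is not the authors' experiment and measures no execution time. It only counts, for an interior subdomain, how many boundary points are exchanged and how long the shortest contiguous run in memory is. The assumptions here are row-major (C order) storage with z as the fastest-varying index, a one-cell-deep boundary layer, and a grid size divisible by the process counts. The function names `halo_faces` and `summarize` are hypothetical.

```python
def halo_faces(local_shape, split_axes):
    """List (points, contiguous_run) for each kind of face a subdomain exchanges.

    local_shape is (nx, ny, nz), stored in C order with z fastest (assumption).
    """
    nx, ny, nz = local_shape
    faces = []
    for axis in split_axes:
        if axis == 0:
            # Fixed x: a whole y-z plane, one contiguous block in memory.
            faces.append((ny * nz, ny * nz))
        elif axis == 1:
            # Fixed y: nx separate rows, each nz elements long.
            faces.append((nx * nz, nz))
        else:
            # Fixed z: every element is strided, so runs have length 1.
            faces.append((nx * ny, 1))
    return faces


def summarize(n, procs_per_axis):
    """Return (points exchanged, shortest contiguous run) for an n^3 grid."""
    local = tuple(n // p for p in procs_per_axis)
    split = [axis for axis, p in enumerate(procs_per_axis) if p > 1]
    faces = halo_faces(local, split)
    # An interior subdomain has two neighbours along each split axis.
    total_points = 2 * sum(points for points, _ in faces)
    shortest_run = min(run for _, run in faces)
    return total_points, shortest_run


if __name__ == "__main__":
    for name, procs in [("linear", (64, 1, 1)), ("3D", (4, 4, 4))]:
        points, run = summarize(128, procs)
        print(f"{name:>6}: {points} boundary points, shortest run {run}")
```

For a 128^3 grid on 64 processes, the three-dimensional decomposition exchanges far fewer points (6144 against 32768), but some of its faces are fully strided in memory, whereas every face of the linear decomposition is a single contiguous block. This is one way to picture the spatial locality argument in the abstract. The actual balance depends on the machine.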

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Prieto-Matías, M., Martín-Llorente, I., Tirado, F. (1999). Partitioning Regular Domains on Modern Parallel Computers. In: Hernández, V., Palma, J.M.L.M., Dongarra, J.J. (eds) Vector and Parallel Processing – VECPAR’98. VECPAR 1998. Lecture Notes in Computer Science, vol 1573. Springer, Berlin, Heidelberg. https://doi.org/10.1007/10703040_31

  • DOI: https://doi.org/10.1007/10703040_31

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66228-0

  • Online ISBN: 978-3-540-48516-2

  • eBook Packages: Springer Book Archive
