Abstract
We show in this paper the memory management issues raised by a parallel irregular and dynamic hierarchical application, which constantly allocates and deallocates data over an extremely large virtual address space. First, we show that if memory caches data locality is necessary, a lack of virtual pages locality may greatly affect the obtained performance. Second, fragmentation and contention problems associated with the required parallel dynamic memory allocation are presented. We propose practical solutions and discuss experimentation results obtained on a cache-coherent non uniform memory access (ccNUMA) distributed shared memory SGI Origin2000 machine.
Institut National Polytechnique de Lorraine.
INRIA Lorraine.
UMR7503, a joint research laboratory between CNRS, Institut National Polytechnique de Lorraine, INRIA, Université Henri Poincaré and Université Nancy 2.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Xavier Cavin, Laurent Alonso, and Jean-Claude Paul. Experimentation of Data Locality Performance for a Parallel Hierarchical Algorithm on the Origin2000. In Fourth European CRAY-SGI MPP Workshop, Garching/Munich, Germany, September 1998.
Xavier Cavin. Load Balancing Analysis of a Parallel Hierarchical Algorithm on the Origin2000. In Fifth European SGI/Gray MPP Workshop, Bologna, Italy, September 1999.
David Cortesi. Origin2000 (TM) and Onyx2 (TM) Performance Tuning and Optimization Guide. Tech Pubs Library guide Number 007-3430-002, Silicon Graphics, Inc., 1998.
Pat Hanrahan, David Salzman, and Larry Aupperle. A Rapid Hierarchical Radiosity Algorithm. In Computer Graphics (ACM SIGGRAPH’ 91 Proceedings), volume 25, pages 197–206, July 1991.
James T. Kajiya. The Rendering Equation. In Computer Graphics (ACM SIGGRAPH’ 86 Proceedings), volume 20, pages 143–150, August 1986.
James Laudon and Daniel Lenoski. The SGI Origin: A ccNUMA Highly Scalable Server. In Proceedings of the 24th Annual International Symposium on Computer Architecture, pages 241–251, Denver, June 1997. ACM Press.
Jaswinder Pal Singh, John L. Hennessy, and Annoop Gupta. Implications of Hierarchical N-body Methods for Multiprocessor Architectures. ACM Transactions on Computer Systems, 13(2): 141–202, May 1995.
Paul R. Wilson, Mark S. Johnstone, Michael Neely, and David Boles. Dynamic storage allocation: A survey and critical review. In Proceedings of International Workshop on Memory Management, volume 986 of Lecture Notes in Computer Science. Springer-Verlag, September 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cavin, X., Alonso, L. (2000). Parallel Management of Large Dynamic Shared Memory Space: A Hierarchical FEM Application. In: Rolim, J. (eds) Parallel and Distributed Processing. IPDPS 2000. Lecture Notes in Computer Science, vol 1800. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45591-4_57
Download citation
DOI: https://doi.org/10.1007/3-540-45591-4_57
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67442-9
Online ISBN: 978-3-540-45591-2
eBook Packages: Springer Book Archive