Precise Management of Scratchpad Memories for Localising Array Accesses in Scientific Codes

Größlinger, Armin

doi:10.1007/978-3-642-00722-4_17

Armin Größlinger¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5501))

Included in the following conference series:

International Conference on Compiler Construction

856 Accesses
11 Citations

Abstract

Unlike desktop and server CPUs, special-purpose processors found in embedded systems and on graphics cards often do not have a cache memory which is managed automatically by hardware logic. Instead, they offer a so-called scratchpad memory which is fast like a cache but, unlike a cache, has to be managed explicitly, i.e., the burden of its efficient use is imposed on the software. We present a method for computing precisely which memory cells are reused due to temporal locality of a certain class of codes, namely codes which can be modelled in the well-known polyhedron model. We present some examples demonstrating the effectiveness of our method for scientific codes.

Download to read the full chapter text

Chapter PDF

Pattern-Aware Staging for Hybrid Memory Systems

Astute Approach to Handling Memory Layouts of Regular Data Structures

OpenSHMEM Sets and Groups: An Approach to Worksharing and Memory Management

Keywords

References

NVIDIA CUDA. http://www.nvidia.com/cuda
Baskaran, M.M., Bondhugula, U., Krishnamoorthy, S., Ramanujam, J., Rountev, A., Sadayappan, P.: Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories. In: PPoPP 2008: Proc. of the 13th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp. 1–10. ACM Press, New York (2008)
Google Scholar
Bastoul, C.: Code generation in the polyhedral model is easier than you think. In: PACT 2004: Proc. of the 13th Int. Conf. on Parallel Architectures and Compilation Techniques, Washington, DC, USA, pp. 7–16. IEEE Computer Society Press, Los Alamitos (2004)
Chapter Google Scholar
Bastoul, C., Feautrier, P.: Improving data locality by chunking. In: Hedin, G. (ed.) CC 2003. LNCS, vol. 2622, pp. 320–335. Springer, Heidelberg (2003)
Chapter Google Scholar
Bondhugula, U., Baskaran, M.M., Krishnamoorthy, S., Ramanujam, J., Rountev, A., Sadayappan, P.: Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model. In: Hendren, L. (ed.) CC 2008. LNCS, vol. 4959, pp. 132–146. Springer, Heidelberg (2008)
Chapter Google Scholar
Chen, G., Kandemir, M.: Compiler-directed code restructuring for improving performance of MPSoCs. IEEE Transactions on Parallel and Distributed Systems 19(9), 1201–1214 (2008)
Article Google Scholar
Clauss, P., Meister, B.: Automatic memory layout transformations to optimize spatial locality in parameterized loop nests. In: 4th Annual Workshop on Interaction between Compilers and Computer Architectures, INTERACT-4, Toulouse, France (January 2000)
Google Scholar
Issenin, I., Brockmeyer, E., Miranda, M., Dutt, N.: Data reuse analysis technique for software-controlled memory hierarchies. In: DATE 2004: Proc. of the Conf. on Design, Automation and Test in Europe, Washington, DC, USA, pp. 202–207. IEEE Computer Society Press, Los Alamitos (2004)
Google Scholar
Kandemir, M., Choudhary, A.: Compiler-directed scratch pad memory hierarchy design and management. In: DAC 2002: Proc. of the 39th Conf. on Design Automation, pp. 628–633. ACM Press, New York (2002)
Google Scholar
Kandemir, M., Ramanujam, J., Choudhary, A.: A compiler algorithm for optimizing locality in loop nests. In: Proc. of the 11th Int. Conf. on Supercomputing (ICS), July 1997, pp. 269–276 (1997)
Google Scholar
Kandemir, M., Ramanujam, J., Irwin, J., Vijaykrishnan, N., Kadayif, I., Parikh, A.: Dynamic management of scratch-pad memory space. In: DAC 2001: Proc. of the 38th Conf. on Design Automation, pp. 690–695. ACM, New York (2001)
Google Scholar
Karp, R.M., Miller, R.E., Winograd, S.: The organization of computations for uniform recurrence equations. Journal of the ACM 14(3), 563–590 (1967)
Article MathSciNet MATH Google Scholar
Lamport, L.: The parallel execution of DO loops. Communications of the ACM 17(2), 83–93 (1974)
Article MathSciNet MATH Google Scholar
Lengauer, C.: Loop parallelization in the polytope model. In: Best, E. (ed.) CONCUR 1993. LNCS, vol. 715, pp. 398–416. Springer, Heidelberg (1993)
Google Scholar
Loechner, V., Meister, B., Clauss, P.: Precise data locality optimization of nested loops. J. Supercomput. 21(1), 37–76 (2002)
Article MATH Google Scholar
Panda, P.R., Dutt, N.D., Nicolau, A.: Efficient utilization of scratch-pad memory in embedded processor applications. In: EDTC 1997: Proc. of the 1997 European Conf. on Design and Test, Washington, DC, USA, p. 7. IEEE Computer Society Press, Los Alamitos (1997)
Google Scholar
Verdoolaege, S., Seghir, R., Beyls, K., Loechner, V., Bruynooghe, M.: Analytical computation of ehrhart polynomials: Enabling more compiler analyses and optimizations. In: Irwin, M.J., Zhao, W., Lavagno, L., Mahlke, S. (eds.) Proc. of the 2004 Int. Conf. on Compilers, Architecture, and Synthesis for Embedded Systems (CASES), Washington DC, USA, pp. 248–258. ACM Press, New York (2004)
Google Scholar
Verdoolaege, S., Seghir, R., Beyls, K., Loechner, V., Bruynooghe, M.: Counting integer points in parametric polytopes using Barvinok’s rational functions. Algorithmica 48(1), 37–66 (2007)
Article MathSciNet MATH Google Scholar
Wolf, M.E., Lam, M.S.: A data locality optimizing algorithm. In: PLDI 1991: Proc. of the ACM SIGPLAN 1991 Conf. on Programming Language Design and Implementation, pp. 30–44. ACM Press, New York (1991)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Informatics and Mathematics, University of Passau, Innstraße 33, 94032, Passau, Germany
Armin Größlinger

Authors

Armin Größlinger
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computing Laboratory, Oxford University, Wolfson Building, Parks Road, OX1 3QD, Oxford, UK
Oege de Moor
Department of Computer Science, Aarhus University, Aabogade 34, 8200, Aarhus N., Denmark
Michael I. Schwartzbach

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Größlinger, A. (2009). Precise Management of Scratchpad Memories for Localising Array Accesses in Scientific Codes. In: de Moor, O., Schwartzbach, M.I. (eds) Compiler Construction. CC 2009. Lecture Notes in Computer Science, vol 5501. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00722-4_17

Download citation

DOI: https://doi.org/10.1007/978-3-642-00722-4_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00721-7
Online ISBN: 978-3-642-00722-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Precise Management of Scratchpad Memories for Localising Array Accesses in Scientific Codes

Abstract

Chapter PDF

Similar content being viewed by others

Pattern-Aware Staging for Hybrid Memory Systems

Astute Approach to Handling Memory Layouts of Regular Data Structures

OpenSHMEM Sets and Groups: An Approach to Worksharing and Memory Management

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Precise Management of Scratchpad Memories for Localising Array Accesses in Scientific Codes

Abstract

Chapter PDF

Similar content being viewed by others

Pattern-Aware Staging for Hybrid Memory Systems

Astute Approach to Handling Memory Layouts of Regular Data Structures

OpenSHMEM Sets and Groups: An Approach to Worksharing and Memory Management

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation