Conclusions

H. M. Cruz, Eduardo; Diener, Matthias; O. A. Navaux, Philippe

doi:10.1007/978-3-319-91074-1_5

Eduardo H. M. Cruz¹⁷,
Matthias Diener¹⁸ &
Philippe O. A. Navaux¹⁹

Part of the book series: SpringerBriefs in Computer Science ((BRIEFSCOMPUTER))

739 Accesses

Abstract

Locality of memory accesses is one of the most important aspects to be considered when designing an architecture or developing software. With the introduction of multicore architectures, the memory hierarchy had to evolve to able to provide the necessary bandwidth to several cores operating in parallel. With this evolution, memory hierarchies started to present several caches in the same level, some levels shared by multiple cores, and other private to a core. Another important step was the incorporation of a memory controller inside the processor, in which multiprocessor systems presented NUMA characteristics. Due to the introduction of such technologies, the performance of memory hierarchies and the systems as a whole were even more dependent on memory locality. In this context, techniques such as sharing-aware thread and data mapping are able to increase memory locality and thereby performance. Our experiments indicate performance improvements of up to 200% in a scientific application.

You have full access to this open access chapter, Download chapter PDF

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Locality of memory accesses is one of the most important aspects to be considered when designing an architecture or developing software. With the introduction of multicore architectures, the memory hierarchy had to evolve to able to provide the necessary bandwidth to several cores operating in parallel. With this evolution, memory hierarchies started to present several caches in the same level, some levels shared by multiple cores, and other private to a core. Another important step was the incorporation of a memory controller inside the processor, in which multiprocessor systems presented NUMA characteristics. Due to the introduction of such technologies, the performance of memory hierarchies and the systems as a whole were even more dependent on memory locality. In this context, techniques such as sharing-aware thread and data mapping are able to increase memory locality and thereby performance. Our experiments indicate performance improvements of up to 200% in a scientific application.

Lots of related work on the area of sharing-aware mapping has been proposed, with a wide variety of characteristics and features. The majority of the proposals perform only static mapping, which are able to handle only applications whose memory access behavior keeps the same along different executions. Most work also only handles thread or data mapping alone, not both together. Most related work that is able to handle both thread and data mappings and operate online, during the execution of the application, have a high trade-off between accuracy and overhead. To achieve a higher accuracy, they have to increase the overhead of their memory access behavior detection as well. Some proposals are able to achieve high accuracy with low overhead, but require special hardware support.

Author information

Authors and Affiliations

Federal Institute of Parana (IFPR), Paranavai, Parana, Brazil
Eduardo H. M. Cruz
University of Illinois at Urbana-Champaign, Urbana, IL, USA
Matthias Diener
Informatics Institute, Federal University of Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
Philippe O. A. Navaux

Authors

Eduardo H. M. Cruz
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Diener
View author publications
You can also search for this author in PubMed Google Scholar
Philippe O. A. Navaux
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

H. M. Cruz, E., Diener, M., O. A. Navaux, P. (2018). Conclusions. In: Thread and Data Mapping for Multicore Systems. SpringerBriefs in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-319-91074-1_5

Download citation

DOI: https://doi.org/10.1007/978-3-319-91074-1_5
Published: 05 July 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91073-4
Online ISBN: 978-3-319-91074-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics