Reduced Complexity Many-Core: Timing Predictability Due to Message-Passing

Mische, Jörg; Frieb, Martin; Stegmeier, Alexander; Ungerer, Theo

doi:10.1007/978-3-319-54999-6_11

Jörg Mische¹⁸,
Martin Frieb¹⁸,
Alexander Stegmeier¹⁸ &
…
Theo Ungerer¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 10172))

Included in the following conference series:

International Conference on Architecture of Computing Systems

1074 Accesses
9 Citations

Abstract

The Reduced Complexity Many-Core architecture (RC/MC) targets to simplify timing analysis by increasing the predictability of all components. Since shared memory interference is a major source of pessimism in many-core systems, fine-grained message passing between small cores with private memories is used instead of a global shared memory.

In this paper, the RC/MC architecture is presented and evaluated by three models: a VHDL model that can be used to synthesise prototypes with up to \(6\times 6\) cores on an FPGA; a simulation model written in C that can be used for cycle-accurate simulation of more than 4096 cores; and a timing model for static timing analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Altera uses the term Adaptive Logic Module (ALM) for their elementary logic block, basically a lookup table with 6 inputs and 2 outputs (6-LUT). One ALM is equivalent to approximately 2.5 lookup tables with 4 inputs and 1 output (4-LUT).

References

Bailey, D.H., et al.: The NAS parallel benchmarks. Int. J. High Perform. Comput. Appl. 5(3), 63–73 (1991)
Article Google Scholar
Ballabriga, C., Cassé, H., Rochange, C., Sainrat, P.: OTAWA: an open toolbox for adaptive WCET analysis. In: Min, S.L., Pettit, R., Puschner, P., Ungerer, T. (eds.) SEUS 2010. LNCS, vol. 6399, pp. 35–46. Springer, Heidelberg (2010). doi:10.1007/978-3-642-16256-5_6
Chapter Google Scholar
Bell, S., et al.: Tile64-processor: a 64-core soc with mesh interconnect. In: International Solid-State Circuits Conference (ISSCC), pp. 88–598 (2008)
Google Scholar
Berg, C., Engblom, J., Wilhelm, R.: Requirements for and design of a processor with predictable timing. In: Perspectives Workshop: Design of Systems with Predictable Behaviour. No. 03471 in Dagstuhl Seminar Proceedings (2004)
Google Scholar
d’Ausbourg, B., Boyer, M., Noulard, E., Pagetti, C.: Deterministic execution on many-core platforms: application to the SCC. In: Many-core Applications Research Community Symposium (MARC), December 2011
Google Scholar
de Dinechin, B.D., et al.: A distributed run-time environment for the Kalray MPPA-256 integrated manycore processor. Procedia Comput. Sci. 18, 1654–1663 (2013)
Article Google Scholar
Dongarra, J., Heroux, M.A.: Toward a new metric for ranking high performance computing systems. Sandia Report, SAND2013-4744 312 (2013)
Google Scholar
Fan, D., et al.: Godson-T: an efficient many-core processor exploring thread-level parallelism. IEEE Micro 32(2), 38–47 (2012)
Article Google Scholar
Frieb, M., Stegmeier, A., Mische, J., Ungerer, T.: Employing MPI collectives for timing analysis on embedded multi-cores. In: 16th International Workshop on Worst-Case Execution Time Analysis (2016)
Google Scholar
Goossens, K., et al.: Virtual execution platforms for mixed-time-criticality systems: the CompSOC architecture and design flow. ACM SIGBED Rev. 10(3), 23–34 (2013)
Article Google Scholar
Gorlatch, S.: Send-receive considered harmful: myths and realities of message passing. ACM Trans. Program. Lang. Syst. (TOPLAS) 26(1), 47–56 (2004)
Article MathSciNet Google Scholar
Corporation, I.: Intel Xeon Phi Coprocessor System Software Developers Guide, 2.03 edn., November 2012
Google Scholar
Lee, Y., et al.: A 45 nm 1.3 GHz 16.7 double-precision GFLOPS/W RISC-V processor with vector accelerators. In: European Solid State Circuits Conference (ESSCIRC), pp. 199–202. IEEE (2014)
Google Scholar
Mattson, T.G., et al.: The 48-core SCC processor: the programmer’s view. In: International Conference for High Performance Computing, Networking, Storage and Analysis (SC), pp. 1–11 (2010)
Google Scholar
Message Passing Interface Forum. University of Tennesse: MPI: a Message-Passing Interface Standard. Version 3.1, June 2015
Google Scholar
Mische, J., Ungerer, T.: Guaranteed service independent of the task placement in NoCs with torus topology. In: Proceedings of the 22nd International Conference on Real-Time Networks and Systems, p. 151. ACM (2014)
Google Scholar
Mische, J., Metzlaff, S., Ungerer, T.: Distributed memory on chip - bringing together low power and real-time. In: Workshop on Reconciling Performance and Predictability (2014)
Google Scholar
Mische, J., Ungerer, T.: Low power flitwise routing in an unidirectional torus with minimal buffering. In: International Workshop on Network on Chip Architectures (NoCArc), pp. 63–68 (2012)
Google Scholar
Olofsson, A.: Epiphany-V: a 1024 processor 64-bit RISC system-on-chip. Technical report, Adapteva Inc. https://www.parallella.org/wp-content/uploads/2016/10/e5_1024core_soc.pdf
Reineke, J.: Caches in WCET analysis. Universität des Saarlandes, Saarbrücken, PhD Thesis (2008)
Google Scholar
Rochange, C., et al.: WCET analysis of a parallel 3D multigrid solver executed on the MERASA multi-core. In: OASIcs-OpenAccess Series in Informatics, vol. 15. Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik (2010)
Google Scholar
Schoeberl, M., et al.: T-CREST: time-predictable multi-core architecture for embedded systems. J. Syst. Archit. 61(9), 449–471 (2015)
Article Google Scholar
Stegmeier, A., Frieb, M., Mische, J., Ungerer, T.: WCTT bounds for MPI primitives in the PaterNoster NoC. In: 14th International Workshop on Real-Time Networks (2016)
Google Scholar
Ungerer, T., et al.: MERASA: multicore execution of hard real-time applications supporting analyzability. IEEE Micro 30(5), 66–75 (2010)
Article Google Scholar
Ungerer, T., et al.: Parallelizing industrial hard real-time applications for the parMERASA multicore. ACM Trans. Embed. Comput. Syst. (TECS) 15(3), 53 (2016)
MathSciNet Google Scholar
Waterman, A., Lee, Y., Patterson, D.A., Asanovi, K.: The RISC-V instruction set manual, volume I: user-level ISA, version 2.1. Technical report UCB/EECS-2016-118, EECS Department, University of California, Berkeley, May 2016. http://www2.eecs.berkeley.edu/Pubs/TechRpts/2016/EECS-2016-118.html
Wilhelm, R., et al.: Memory hierarchies, pipelines, and buses for future architectures in time-critical embedded systems. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 28(7), 966–978 (2009)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science University of Augsburg, 86159, Augsburg, Germany
Jörg Mische, Martin Frieb, Alexander Stegmeier & Theo Ungerer

Authors

Jörg Mische
View author publications
You can also search for this author in PubMed Google Scholar
Martin Frieb
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Stegmeier
View author publications
You can also search for this author in PubMed Google Scholar
Theo Ungerer
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jörg Mische .

Editor information

Editors and Affiliations

Vienna University of Technology, Vienna, Austria
Jens Knoop
Karlsruhe Institute of Technology, Karlsruhe, Germany
Wolfgang Karl
Lawrence Livermore National Laboratory, Livermore, USA
Martin Schulz
Kyushu University, Fukuoka, Japan
Koji Inoue
Otto-von-Guericke Universität Magdeburg, Magdeburg, Germany
Thilo Pionteck

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mische, J., Frieb, M., Stegmeier, A., Ungerer, T. (2017). Reduced Complexity Many-Core: Timing Predictability Due to Message-Passing. In: Knoop, J., Karl, W., Schulz, M., Inoue, K., Pionteck, T. (eds) Architecture of Computing Systems - ARCS 2017. ARCS 2017. Lecture Notes in Computer Science(), vol 10172. Springer, Cham. https://doi.org/10.1007/978-3-319-54999-6_11

Download citation

DOI: https://doi.org/10.1007/978-3-319-54999-6_11
Published: 04 March 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54998-9
Online ISBN: 978-3-319-54999-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics