Exploiting Heterogeneous Mobile Architectures Through a Unified Runtime Framework

Hsieh, Chenying; Sani, Ardalan Amiri; Dutt, Nikil

doi:10.1007/978-3-030-53273-4_15

Chenying Hsieh²⁰,
Ardalan Amiri Sani²⁰ &
Nikil Dutt²⁰

Part of the book series: IFIP Advances in Information and Communication Technology ((IFIPAICT,volume 586))

Included in the following conference series:

IFIP/IEEE International Conference on Very Large Scale Integration - System on a Chip

447 Accesses

Abstract

Modern mobile SoCs are typically integrated with multiple heterogeneous hardware accelerators such as GPU and DSP. Resource heavy applications such as object detection and image recognition based on convolutional neural networks are accelerated by offloading these computation-intensive algorithms to the accelerators to meet their stringent performance constraints. Conventionally there are device-specific runtime and programming languages supported for programming each accelerator, and these offloading tasks are typically pre-mapped to a specific compute unit at compile time, missing the opportunity to exploit other underutilized compute resources to gain better performance. To address this shortcoming, we present SURF: a Self-aware Unified Runtime Framework for Parallel Programs on Heterogeneous Mobile Architectures. SURF supports several heterogeneous parallel programming languages (including OpenMP and OpenCL), and enables dynamic task-mapping to heterogeneous resources based on runtime measurement and prediction. The measurement and monitoring loop enables self-aware adaptation of run-time mapping to exploit the best available resource dynamically. Our SURF framework has been implemented on a Qualcomm Snapdragon 835 development board and evaluated on a mix of image recognition (CNN), image filtering applications and synthetic benchmarks to demonstrate the versatility and efficacy of our unified runtime framework.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Hardcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
In this article we use the terms “compute unit” and “device” interchangeably.

References

Aldegheri, S., Manzato, S., Bombieri, N.: Enhancing performance of computer vision applications on low-power embedded systems through heterogeneous parallel programming. In: IFIP/IEEE International Conference on Very Large Scale Integration, VLSI-SoC 2018, Verona, Italy, October 8–10, 2018, pp. 119–124 (2018). https://doi.org/10.1109/VLSI-SoC.2018.8644937
Augonnet, C., Thibault, S., Namyst, R., Wacrenier, P.A.: StarPU: a unified platform for task scheduling on heterogeneous multicore architectures. Concurr. Comput.: Pract. Exp. 23(2), 187–198 (2011). https://doi.org/10.1002/cpe.1631
Article Google Scholar
Bolchini, C., Cherubin, S., Durelli, G.C., Libutti, S., Miele, A., Santambrogio, M.D.: A runtime controller for OpenCL applications on heterogeneous system architectures. SIGBED Rev. 15(1), 29–35 (2018). https://doi.org/10.1145/3199610.3199614
Article Google Scholar
Choi, H.J., Son, D.O., Kang, S.G., Kim, J.M., Lee, H.H., Kim, C.H.: An efficient scheduling scheme using estimated execution time for heterogeneous computing systems. J. Supercomput. 65(2), 886–902 (2013). https://doi.org/10.1007/s11227-013-0870-6
Article Google Scholar
Dutt, N.D., Jantsch, A., Sarma, S.: Toward smart embedded systems: a self-aware system-on-chip (SoC) perspective. ACM Trans. Embed. Comput. Syst. 15(2), 22:1–22:27 (2016). https://doi.org/10.1145/2872936
Article Google Scholar
Georgiev, P., Lane, N.D., Rachuri, K.K., Mascolo, C.: Leo: scheduling sensor inference algorithms across heterogeneous mobile processors and network resources. In: Proceedings of the 22nd Annual International Conference on Mobile Computing and Networking, MobiCom 2016, pp. 320–333. ACM, New York (2016). https://doi.org/10.1145/2973750.2973777
Grauer-Gray, S., Xu, L., Searles, R., Ayalasomayajula, S., Cavazos, J.: Auto-tuning a high-level language targeted to GPU codes. In: 2012 Innovative Parallel Computing (InPar), pp. 1–10 (May 2012)
Google Scholar
Gregg, C., Boyer, M., Hazelwood, K., Skadron, K.: Dynamic heterogeneous scheduling decisions using historical runtime data. In: Workshop on Applications for Multi-and Many-Core Processors (A4MMC) (2011)
Google Scholar
Gómez-Luna, J., et al.: Chai: collaborative heterogeneous applications for integrated-architectures. In: 2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) (2017)
Google Scholar
Hsieh, C., Sani, A.A., Dutt, N.: The case for exploiting underutilized resources in heterogeneous mobile architectures. In: 2019 Design, Automation Test in Europe Conference Exhibition (DATE) (March 2019)
Google Scholar
Hsieh, C., Sani, A.A., Dutt, N.: SURF: self-aware unified runtime framework for parallel programs on heterogeneous mobile architectures. In: 2019 IFIP/IEEE 27th International Conference on Very Large Scale Integration (VLSI-SoC), pp. 136–141 (October 2019)
Google Scholar
Jia, Y., et al.: Caffe: Convolutional architecture for fast feature embedding. In: Proceedings of the 22nd ACM International Conference on Multimedia, MM 2014, pp. 675–678. ACM, New York (2014). https://doi.org/10.1145/2647868.2654889
Kadjo, D., Ayoub, R., Kishinevsky, M., Gratz, P.V.: A control-theoretic approach for energy efficient CPU-GPU subsystem in mobile platforms. In: 2015 52nd ACM/EDAC/IEEE Design Automation Conference (DAC), pp. 1–6 (June 2015)
Google Scholar
Kumar, A.: Multiprocessing with the completely fair scheduler (2008)
Google Scholar
Qualcomm: Snapdrgon profiler (2013). https://developer.qualcomm.com/software/hexagon-dsp-sdk
Qualcomm: Hexagon DSP SDK (2017). https://developer.qualcomm.com/software/snapdragon-profiler
Topcuoglu, H., Hariri, S., Wu, M.-Y.: Performance-effective and low-complexity task scheduling for heterogeneous computing. IEEE Trans. Parallel Distrib. Syst. 13(3), 260–274 (2002)
Article Google Scholar
Wen, Y., Wang, Z., O’Boyle, M.F.P.: Smart multi-task scheduling for OpenCL programs on CPU/GPU heterogeneous platforms. In: 2014 21st International Conference on High Performance Computing (HiPC), pp. 1–10 (December 2014)
Google Scholar
Zhou, H., Liu, C.: Task mapping in heterogeneous embedded systems for fast completion time. In: 2014 International Conference on Embedded Software (EMSOFT), pp. 1–10 (October 2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of California, Irvine, CA, 92697, USA
Chenying Hsieh, Ardalan Amiri Sani & Nikil Dutt

Authors

Chenying Hsieh
View author publications
You can also search for this author in PubMed Google Scholar
Ardalan Amiri Sani
View author publications
You can also search for this author in PubMed Google Scholar
Nikil Dutt
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chenying Hsieh .

Editor information

Editors and Affiliations

Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
Carolina Metzler
University of Utah, Salt Lake City, UT, USA
Pierre-Emmanuel Gaillardon
EPFL, Lausanne, Switzerland
Giovanni De Micheli
Pontificia Universidad Católica del Perú, Lima, Peru
Carlos Silva-Cardenas
Universidade Federal do Rio Grande do Sul, Porto Alegre, Brazil
Ricardo Reis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Hsieh, C., Sani, A.A., Dutt, N. (2020). Exploiting Heterogeneous Mobile Architectures Through a Unified Runtime Framework. In: Metzler, C., Gaillardon, PE., De Micheli, G., Silva-Cardenas, C., Reis, R. (eds) VLSI-SoC: New Technology Enabler. VLSI-SoC 2019. IFIP Advances in Information and Communication Technology, vol 586. Springer, Cham. https://doi.org/10.1007/978-3-030-53273-4_15

Download citation

DOI: https://doi.org/10.1007/978-3-030-53273-4_15
Published: 22 July 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-53272-7
Online ISBN: 978-3-030-53273-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Federation for Information Processing (opens in a new tab)