GPU-based embedded edge server configuration and offloading for a neural network service

Kim, JooHwan; Ullah, Shan; Kim, Deok-Hwan

doi:10.1007/s11227-021-03623-9

Published: 25 January 2021

Volume 77, pages 8593–8621, (2021)

The Journal of Supercomputing

Abstract

Edge computing has recently been proposed as a new paradigm that compensates for the disadvantages of current cloud computing. In particular, edge computing is used for service applications that need low latency and work on local data. For this emerging technology, a neural network approach is required to run large-scale machine learning on edge servers. In this paper, we propose a pod allocation method that adds various graphics processing unit (GPU) resources to increase the efficiency of a Kubernetes-based edge server configuration built from GPU-based embedded boards and running a TensorFlow-based neural network service application. Experiments performed on the proposed edge server indicate the following: 1) The bandwidth available to service applications varies with time and data size, both in the local environment (20.4–42.4 Mbps) and in the Internet environment (6.31–25.5 Mbps). 2) When two neural network applications are run on an edge server consisting of Xavier, TX2 and Nano boards, the network times of the object detection application range from 112.2 ms (Xavier) to 515.8 ms (Nano), and the network times of the driver profiling application range from 321.8 ms (Xavier) to 495.7 ms (Nano). 3) The proposed pod allocation method performs better than the default pod allocation method: the number of allocatable pods on three worker nodes increases from five to seven. Compared with other papers, the proposed offloading shows similar or better response times in environments where multiple deep learning applications are implemented.
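As a rough illustration of what the reported bandwidth ranges could mean for offloading, the sketch below estimates how long it would take to transfer one payload at the lowest and highest bandwidths given in the abstract. The 500 kB payload size and the names `transfer_time_ms` and `estimate_ranges` are assumptions made for this example, not values or code from the paper, and the estimate ignores protocol overhead and round-trip latency.

```python
# Bandwidth ranges reported in the abstract, in megabits per second.
BANDWIDTH_MBPS = {
    "local": (20.4, 42.4),
    "internet": (6.31, 25.5),
}

# Hypothetical payload size (an assumption, not from the paper): 500 kB.
PAYLOAD_BYTES = 500_000


def transfer_time_ms(payload_bytes: int, bandwidth_mbps: float) -> float:
    """Return the time in milliseconds to send payload_bytes at bandwidth_mbps."""
    if bandwidth_mbps <= 0:
        raise ValueError("bandwidth must be positive")
    bits = payload_bytes * 8
    seconds = bits / (bandwidth_mbps * 1_000_000)
    return seconds * 1000


def estimate_ranges(payload_bytes: int = PAYLOAD_BYTES) -> dict:
    """Map each environment to (best_case_ms, worst_case_ms) for the payload."""
    result = {}
    for env, (low_mbps, high_mbps) in BANDWIDTH_MBPS.items():
        best = transfer_time_ms(payload_bytes, high_mbps)   # fastest link
        worst = transfer_time_ms(payload_bytes, low_mbps)   # slowest link
        result[env] = (best, worst)
    return result


if __name__ == "__main__":
    for env, (best, worst) in estimate_ranges().items():
        print(f"{env}: {best:.1f} ms to {worst:.1f} ms")
```

Under these assumptions, the transfer time alone can be of the same order as the network times quoted in the abstract, which is one reason the bandwidth variation matters when deciding whether to offload.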

Acknowledgments

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2018R1D1A1B07042602) and in part by the Institute for Information & Communications Technology Promotion (IITP) grant funded by the Korean government (MSIT) (No. 2019-0-00064, Intelligent Mobile Edge Cloud Solution for Connected Car, No. 2019-0-00240, Deep Partition-and-Merge: Merging and Splitting Deep Neural Networks on Smart Embedded Devices for Real Time Inference).

Author information

Authors and Affiliations

Department of Electronic Engineering, Inha University, Incheon, South Korea
JooHwan Kim, Shan Ullah & Deok-Hwan Kim

Corresponding author

Correspondence to Deok-Hwan Kim.

About this article

Cite this article

Kim, J., Ullah, S. & Kim, DH. GPU-based embedded edge server configuration and offloading for a neural network service. J Supercomput 77, 8593–8621 (2021). https://doi.org/10.1007/s11227-021-03623-9

Accepted: 05 January 2021
Published: 25 January 2021
Issue Date: August 2021
DOI: https://doi.org/10.1007/s11227-021-03623-9
