Abstract
Routing jobs to parallel servers is a common and important task in today’s computer and communication systems. Because each routing decision affects jobs that arrive later, determining (near-)optimal decisions is non-trivial. In this paper, we apply reinforcement learning techniques to the job routing problem with heterogeneous servers and a general cost structure. We study the convergence of reinforcement learning to a near-optimal policy (which we can determine by other means), and compare its performance against heuristic policies such as Join-the-Shortest-Queue (JSQ) and Shortest-Expected-Delay (SED).
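The setup in the abstract can be made concrete with a minimal tabular Q-learning sketch for routing Poisson arrivals to heterogeneous exponential servers. The state encoding (truncated queue lengths), the holding cost, the discounting, and all parameter values below are our own illustrative choices, not the paper's exact method:

```python
import random

def simulate_q_routing(mu=(1.0, 0.5), lam=1.2, n_jobs=50_000,
                       cap=20, alpha=0.05, gamma=0.99, eps=0.1, seed=1):
    """Tabular Q-learning dispatcher for heterogeneous exponential servers.
    All parameters are illustrative, not taken from the paper."""
    random.seed(seed)
    K = len(mu)
    Q = {}          # Q[(state, action)] -> estimated discounted cost
    q = [0] * K     # current number of jobs at each server
    for _ in range(n_jobs):
        # Advance to the next arrival; let each server work meanwhile.
        dt = random.expovariate(lam)
        for i in range(K):
            budget = dt
            while q[i] > 0:
                service = random.expovariate(mu[i])
                if service > budget:
                    break
                budget -= service
                q[i] -= 1
        s = tuple(min(n, cap) for n in q)
        # Epsilon-greedy choice of the server for the arriving job.
        if random.random() < eps:
            a = random.randrange(K)
        else:
            a = min(range(K), key=lambda k: Q.get((s, k), 0.0))
        q[a] += 1
        cost = sum(q)   # immediate cost: jobs in system after admission
        s2 = tuple(min(n, cap) for n in q)
        target = cost + gamma * min(Q.get((s2, k), 0.0) for k in range(K))
        Q[(s, a)] = Q.get((s, a), 0.0) + alpha * (target - Q.get((s, a), 0.0))
    return Q
```

Note that resampling residual service times at each arrival epoch is exact here only because exponential service times are memoryless; a general cost structure would replace `sum(q)` with the corresponding state cost.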
Notes
- 1. RND (random) chooses the server independently at random according to probabilities \(p_k\); JSQ joins the queue with the fewest jobs; and SED joins the queue with the shortest expected response time, i.e., the admission cost to queue \(i\) is \((n_i+1)/\mu _i\).
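These three heuristics are simple index rules and can be sketched directly; the function names and argument layout below are our own, not from the paper:

```python
import random

def rnd(queues, rates, p):
    """RND: pick server k independently at random with probability p[k]."""
    return random.choices(range(len(queues)), weights=p)[0]

def jsq(queues, rates):
    """JSQ: join the queue with the fewest jobs."""
    return min(range(len(queues)), key=lambda i: queues[i])

def sed(queues, rates):
    """SED: join the queue minimizing the admission cost (n_i + 1) / mu_i,
    i.e. the shortest expected response time for the arriving job."""
    return min(range(len(queues)), key=lambda i: (queues[i] + 1) / rates[i])
```

For example, with queue lengths `[3, 1, 2]` and service rates `[1.0, 0.5, 2.0]`, JSQ picks server 1 (fewest jobs) while SED picks server 2, whose higher rate gives the smallest expected delay.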
Acknowledgements
This work was supported by the Academy of Finland in the FQ4BD project (grant no. 296206) and by the University of Iceland Research Fund in the RL-STAR project.
© 2018 Springer International Publishing AG, part of Springer Nature
Cite this paper
Samúelsson, S.G., Hyytiä, E. (2018). Applying Reinforcement Learning to Basic Routing Problem. In: Takahashi, Y., Phung-Duc, T., Wittevrongel, S., Yue, W. (eds) Queueing Theory and Network Applications. QTNA 2018. Lecture Notes in Computer Science(), vol 10932. Springer, Cham. https://doi.org/10.1007/978-3-319-93736-6_18
Print ISBN: 978-3-319-93735-9
Online ISBN: 978-3-319-93736-6