ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation

Pendyala, Abhijeet; Dettmer, Justin; Glasmachers, Tobias; Atamna, Asma

doi:10.1007/978-3-031-53969-5_7

Abhijeet Pendyala¹³,
Justin Dettmer¹³,
Tobias Glasmachers¹³ &
…
Asma Atamna¹³

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14505))

Included in the following conference series:

International Conference on Machine Learning, Optimization, and Data Science

28 Accesses
1 Citations

Abstract

We present ContainerGym, a benchmark for reinforcement learning inspired by a real-world industrial resource allocation task. The proposed benchmark encodes a range of challenges commonly encountered in real-world sequential decision making problems, such as uncertainty. It can be configured to instantiate problems of varying degrees of difficulty, e.g., in terms of variable dimensionality. Our benchmark differs from other reinforcement learning benchmarks, including the ones aiming to encode real-world difficulties, in that it is directly derived from a real-world industrial problem, which underwent minimal simplification and streamlining. It is sufficiently versatile to evaluate reinforcement learning algorithms on any real-world problem that fits our resource allocation framework. We provide results of standard baseline methods. Going beyond the usual training reward curves, our results and the statistical tools used to interpret them allow to highlight interesting limitations of well-known deep reinforcement learning algorithms, namely PPO, TRPO and DQN.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 89.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Capacity Requirements Planning for Production Companies Using Deep Reinforcement Learning

Complementary Reward Function Based Learning Enhancement for Deep Reinforcement Learning

Optimization Planning Scheduling Problem in Industry 4.0 Using Deep Reinforcement Learning

Notes

1.
We use the terms “episode” and “rollout” interchangeably in this paper.
2.
The ContainerGym software is available on the following GitHub repository: https://github.com/Pendu/ContainerGym.
3.
Increasing the timestep length \(\delta \) should be done carefully. Otherwise, the problem could become trivial. In our case, we choose \(\delta \) such that it is smaller than the minimum time it takes a PU to process the volume equivalent to one product.

References

Bellemare, M.G., Naddaf, Y., Veness, J., Bowling, M.: The arcade learning environment: an evaluation platform for general agents. J. Artif. Intell. Res. 47, 253–279 (2013)
Article Google Scholar
Brockman, G., et al.: Openai gym (2016)
Google Scholar
Compare, M., Bellani, L., Cobelli, E., Zio, E.: Reinforcement learning-based flow management of gas turbine parts under stochastic failures. Int. J. Adv. Manuf. Technol. 99(9–12), 2981–2992 (2018)
Article Google Scholar
Degrave, J., et al.: Magnetic control of tokamak plasmas through deep reinforcement learning. Nature 602(7897), 414–419 (2022)
Article Google Scholar
Dulac-Arnold, G., et al.: An empirical investigation of the challenges of real-world reinforcement learning. CoRR arxiv:2003.11881 (2020)
Haarnoja, T., et al.: Soft actor-critic algorithms and applications. CoRR arxiv:1812.05905 (2018)
Hein, D., et al.: A benchmark environment motivated by industrial control problems. In: 2017 IEEE Symposium Series on Computational Intelligence (SSCI). IEEE (2017)
Google Scholar
Lazic, N., et al.: Data center cooling using model-predictive control. In: Proceedings of the Thirty-Second Conference on Neural Information Processing Systems (NeurIPS-2018), Montreal, QC, pp. 3818–3827 (2018)
Google Scholar
Mnih, V., et al.: Playing atari with deep reinforcement learning. CoRR arxiv:1312.5602 (2013)
Osiński, B., et al.: Simulation-based reinforcement learning for real-world autonomous driving. In: 2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 6411–6418 (2020)
Google Scholar
Raffin, A., Hill, A., Gleave, A., Kanervisto, A., Ernestus, M., Dormann, N.: Stable-baselines3: reliable reinforcement learning implementations. J. Mach. Learn. Res. 22(268), 1–8 (2021)
Google Scholar
Schulman, J., Levine, S., Abbeel, P., Jordan, M., Moritz, P.: Trust region policy optimization. In: Bach, F., Blei, D. (eds.) Proceedings of the 32nd International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 37, pp. 1889–1897. PMLR (2015)
Google Scholar
Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy optimization algorithms. CoRR arxiv:1707.06347 (2017)
Silver, D., et al.: Mastering the game of go with deep neural networks and tree search. Nature 529(7587), 484–489 (2016)
Article Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT press, Cambridge (2018)
Google Scholar
Todorov, E., Erez, T., Tassa, Y.: Mujoco: a physics engine for model-based control. In: 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 5026–5033 (2012)
Google Scholar
Vinyals, O., et al.: Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature 575(7782), 350–354 (2019)
Article Google Scholar

Download references

Acknowledgements

This work was funded by the German federal ministry of economic affairs and climate action through the “ecoKI” grant.

Author information

Authors and Affiliations

Ruhr-University Bochum, Bochum, Germany
Abhijeet Pendyala, Justin Dettmer, Tobias Glasmachers & Asma Atamna

Authors

Abhijeet Pendyala
View author publications
You can also search for this author in PubMed Google Scholar
Justin Dettmer
View author publications
You can also search for this author in PubMed Google Scholar
Tobias Glasmachers
View author publications
You can also search for this author in PubMed Google Scholar
Asma Atamna
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Asma Atamna .

Editor information

Editors and Affiliations

University of Catania, Catania, Catania, Italy
Giuseppe Nicosia
Newcastle University, Newcastle upon Tyne, UK
Varun Ojha
University of Oxford, Oxford, UK
Emanuele La Malfa
University of Cambridge, Cambridge, UK
Gabriele La Malfa
University of Florida, Gainesville, FL, USA
Panos M. Pardalos
Dana-Farber Cancer Institute, Boston, MA, USA
Renato Umeton

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Pendyala, A., Dettmer, J., Glasmachers, T., Atamna, A. (2024). ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation. In: Nicosia, G., Ojha, V., La Malfa, E., La Malfa, G., Pardalos, P.M., Umeton, R. (eds) Machine Learning, Optimization, and Data Science. LOD 2023. Lecture Notes in Computer Science, vol 14505. Springer, Cham. https://doi.org/10.1007/978-3-031-53969-5_7

Download citation

DOI: https://doi.org/10.1007/978-3-031-53969-5_7
Published: 16 February 2024
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-53968-8
Online ISBN: 978-3-031-53969-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation

Abstract

Access this chapter

Similar content being viewed by others

Capacity Requirements Planning for Production Companies Using Deep Reinforcement Learning

Complementary Reward Function Based Learning Enhancement for Deep Reinforcement Learning

Optimization Planning Scheduling Problem in Industry 4.0 Using Deep Reinforcement Learning

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation

Abstract

Access this chapter

Similar content being viewed by others

Capacity Requirements Planning for Production Companies Using Deep Reinforcement Learning

Complementary Reward Function Based Learning Enhancement for Deep Reinforcement Learning

Optimization Planning Scheduling Problem in Industry 4.0 Using Deep Reinforcement Learning

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation