
Distributed Real-Time Scheduling by Using Multi-agent Reinforcement Learning

  • Koji Iwamura
  • Nobuhiro Sugimura
Chapter

Abstract

Autonomous Distributed Manufacturing Systems (ADMS) have been proposed to realise flexible control structures for manufacturing systems. In previous research, a real-time scheduling method based on utility values was proposed and applied to ADMS. In this chapter, multi-agent reinforcement learning is newly proposed and implemented in the job agents and resource agents in order to improve their coordination processes. A status, an action and a reward are defined for the individual job agents and resource agents so that they can select suitable utility values based on the status of the ADMS. Case studies of real-time scheduling have been carried out to verify the effectiveness of the proposed methods.
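The abstract describes agents that learn which utility values to use from a status, an action and a reward. As a minimal sketch of this kind of agent-level learning, the following tabular Q-learning agent is illustrative only: the state encoding, action set and class name are hypothetical assumptions, not the chapter's actual formulation.

```python
import random

class SchedulingAgent:
    """Hypothetical sketch of a tabular Q-learning agent.

    In an ADMS-style setting, a 'state' might encode the agent's local
    status (e.g. queue length or machine load) and an 'action' might
    select which utility value the agent emphasises when coordinating.
    """

    def __init__(self, states, actions, alpha=0.1, gamma=0.9, epsilon=0.1):
        self.states, self.actions = states, actions
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        # Q-table initialised to zero for every (state, action) pair
        self.q = {(s, a): 0.0 for s in states for a in actions}

    def choose(self, state):
        # epsilon-greedy action selection
        if random.random() < self.epsilon:
            return random.choice(self.actions)
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state):
        # standard Q-learning temporal-difference update:
        # Q(s,a) += alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
        best_next = max(self.q[(next_state, a)] for a in self.actions)
        td_target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (td_target - self.q[(state, action)])
```

In a multi-agent setting each job agent and resource agent would hold its own Q-table and receive its own reward signal, so coordination emerges from the individually learned policies rather than a central scheduler.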

Keywords

Objective Function, Schedule Problem, Reinforcement Learning, Machine Operation, Machine Accuracy


Copyright information

© Springer-Verlag London Limited 2011

Authors and Affiliations

  1. Graduate School of Engineering, Osaka Prefecture University, Osaka, Japan
