Dynamic Resource Allocation During Natural Disasters Using Multi-agent Environment

Vereshchaka, Alina; Dong, Wen

doi:10.1007/978-3-030-21741-9_13

Alina Vereshchaka¹⁸ &
Wen Dong¹⁸

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 11549))

Included in the following conference series:

International Conference on Social Computing, Behavioral-Cultural Modeling and Prediction and Behavior Representation in Modeling and Simulation

1677 Accesses
4 Citations

Abstract

Natural disasters are devastating for a country and effective allocation of critical resources can mitigate the impact. While traditional approaches usually have difficulties in making optimal critical resource allocation, in this paper we introduce a novel hierarchical multi-agent reinforcement learning framework to model optimal resource allocation for natural disasters in real-time. On the lower level a set of agents navigate with the continuous time environment using deep reinforcement algorithms. On the higher level, a lead agent takes care of the global decision-making. Our framework achieves more efficient resource allocation in response to dynamic events and is applicable to problems where disaster evolves alongside the response efforts, where delays in response can lead to increased disaster severity and thus a greater need for resources.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 49.99; Price excludes VAT (USA)

Softcover Book: USD 64.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

References

United Nations Office for Disaster Risk Reduction (UNISDR) and Centre for Research on the Epidemiology of Disasters (CRED), Economic losses, poverty and disasters, pp. 1998–2017 (2018)
Google Scholar
Frias-Martinez, E., Williamson, G., Frias-Martinez, V.: An agent-based model of epidemic spread using human mobility and social network information. In: 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third International Conference on Social Computing, pp. 57–64. IEEE (2011)
Google Scholar
Wesolowski, A., et al.: Quantifying the impact of human mobility on malaria. Science 338(6104), 267–270 (2012)
Article Google Scholar
Bengtsson, L., Lu, X., Thorson, A., Garfield, R., Von Schreeb, J.: Improved response to disasters and outbreaks by tracking population movements with mobile phone network data: a post-earthquake geospatial study in Haiti. PLoS Med. 8(8), e1001083 (2011)
Article Google Scholar
Abbasi, M.-A., Kumar, S., Filho, J.A.A., Liu, H.: Lessons learned in using social media for disaster relief - ASU Crisis Response Game. In: Yang, S.J., Greenberg, A.M., Endsley, M. (eds.) SBP 2012. LNCS, vol. 7227, pp. 282–289. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-29047-3_34
Chapter Google Scholar
Chandan, S., et al.: Modeling the interaction between emergency communications and behavior in the aftermath of a disaster. In: Greenberg, A.M., Kennedy, W.G., Bos, N.D. (eds.) SBP 2013. LNCS, vol. 7812, pp. 476–485. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37210-0_52
Chapter Google Scholar
Petrovic, N., Alderson, D.L., Carlson, J.M.: Dynamic resource allocation in disaster response: tradeoffs in wildfire suppression. PloS one 7(4), e33285 (2012)
Article Google Scholar
Estuar, M.R.J.E., Rodrigueza, R.C., Victorino, J.N.C., Sevilla, M.C.V., De Leon, M.M., Rosales, J.C.S.: Agent-Based modeling approach in understanding behavior during disasters: measuring response and rescue in eBayanihan disaster management platform. In: Lee, D., Lin, Y.-R., Osgood, N., Thomson, R. (eds.) SBP-BRiMS 2017. LNCS, vol. 10354, pp. 46–52. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-60240-0_5
Chapter Google Scholar
Albrecht, S.V., Stone, P.: Autonomous agents modelling other agents: a comprehensive survey and open problems. Artif. Intell. 258, 66–95 (2018)
Article MathSciNet Google Scholar
Yang, F., Liu, B., Dong, W.: Optimal control of complex systems through variational inference with a discrete event decision process. In: Proceedings of the 2019 International Conference on Autonomous Agents and Multiagent Systems, International Foundation for Autonomous Agents and Multiagent Systems (2019)
Google Scholar
Yang, F., Dong, W.: Integrating simulation and signal processing in tracking complex social systems. Comput. Math. Organ. Theor. 1–22 (2018). Special Issue: SBP-BRIMS2017
Google Scholar
Yang, F., Dong, W.: Integrating simulation and signal processing with stochastic social kinetic model. In: Lee, D., Lin, Y.-R., Osgood, N., Thomson, R. (eds.) SBP-BRiMS 2017. LNCS, vol. 10354, pp. 193–203. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-60240-0_23
Chapter Google Scholar
Peng, P., et al.: Multiagent bidirectionally-coordinated nets for learning to play starcraft combat games, arXiv preprint arXiv:1703.10069 (2017)
Matignon, L., Jeanpierre, L., Mouaddib, A.-I.: Coordinated multi-robot exploration under communication constraints using decentralized Markov decision processes. In: Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, pp. 2017–2023. AAAI Press (2012)
Google Scholar
Ye, H., Li, G.Y., Juang, B.-H.F.: Deep reinforcement learning based resource allocation for V2V communications, arXiv preprint arXiv:1805.07222 (2018)
Mirzaei, H., Sharon, G., Boyles, S., Givargis, T., Stone, P.: Enhanced delta-tolling: traffic optimization via policy gradient reinforcement learning. In: 2018 21st International Conference on Intelligent Transportation Systems (ITSC), pp. 47–52. IEEE (2018)
Google Scholar
Albrecht, S.V., Ramamoorthy, S.: A game-theoretic model and best-response learning method for ad hoc coordination in multiagent systems. In: Proceedings of the 2013 International Conference on Autonomous Agents and Multi-agent Systems, International Foundation for Autonomous Agents and Multiagent Systems, pp. 1155–1156 (2013)
Google Scholar
Peters, J., Schaal, S.: Policy gradient methods for robotics. In: 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 2219–2225. IEEE (2006)
Google Scholar
Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy optimization algorithms, arXiv preprint arXiv:1707.06347 (2017)

Download references

Acknowledgements

We would like to thank Nathan Margaglio for insightful discussions of the resource allocation problem and SBP-BRiMS reviewers for feedback that has improved this work.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, State University of New York at Buffalo, Buffalo, USA
Alina Vereshchaka & Wen Dong

Authors

Alina Vereshchaka
View author publications
You can also search for this author in PubMed Google Scholar
Wen Dong
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alina Vereshchaka .

Editor information

Editors and Affiliations

United States Military Academy, West Point, NY, USA
Robert Thomson
University of Michigan–Flint, Flint, MI, USA
Halil Bisgin
Bucknell University, Lewisburg, PA, USA
Christopher Dancy
The Ohio State University, Columbus, OH, USA
Ayaz Hyder

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vereshchaka, A., Dong, W. (2019). Dynamic Resource Allocation During Natural Disasters Using Multi-agent Environment. In: Thomson, R., Bisgin, H., Dancy, C., Hyder, A. (eds) Social, Cultural, and Behavioral Modeling. SBP-BRiMS 2019. Lecture Notes in Computer Science(), vol 11549. Springer, Cham. https://doi.org/10.1007/978-3-030-21741-9_13

Download citation

DOI: https://doi.org/10.1007/978-3-030-21741-9_13
Published: 05 June 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21740-2
Online ISBN: 978-3-030-21741-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics