Skip to main content

Dynamic Resource Allocation During Natural Disasters Using Multi-agent Environment

  • Conference paper
  • First Online:
Social, Cultural, and Behavioral Modeling (SBP-BRiMS 2019)

Abstract

Natural disasters are devastating for a country and effective allocation of critical resources can mitigate the impact. While traditional approaches usually have difficulties in making optimal critical resource allocation, in this paper we introduce a novel hierarchical multi-agent reinforcement learning framework to model optimal resource allocation for natural disasters in real-time. On the lower level a set of agents navigate with the continuous time environment using deep reinforcement algorithms. On the higher level, a lead agent takes care of the global decision-making. Our framework achieves more efficient resource allocation in response to dynamic events and is applicable to problems where disaster evolves alongside the response efforts, where delays in response can lead to increased disaster severity and thus a greater need for resources.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 49.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.ncdc.noaa.gov/snow-and-ice/daily-snow/NY/snow-depth/20190131.

  2. 2.

    https://data.buffalony.gov/Transportation/Annual-Average-Daily-Traffic-Volume-Counts/y93c-u65y.

References

  1. United Nations Office for Disaster Risk Reduction (UNISDR) and Centre for Research on the Epidemiology of Disasters (CRED), Economic losses, poverty and disasters, pp. 1998–2017 (2018)

    Google Scholar 

  2. Frias-Martinez, E., Williamson, G., Frias-Martinez, V.: An agent-based model of epidemic spread using human mobility and social network information. In: 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third International Conference on Social Computing, pp. 57–64. IEEE (2011)

    Google Scholar 

  3. Wesolowski, A., et al.: Quantifying the impact of human mobility on malaria. Science 338(6104), 267–270 (2012)

    Article  Google Scholar 

  4. Bengtsson, L., Lu, X., Thorson, A., Garfield, R., Von Schreeb, J.: Improved response to disasters and outbreaks by tracking population movements with mobile phone network data: a post-earthquake geospatial study in Haiti. PLoS Med. 8(8), e1001083 (2011)

    Article  Google Scholar 

  5. Abbasi, M.-A., Kumar, S., Filho, J.A.A., Liu, H.: Lessons learned in using social media for disaster relief - ASU Crisis Response Game. In: Yang, S.J., Greenberg, A.M., Endsley, M. (eds.) SBP 2012. LNCS, vol. 7227, pp. 282–289. Springer, Heidelberg (2012). https://doi.org/10.1007/978-3-642-29047-3_34

    Chapter  Google Scholar 

  6. Chandan, S., et al.: Modeling the interaction between emergency communications and behavior in the aftermath of a disaster. In: Greenberg, A.M., Kennedy, W.G., Bos, N.D. (eds.) SBP 2013. LNCS, vol. 7812, pp. 476–485. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-37210-0_52

    Chapter  Google Scholar 

  7. Petrovic, N., Alderson, D.L., Carlson, J.M.: Dynamic resource allocation in disaster response: tradeoffs in wildfire suppression. PloS one 7(4), e33285 (2012)

    Article  Google Scholar 

  8. Estuar, M.R.J.E., Rodrigueza, R.C., Victorino, J.N.C., Sevilla, M.C.V., De Leon, M.M., Rosales, J.C.S.: Agent-Based modeling approach in understanding behavior during disasters: measuring response and rescue in eBayanihan disaster management platform. In: Lee, D., Lin, Y.-R., Osgood, N., Thomson, R. (eds.) SBP-BRiMS 2017. LNCS, vol. 10354, pp. 46–52. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-60240-0_5

    Chapter  Google Scholar 

  9. Albrecht, S.V., Stone, P.: Autonomous agents modelling other agents: a comprehensive survey and open problems. Artif. Intell. 258, 66–95 (2018)

    Article  MathSciNet  Google Scholar 

  10. Yang, F., Liu, B., Dong, W.: Optimal control of complex systems through variational inference with a discrete event decision process. In: Proceedings of the 2019 International Conference on Autonomous Agents and Multiagent Systems, International Foundation for Autonomous Agents and Multiagent Systems (2019)

    Google Scholar 

  11. Yang, F., Dong, W.: Integrating simulation and signal processing in tracking complex social systems. Comput. Math. Organ. Theor. 1–22 (2018). Special Issue: SBP-BRIMS2017

    Google Scholar 

  12. Yang, F., Dong, W.: Integrating simulation and signal processing with stochastic social kinetic model. In: Lee, D., Lin, Y.-R., Osgood, N., Thomson, R. (eds.) SBP-BRiMS 2017. LNCS, vol. 10354, pp. 193–203. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-60240-0_23

    Chapter  Google Scholar 

  13. Peng, P., et al.: Multiagent bidirectionally-coordinated nets for learning to play starcraft combat games, arXiv preprint arXiv:1703.10069 (2017)

  14. Matignon, L., Jeanpierre, L., Mouaddib, A.-I.: Coordinated multi-robot exploration under communication constraints using decentralized Markov decision processes. In: Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, pp. 2017–2023. AAAI Press (2012)

    Google Scholar 

  15. Ye, H., Li, G.Y., Juang, B.-H.F.: Deep reinforcement learning based resource allocation for V2V communications, arXiv preprint arXiv:1805.07222 (2018)

  16. Mirzaei, H., Sharon, G., Boyles, S., Givargis, T., Stone, P.: Enhanced delta-tolling: traffic optimization via policy gradient reinforcement learning. In: 2018 21st International Conference on Intelligent Transportation Systems (ITSC), pp. 47–52. IEEE (2018)

    Google Scholar 

  17. Albrecht, S.V., Ramamoorthy, S.: A game-theoretic model and best-response learning method for ad hoc coordination in multiagent systems. In: Proceedings of the 2013 International Conference on Autonomous Agents and Multi-agent Systems, International Foundation for Autonomous Agents and Multiagent Systems, pp. 1155–1156 (2013)

    Google Scholar 

  18. Peters, J., Schaal, S.: Policy gradient methods for robotics. In: 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 2219–2225. IEEE (2006)

    Google Scholar 

  19. Schulman, J., Wolski, F., Dhariwal, P., Radford, A., Klimov, O.: Proximal policy optimization algorithms, arXiv preprint arXiv:1707.06347 (2017)

Download references

Acknowledgements

We would like to thank Nathan Margaglio for insightful discussions of the resource allocation problem and SBP-BRiMS reviewers for feedback that has improved this work.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alina Vereshchaka .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Vereshchaka, A., Dong, W. (2019). Dynamic Resource Allocation During Natural Disasters Using Multi-agent Environment. In: Thomson, R., Bisgin, H., Dancy, C., Hyder, A. (eds) Social, Cultural, and Behavioral Modeling. SBP-BRiMS 2019. Lecture Notes in Computer Science(), vol 11549. Springer, Cham. https://doi.org/10.1007/978-3-030-21741-9_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-21741-9_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-21740-2

  • Online ISBN: 978-3-030-21741-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics