Skip to main content

When a Robot Reaches Out for Human Help

  • Conference paper
  • First Online:
Advances in Artificial Intelligence - IBERAMIA 2018 (IBERAMIA 2018)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11238))

Included in the following conference series:

Abstract

In many realistic planning situations, any policy has a non-zero probability of reaching a dead-end. In such cases, a popular approach is to plan to maximize the probability of reaching the goal. While this strategy increases the robustness and expected autonomy of the robot, it considers that the robot gives up on the task whenever a dead-end is encountered. In this work, we consider planning for agents that pro-actively and autonomously resort to human help when an unavoidable dead-end is encountered (the so-called symbiotic agents). To this end, we develop a new class of Goal-Oriented Markov Decision Process that includes a set of human actions that ensures the existence of a proper policy, one that possibly resorts to human help. We discuss two different optimization criteria: minimizing the probability to use human help and minimizing the expected cumulative cost with a finite penalty for using human help for the first time. We show that for a large enough penalty both criteria are equivalent. We report on experiments with standard probabilistic planning domains for reasonably large problems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    We implicitly assume that every state has at least one applicable action.

References

  1. Armstrong Crews, N., Veloso, M.: Oracular partially observable markov decision processes: a very special case. In: Proceedings of the IEEE ICRA (2007)

    Google Scholar 

  2. Bellman, R.: Dynamic Programming. Princeton University Press, Princeton (1957)

    MATH  Google Scholar 

  3. Bertsekas, D.P., Tsitsiklis, J.N.: An analysis of stochastic shortest path problems. Math. Oper. Res. 16(3), 580–595 (1991). INFORMS

    Article  MathSciNet  Google Scholar 

  4. Bonet, B.: Labeled RTDP: improving the convergence of real-time dynamic programming. In: Proceedings ICAPS-03 (2003)

    Google Scholar 

  5. Bonet, B., Geffner, H.: Faster heuristic search algorithms for planning with uncertainty and full feedback. In: Proceedings of the IJCAI (2003)

    Google Scholar 

  6. Bonet, B., Geffner, H.: mGPT: a probabilistic planner based on heuristic search. J. Artif. Intell. Res. 24, 933–944 (2005)

    Article  Google Scholar 

  7. Göbelbecker, M., Keller, T., Eyerich, P., Brenner, M., Nebel, B.: Coming up with good excuses: what to do when no plan can be found. In: ICAPS (2010)

    Google Scholar 

  8. Helmert, M.: The fast downward planning system. J. Artif. Intell. Res. 26, 191–246 (2006)

    Article  Google Scholar 

  9. Karami, A.B., Jeanpierre, L., Mouaddib, A.I.: Partially observable markov decision process for managing robot collaboration with human. In: Proceedings of the 21st IEEE ICTAI (2009)

    Google Scholar 

  10. Kolobov, A., Daniel, M., Weld, S., Geffner, H.: Heuristic search for generalized stochastic shortest path MDPs. In: Proceedings of the ICAPS (2011)

    Google Scholar 

  11. Kolobov, A., Mausam, M., Weld, D.: A theory of goal-oriented MDPs with dead ends. In: Proceedings of the 28th Conference on UAI (2012)

    Google Scholar 

  12. Puterman, M.L.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley (2014)

    Google Scholar 

  13. Rosenthal, S., Biswas, J., Veloso, M.: An effective personal mobile robot agent through symbiotic human-robot interaction. In: Proceedings of the AAMAS (2010)

    Google Scholar 

  14. Rosenthal, S., Veloso, M., Dey, A.K.: Learning accuracy and availability of humans who help mobile robots. In: Proceedings of the AAAI (2011)

    Google Scholar 

  15. Schmidt-Rohr, S.R., Knoop, S., Lösch, M., Dillmann, R.: Reasoning for a multi-modal service robot considering uncertainty in human-robot interaction. In: Proceedings of the 3rd HRI (2008)

    Google Scholar 

  16. Steinmetz, M., Hoffmann, J., Buffet, O.: Revisiting goal probability analysis in probabilistic planning. In: Proceedings of the 26th ICAPS (2016)

    Google Scholar 

  17. Teichteil-Königsbuch, F.: Stochastic safest and shortest path problems. In: Proceedings of the NCAI (2012)

    Google Scholar 

  18. Trevizan, F., Teichteil-Königsbuch, F., Thiébaux, S.: Efficient solutions for stochastic shortest path problems with dead ends. In: Proceedings of 33rd Conference on UAI (2017)

    Google Scholar 

  19. Younes, H.L., Littman, M.L.: PPDDL1.0: an extension to PDDL for expressing planning domains with probabilistic effects. Technical report CMU-CS-04-162 (2004)

    Google Scholar 

Download references

Acknowledgments

Authors received financial support from CAPES, FAPESP (grants #2015/01587-0 and #2016/01055-1) and CNPq (grants #303920/2016-5 and #420669/2016-7).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ignasi Andrés .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Andrés, I., de Barros, L.N., Mauá, D.D., Simão, T.D. (2018). When a Robot Reaches Out for Human Help. In: Simari, G., Fermé, E., Gutiérrez Segura, F., Rodríguez Melquiades, J. (eds) Advances in Artificial Intelligence - IBERAMIA 2018. IBERAMIA 2018. Lecture Notes in Computer Science(), vol 11238. Springer, Cham. https://doi.org/10.1007/978-3-030-03928-8_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-03928-8_23

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-03927-1

  • Online ISBN: 978-3-030-03928-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics