Abstract
In many realistic planning situations, any policy has a non-zero probability of reaching a dead end. In such cases, a popular approach is to plan so as to maximize the probability of reaching the goal. While this strategy increases the robustness and expected autonomy of the robot, it assumes that the robot gives up on the task whenever a dead end is encountered. In this work, we consider planning for agents that proactively and autonomously resort to human help when an unavoidable dead end is encountered (the so-called symbiotic agents). To this end, we develop a new class of Goal-Oriented Markov Decision Processes that includes a set of human actions guaranteeing the existence of a proper policy, one that may resort to human help. We discuss two different optimization criteria: minimizing the probability of using human help, and minimizing the expected cumulative cost with a finite penalty incurred the first time human help is used. We show that, for a large enough penalty, both criteria are equivalent. We report on experiments with standard probabilistic planning domains on reasonably large problems.
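The penalty-based criterion in the abstract can be illustrated with a minimal sketch (not the authors' implementation): value iteration on a tiny goal-oriented MDP where a dead-end state offers a hypothetical "help" action with finite penalty D, which guarantees a proper policy exists. The state names, costs, and penalty value below are invented for illustration.

```python
# Toy goal-oriented MDP: s0 (start), d (dead end), g (goal).
# From s0, "move" reaches the goal with prob 0.9 and the dead end
# with prob 0.1, at unit cost. The dead end only offers "help",
# which reaches the goal at penalty cost D (an assumption of this sketch).
D = 20.0  # finite penalty for resorting to human help

transitions = {
    ("s0", "move"): [(0.9, "g", 1.0), (0.1, "d", 1.0)],
    ("d", "help"): [(1.0, "g", D)],  # human rescues the agent at cost D
}
V = {"s0": 0.0, "d": 0.0, "g": 0.0}  # goal is absorbing with value 0

for _ in range(100):  # undiscounted value iteration until convergence
    for s in ("s0", "d"):
        actions = [a for (st, a) in transitions if st == s]
        V[s] = min(
            sum(p * (c + V[s2]) for p, s2, c in transitions[(s, a)])
            for a in actions
        )

# Expected cost from s0: 0.9 * 1 + 0.1 * (1 + 20) = 3.0
print(V["s0"], V["d"])
```

With a large enough D, the optimal policy avoids states likely to lead to dead ends whenever possible, which is the intuition behind the equivalence of the two criteria claimed in the abstract.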
Notes
1. We implicitly assume that every state has at least one applicable action.
Acknowledgments
The authors received financial support from CAPES, FAPESP (grants #2015/01587-0 and #2016/01055-1) and CNPq (grants #303920/2016-5 and #420669/2016-7).
Copyright information
© 2018 Springer Nature Switzerland AG
Cite this paper
Andrés, I., de Barros, L.N., Mauá, D.D., Simão, T.D. (2018). When a Robot Reaches Out for Human Help. In: Simari, G., Fermé, E., Gutiérrez Segura, F., Rodríguez Melquiades, J. (eds) Advances in Artificial Intelligence - IBERAMIA 2018. IBERAMIA 2018. Lecture Notes in Computer Science, vol. 11238. Springer, Cham. https://doi.org/10.1007/978-3-030-03928-8_23
DOI: https://doi.org/10.1007/978-3-030-03928-8_23
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03927-1
Online ISBN: 978-3-030-03928-8