Synthesis for Multi-objective Stochastic Games: An Application to Autonomous Urban Driving

  • Taolue Chen
  • Marta Kwiatkowska
  • Aistis Simaitis
  • Clemens Wiltsche
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8054)

Abstract

We study strategy synthesis for stochastic two-player games with multiple objectives expressed as a conjunction of LTL and expected total reward goals. For stopping games, the strategies are constructed from the Pareto frontiers that we compute via value iteration. Since, in general, infinite memory is required for deterministic winning strategies in such games, our construction takes advantage of randomised memory updates in order to provide compact strategies. We implement our methods in PRISM-games, a model checker for stochastic multi-player games, and present a case study motivated by the DARPA Urban Challenge, illustrating how our methods can be used to synthesise strategies for high-level control of autonomous vehicles.
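The abstract's central computation, value iteration over Pareto frontiers for expected total reward objectives, can be sketched concretely. The fragment below is a minimal illustration rather than the authors' implementation: it covers only a single controller state of a toy stopping game with two reward objectives, represents each Pareto set by finitely many non-dominated points, and omits both the adversary's step (an intersection of downward-closed sets, which requires polyhedral operations such as those provided by the Parma Polyhedra Library [1]) and the randomised memory-update strategy construction. All state, action, and reward names are hypothetical.

```python
from itertools import product

# Hypothetical toy model (names are illustrative, not from the paper).
# Controller state "s0" chooses between a fast and a safe manoeuvre;
# terminal states carry a reward vector (objective 1, objective 2).
ACTIONS = {"s0": ["fast", "safe"]}

# transitions[(state, action)] -> list of (probability, successor)
TRANS = {
    ("s0", "fast"): [(0.9, "goal"), (0.1, "crash")],
    ("s0", "safe"): [(0.6, "goal"), (0.4, "stop")],
}

# Terminal reward vectors, e.g. (reached destination, avoided accident).
TERMINAL = {"goal": (1.0, 1.0), "crash": (1.0, 0.0), "stop": (0.0, 1.0)}

def dominated(p, q):
    """True if q is at least as good as p in every objective and differs."""
    return q != p and all(qi >= pi for pi, qi in zip(p, q))

def pareto(points):
    """Keep only non-dominated points (corners of the Pareto frontier)."""
    return [p for p in points if not any(dominated(p, q) for q in points)]

def distribution_step(successor_sets, probs):
    """Weighted Minkowski sum: pick one achievable point per successor
    and average them according to the transition probabilities."""
    dim = len(successor_sets[0][0])
    return [tuple(sum(pr * x[i] for pr, x in zip(probs, combo)) for i in range(dim))
            for combo in product(*successor_sets)]

def controller_step(sets, state):
    """One value-iteration step for a controller state: union the sets
    achievable via each action, then keep the non-dominated points."""
    candidates = []
    for a in ACTIONS[state]:
        probs = [pr for pr, _ in TRANS[(state, a)]]
        succ = [sets[t] for _, t in TRANS[(state, a)]]
        candidates.extend(distribution_step(succ, probs))
    return pareto(candidates)

# Initialise terminal states with their reward vectors, s0 with the origin.
sets = {t: [v] for t, v in TERMINAL.items()}
sets["s0"] = [(0.0, 0.0)]
for _ in range(10):          # a stopping game: the iteration stabilises quickly
    sets["s0"] = controller_step(sets, "s0")

print(sets["s0"])            # achievable trade-offs at s0: (1.0, 0.9) and (0.6, 1.0)
```

Running the sketch prints the two incomparable trade-off points achievable from s0, i.e. the corners of its Pareto frontier; a full implementation would additionally handle adversary states and iterate over all states of the game until the sets converge.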

Keywords

Road Segment, Markov Decision Process, Reward Function, Pareto Frontier, Linear Temporal Logic

References

  1. Bagnara, R., Hill, P.M., Zaffanella, E.: The Parma Polyhedra Library: Toward a complete set of numerical abstractions for the analysis and verification of hardware and software systems. Sci. Comput. Program. 72(1-2), 3–21 (2008)
  2. Brázdil, T., Brozek, V., Chatterjee, K., Forejt, V., Kučera, A.: Two views on multiple mean-payoff objectives in Markov decision processes. In: LICS, pp. 33–42 (2011)
  3. Campbell, M., Egerstedt, M., How, J.P., Murray, R.M.: Autonomous driving in urban environments: approaches, lessons and challenges. Phil. Trans. R. Soc. A 368(1928), 4649–4672 (2010)
  4. Chatterjee, K., Doyen, L., Henzinger, T.A., Raskin, J.F.: Generalized mean-payoff and energy games. In: FSTTCS. LIPIcs, vol. 8, pp. 505–516 (2010)
  5. Chatterjee, K., Majumdar, R., Henzinger, T.A.: Markov decision processes with multiple objectives. In: Durand, B., Thomas, W. (eds.) STACS 2006. LNCS, vol. 3884, pp. 325–336. Springer, Heidelberg (2006)
  6. Chatterjee, K., Randour, M., Raskin, J.-F.: Strategy synthesis for multi-dimensional quantitative objectives. In: Koutny, M., Ulidowski, I. (eds.) CONCUR 2012. LNCS, vol. 7454, pp. 115–131. Springer, Heidelberg (2012)
  7. Chen, T., Forejt, V., Kwiatkowska, M., Parker, D., Simaitis, A.: PRISM-games: A model checker for stochastic multi-player games. In: Piterman, N., Smolka, S.A. (eds.) TACAS 2013. LNCS, vol. 7795, pp. 185–191. Springer, Heidelberg (2013)
  8. Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., Trivedi, A., Ummels, M.: Playing stochastic games precisely. In: Koutny, M., Ulidowski, I. (eds.) CONCUR 2012. LNCS, vol. 7454, pp. 348–363. Springer, Heidelberg (2012)
  9. Chen, T., Forejt, V., Kwiatkowska, M., Simaitis, A., Wiltsche, C.: On stochastic games with multiple objectives. In: MFCS (accepted, 2013)
  10. Courcoubetis, C., Yannakakis, M.: Markov decision processes and regular events. IEEE Trans. Autom. Control 43(10), 1399–1418 (1998)
  11. DARPA: Urban Challenge (2007), online; accessed March 8, 2013
  12. Dziembowski, S., Jurdzinski, M., Walukiewicz, I.: How much memory is needed to win infinite games? In: LICS, pp. 99–110 (1997)
  13. Etessami, K., Kwiatkowska, M.Z., Vardi, M.Y., Yannakakis, M.: Multi-objective model checking of Markov decision processes. LMCS 4(4) (2008)
  14. Fahrenberg, U., Juhl, L., Larsen, K.G., Srba, J.: Energy games in multiweighted automata. In: Cerone, A., Pihlajasaari, P. (eds.) ICTAC 2011. LNCS, vol. 6916, pp. 95–115. Springer, Heidelberg (2011)
  15. Forejt, V., Kwiatkowska, M., Norman, G., Parker, D., Qu, H.: Quantitative multi-objective verification for probabilistic systems. In: Abdulla, P.A., Leino, K.R.M. (eds.) TACAS 2011. LNCS, vol. 6605, pp. 112–127. Springer, Heidelberg (2011)
  16. OpenStreetMap (2013), online; accessed March 8, 2013
  17. Urmson, C., Anhalt, J., Bagnell, D., Baker, C., Bittner, R., Clark, M.N., Dolan, J., Duggins, D., Galatali, T., Geyer, C., et al.: Autonomous driving in urban environments: Boss and the urban challenge. J. Field Robot. 25(8), 425–466 (2008)
  18. Velner, Y., Chatterjee, K., Doyen, L., Henzinger, T.A., Rabinovich, A., Raskin, J.F.: The complexity of multi-mean-payoff and multi-energy games. CoRR, abs/1209.3234 (2012)
  19. Wongpiromsarn, T., Frazzoli, E.: Control of probabilistic systems under dynamic, partially known environments with temporal logic specifications. In: CDC, pp. 7644–7651 (2012)
  20. Wongpiromsarn, T., Topcu, U., Murray, R.M.: Receding horizon temporal logic planning. IEEE Trans. Automat. Contr. 57(11), 2817–2830 (2012)
  21. Wongpiromsarn, T., Ulusoy, A., Belta, C., Frazzoli, E., Rus, D.: Incremental synthesis of control policies for heterogeneous multi-agent systems with linear temporal logic specification. In: ICRA (accepted, 2013)

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Taolue Chen (1)
  • Marta Kwiatkowska (1)
  • Aistis Simaitis (1)
  • Clemens Wiltsche (1)
  1. Department of Computer Science, University of Oxford, United Kingdom
