Skip to main content

Guaranteed performance regions for multi-user Markov models

  • Conference paper
  • First Online:
Stochastic Theory and Adaptive Control

Part of the book series: Lecture Notes in Control and Information Sciences ((LNCIS,volume 184))

  • 147 Accesses

Abstract

A user facing a multi-user resource-sharing system considers a vector of performance measures (e.g. response times to various tasks). Acceptable performance is defined through a set in the space of performance vectors. Can the user obtain a (time-average) performance vector which approaches this desired set? We consider the worst-case scenario, where other users may, for selfish reasons, try to exclude his vector from the desired set. For a controlled Markov model of the system, we give a sufficient condition for approachability, and construct appropriate policies. Under certain recurrence conditions, a complete characterization of approachability is then provided for convex sets. The mathematical formulation leads to a theory of approachability for stochastic games. A simple queueing example is analyzed to illustrate the applicability of this approach.

Research performed in part while this author was visiting the Systems Research Center, University of Maryland, College Park, where he was supported in part through NSF Grant NSF ECS-83-51836.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. E. Altman and A. Shwartz, “Non stationary policies for controlled Markov chains,” EE Pub. 633, Technion, June 1987.

    Google Scholar 

  2. E. Altman and A. Shwartz, “Markov decision problems and state-action frequencies,” SIAM J. Control and Optimization 29 No. 4, July 1991.

    Google Scholar 

  3. R. J. Aumann and M. Maschler, Game theoretic aspects of gradual disarmament. Chapter V, Report to the U.S. Arms Control and Disarmament Agency, Contract S.T.80, prepared by Mathematica, Inc., Princeton, N.J., 1996.

    Google Scholar 

  4. D. Blackwell, “An analogue for the minimax theorem for vector payoffs,” Pacific J. Math., 6, pp. 1–8, 1956.

    MATH  MathSciNet  Google Scholar 

  5. D. Blackwell, “Controlled random walks,” Proc. Internat. Congress Math., 3, pp. 336–338, 1954.

    Google Scholar 

  6. D. Blackwell, “On multi-component attrition games,” Naval Res. Log. Quart., pp. 210–216, 1954.

    Google Scholar 

  7. V. S. Borkar, “Control of Markov chains with long-run average cost criterion,” in Stochastic Differential Systems, Stochastic Control and Application, W. Fleming and P.L. Lions, eds., IMA Vol. 10, Springer-Verlag, pp. 57–77, 1988.

    Google Scholar 

  8. C. Derman, Finite State Markovian Decision Processes, Academic Press, New-York, 1970.

    MATH  Google Scholar 

  9. A. Federgruen, “On N-person games with denumerable state space,” Adv. Appl. Prob. 10, pp. 452–471, 1978.

    Article  MATH  MathSciNet  Google Scholar 

  10. D. Gillette, “Stochastic games with zero stop probabilities,” in Contributions to the Theory of Games, III, (Annals of Math. Studies 39), M. Dresher et al., editors, Princeton Univ. Press, pp. 179–188, 1957.

    Google Scholar 

  11. S. Hart, “Nonzero-sum two-person repeated games with incomplete information,” Math. of Oper. Res., 10, pp. 117–153, 1985.

    Article  MATH  MathSciNet  Google Scholar 

  12. J. F. Hannan, “Approximation to Bayes risk in repeated play,’ in Contributions to the Theory of Games, III, (Annals of Math. Studies 39), M. Dresher et al., editors, Princeton Univ. Press, pp. 97–139, 1957.

    Google Scholar 

  13. T. F. Hou, “Weak approachability in a two-person game,” Ann. Math. Statist., 40, pp. 789–813, 1969.

    MathSciNet  MATH  Google Scholar 

  14. T. F. Hou, “Approachability in a two-person game,” Ann. Math. Stat., 42, pp. 735–744, 1971.

    MATH  Google Scholar 

  15. M. Katz, “Infinitely repeatable games,” Pac. J. Math., 10, pp. 879–885, 1960.

    MATH  Google Scholar 

  16. R. D. Luce and H. Raiffa, Games and Decisions, Wiley, New-York, 1957.

    MATH  Google Scholar 

  17. A. Maitra and T. Parthasarathy, “On stochastic games, II,” Journal of Optimization Theory and Applications, Vol. 8, pp. 155–160, 1971.

    Article  MathSciNet  Google Scholar 

  18. T. Parthasarathy and M. Stern, “Markov games: a survey,” in Differential Games and Control Theory, P.L.E. Roxin and R. Sternberg, eds., Marcel Dekker, 1977.

    Google Scholar 

  19. T. E. S. Raghavan and J.A. Filar, “Algorithms for stochastic games — a survey,” Preprint, June 1989.

    Google Scholar 

  20. S. M. Ross, Applied Probability Models with Optimization Applications, Holden-Day, San Francisco, 1970.

    MATH  Google Scholar 

  21. H. Sackrowitz, “A note on approachability in a two-person game,” Ann. Math. Stat., 43, pp. 1017–1019, 1972.

    MathSciNet  MATH  Google Scholar 

  22. L. I. Sennott, “Average cost optimal stationary policies in infinite state Markov decision processes with unbounded costs,” Operations Research, Vol. 37, pp. 626–633, 1989.

    MATH  MathSciNet  Google Scholar 

  23. A. N. Shiryayev, Probability, Springer-Verlag, 1984.

    Google Scholar 

  24. N. Shimkin and A. Shwartz, “Guaranteed performance regions for multi-use Markov models,” submitted to the IEEE Trans. Automat. Contr., 1991.

    Google Scholar 

  25. J. Sorin, An Introduction to Two-Person-Zero-Sum Repeated Games with Incomplete Information. IMSSS-Economics TR-312, Stanford University, memo.

    Google Scholar 

  26. M. A. Stern, On Stochastic Games with Limiting Average Pay-Off. Ph.D. dissertation, submitted to the University of Illinois, Circle Campus, Chicago, 1975.

    Google Scholar 

  27. N. Vieille, “Weak approachability,” Preprint, 1991.

    Google Scholar 

  28. J. Walrand, An Introduction to Queueing Networks, Prentice-Hall, New Jersey, 1988.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

T. E. Duncan B. Pasik-Duncan

Rights and permissions

Reprints and permissions

Copyright information

© 1992 Springer-Verlag

About this paper

Cite this paper

Shimkin, N., Shwartz, A. (1992). Guaranteed performance regions for multi-user Markov models. In: Duncan, T.E., Pasik-Duncan, B. (eds) Stochastic Theory and Adaptive Control. Lecture Notes in Control and Information Sciences, vol 184. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0113259

Download citation

  • DOI: https://doi.org/10.1007/BFb0113259

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-55962-7

  • Online ISBN: 978-3-540-47327-5

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics