Better-Reply Strategies with Bounded Recall

Zapechelnyuk, Andriy

doi:10.1007/978-3-540-73135-1_19

Andriy Zapechelnyuk⁴

Part of the book series: Lecture Notes in Economics and Mathematical Systems ((LNE,volume 599))

588 Accesses

Abstract

In every (discrete) period of time a decision maker (for short, an agent) makes a decision and, simultaneously, Nature selects a state of the world. The agent receives a payoff which depends on both his action and the state. Nature’s behavior is ex-ante unknown to the agent, it may be as simple as an i.i.d. environment or as sophisticated as a strategic play of a rational player. The agent’s objective is to select a sequence of decisions which guarantees to him the long-run average payoff as large as the best-reply payoff against Nature’s empirical distribution of play, no matter what Nature does. A behavior rule of the agent which fulfills this objective is called universally consistent1: the rule is “consistent” if it is optimized against the empirical play of Nature; the word “ universally” refers to its applicability to any behavior of Nature.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

D. Blackwell. An analog of the minmax theorem for vector payoffs. Pacific Journal of Mathematics, 6:1–8, 1956.
Google Scholar
N. Cesa-Bianchi and G. Lugosi. Potential-based algorithms in on-line prediction and game theory. Machine Learning, 51:239–261, 2003.
Article Google Scholar
N. Cesa-Bianchi, Y. Freund, D. Helmbold, D. Haussler, R. Shapire, and M. Warmuth. How to use expert advice. Journal of the ACM, 44:427–485, 1997.
Article Google Scholar
Dean Foster and Rakesh Vohra. Regret in the online decision problem. Games and Economic Behavior, 29:7–35, 1999.
Article Google Scholar
Yoav Freund and Robert Schapire. Game theory, on-line prediction and boosting. In Proceedings of the Ninth Annual Conference on Computational Learning Theory, pages 325–332, 1996.
Google Scholar
Drew Fudenberg and David Levine. Universal consistency and cautious fictitious play. Journal of Economic Dynamics and Control, 19:1065–1089, 1995.
Article Google Scholar
J Hannan. Approximation to Bayes risk in repeated play. In M. Dresher, A. W. Tucker, and P. Wolfe, editors, Contributions to the Theory of Games, Vol. III, Annals of Mathematics Studies 39, pages 97–139. Princeton University Press, 1957.
Google Scholar
Sergiu Hart and Andreu Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127–1150, 2000.
Article Google Scholar
Sergiu Hart and Andreu Mas-Colell. A general class of adaptive procedures. Journal of Economic Theory, 98:26–54, 2001.
Article Google Scholar
Ehud Lehrer and Eilon Solan. No regret with bounded computational capacity. The Center for Mathematical Studies in Economics and Management Science, Northwestern University. Discussion Paper 1373, 2003.
Google Scholar
N. Littlestone and M. Warmuth. The weighted majority algorithm. Information and Computation, 108:212–261, 1994.
Article Google Scholar
V. Vovk. A game of prediction with expert advice. Journal of Computer and System Sciences, 56:153–173, 1998.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Center for Rationality, the Hebrew University, Israel
Andriy Zapechelnyuk

Authors

Andriy Zapechelnyuk
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Statistics and Mathematics Silvio Vianelli, University of Palermo, Viale delle Scienze - Ed. 13, 90128, Palermo, Italy
Andrea Consiglio

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Zapechelnyuk, A. (2007). Better-Reply Strategies with Bounded Recall. In: Consiglio, A. (eds) Artificial Markets Modeling. Lecture Notes in Economics and Mathematical Systems, vol 599. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73135-1_19

Download citation

DOI: https://doi.org/10.1007/978-3-540-73135-1_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73134-4
Online ISBN: 978-3-540-73135-1
eBook Packages: Business and EconomicsEconomics and Finance (R0)

Publish with us

Policies and ethics