Abstract
An optimal model for the Prisoner’s Dilemma game is suggested. The model is normative in the sense that given few assumptions about the way the game is perceived by the players, an optimal policy is prescribed to each player maximizing his long-run expected gain. The dilemma is “resolved” by restructuring the game as a supergame composed of several component games such that transitions among them are possible. Dynamic programming is used to derive the optimal policy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bellman, R. Dynamic programming. Princeton: Princeton University Press, 1957. (a)
Bellman, R. A Markovian decision process. Journal of Mathematics and Mechanics, 1957, 6, 679–684. (b)
Bellman, R. Adaptive control processes: A guided tour. Princeton: Princeton University Press, 1961.
Bellman, R., & Dreyfus, S. E. Applied dynamic programming. Princeton: Princeton University Press, 1962.
Gallo, P. S., & McClintock, C. G. Cooperative and competitive behavior in mixed-motive games. Journal of Conflict Resolution, 1965, 9, 68–78.
Howard, R. A. Dynamic programming and Markov processes. New York: M. I. T. and Wiley, 1960.
Luce, R. D., & Raiffa, H. Games and decisions. New York: Wiley, 1957.
Rapoport, Amnon. A study of human decisions in a stationary Markov process with rewards. Chapel Hill: Psychometric Laboratory Report No. 48, 1966.
Rapoport, Amnon, & Mowshowitz, A. Experimental studies of stochiastic models for the Prisoner’s Dilemma. Behavioral Science, 1966, 11, 444 458.
Rapoport, Anatol. A stochastic model for the Prisoner’s Dilemma. In J. Garland (Ed.), Stochastic models in medicine and biology. Madison: University of Wisconsin Press, 1964.
Rapoport, Anatol, & Chammah, A. M. Prisoner’s Dilemma: A study in conflict and cooperation. Ann Arbor: University of Michigan Press, 1965.
Rapoport, Anatol, & Orwant, C. Experimental games: A review. Behavioral Science, 1962, 7, 1–37.
Riis, J. O. Discounted Markov programming in a periodic process. Operations Research, 1965, 13, 920–929.
Scodel, A., Minas, J. S., Ratoosh, P., & Lipetz, M. Some descriptive aspects of two-person non-zero-sum games. Journal of Conflict Resolution, 1959, 3, 114–119.
Shubik, M. Some reflections on the design of game theoretic models for the study of negotiation and threats. Journal of Conflict Resolution, 1963, 7, 1–12.
Suppes, P., & Atkinson, R. C. Markov learning models for multiperson interaction. Stanford: Stanford University Press, 1960.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 1990 Kluwer Academic Publishers
About this chapter
Cite this chapter
Rapoport, A. (1990). Optimal Policies for the Prisoner’s Dilemma. In: Experimental Studies of Interactive Decisions. Theory and Decision Library, vol 5. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-1992-1_1
Download citation
DOI: https://doi.org/10.1007/978-94-009-1992-1_1
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-7391-2
Online ISBN: 978-94-009-1992-1
eBook Packages: Springer Book Archive