Abstract
This article provides an experimental analysis of two-armed bandit problems that have a different structure in which the first unsuccessful outcome leads to termination of the game. It differs from a conventional two-armed bandit problem in that there is no opportunity to alter behavior after an unsuccessful outcome. Introducing the risk of death into a sequential decision problem alters the structure of the problem. Even though play ends after an unsuccessful outcome, Bayesian learning after successful outcomes has a potential function in this class of two-armed bandit problems. Increasing uncertainty boosts the chance of long-term survival since ambiguous probabilities of survival are increased more after each successful outcome. In the independent choice experiments, a slim majority of participants displayed a preference for greater risk ambiguity. Particularly in the interdependent choice experiments, participants were overly deterred by ambiguity. For both independent and interdependent choices, there were several dimensions on which participants displayed within session rationality. However, participants failed to learn and improve their strategy over a series of rounds, which is consistent with evidence of bounded rationality in other challenging games.
Similar content being viewed by others
Notes
This formulation is equivalent to a more standard parameterization in which the prior is characterized as equivalent to having observed d successes and e failures, leading to a mean prior of d/(d + e), and a posterior after a successful outcome of (d + 1)/(d + e + 1). Thus, γ equals d + e, and q equals d/(d + e). The advantage of adopting the γ, q parameterization is that it is feasible to take derivatives with respect to γ to analyze the effect of increased precision, i.e., less uncertainty.
While we collected information on participants’ gender and major, these variables did not have a statistically significant effect on whether the participant made optimal choices.
The various measures of risk aversion derived from this part of the study never had statistically significant effects on decisions. The absence of such an effect does not necessarily imply that risk aversion is not influential since the Holt-Laury procedure may not always generate an accurate index of the subject’s underlying risk preferences.
Bayes’ theorem allows one to calculate the probability of a certain scenario based on conditional probabilities. Here, using Bayes’ theorem would allow participants to update their beliefs about being in Situation A after a successful outcome. The formula used to calculate this Bayesian updating is as follows:
P(A| B) = P(B| A)P(A)/P(B).
Applying the formula to this experiment results in the following:
\( {\displaystyle \begin{array}{c}P\left( Situation\ A| Winning\ Draw\right)=\\ {}P\left( Winning\ Draw| Situation\ A\right)P\left( Situation\ A\right)/P\left( Winning\ Draw\right).\end{array}} \)
Using known and updated probabilities allows one to calculate the probability of being in Situation A. The equation can continue to be used in successive rounds using the updated probabilities solved for in the prior round.
The same regression analysis that is reported in Table 4 was also conducted with the inclusion of individual fixed effects. The results remain quite similar with one exception—the coefficient of the session variable Treatment 2 in Panel B is statistically significant (p < 0.01) when including individual fixed effects.
Had the decisions been made in a group rather than on an individual basis, one would have expected even stronger performance on this dimension (Charness et al. 2007).
References
Anderson, C. M. (2012). Ambiguity aversion in multi-armed bandit problems. Theory and Decision, 72(1), 15–33.
Bellemare, C., Kröger, S., & Sossou, K. M. (2018). Reporting probabilistic expectations with dynamic uncertainty about possible distributions. Journal of Risk and Uncertainty, 57(2), 153–176.
Berry, D. A. (1972). A Bernoulli two-armed bandit. The Annals of Mathematical Statistics, 43(3), 871–897.
Berry, D. A., & Fristedt, B. (1985). Bandit problems: Sequential allocation of experiments. London: Chapman and Hall.
Berry, D. A., & Viscusi, W. K. (1981). Bernoulli two-armed bandits with geometric termination. Stochastic Processes and their Applications, 11(1), 35–45.
Bradt, R. N., Johnson, S. M., & Karlin, S. (1956). On sequential designs for maximizing the sum of n observations. The Annals of Mathematical Statistics, 2(4), 1060–1074.
Camerer, C., & Weber, M. (1992). Recent developments in modeling preferences: Uncertainty and ambiguity. Journal of Risk and Uncertainty, 5(4), 325–370.
Charness, G., & Levin, D. (2005). When optimal choices feel wrong: A laboratory study of Bayesian updating, complexity, and affect. American Economic Review, 95(4), 1300–1309.
Charness, G., & Levin, D. (2009). The origin of the winner’s curse: A laboratory study. American Economic Journal: Microeconomics, 1(1), 207–236.
Charness, G., Karni, E., & Levin, D. (2007). Individual and group decision making under risk: An experimental study of Bayesian updating and violations of first-order stochastic dominance. Journal of Risk and Uncertainty, 35(2), 129–148.
Ellsberg, D. (1961). Risk, ambiguity, and the Savage axioms. Quarterly Journal of Economics, 75(4), 643–669.
Engle-Warnick, J., & Laszlo, S. (2017). Learning-by-doing in an ambiguous environment. Journal of Risk and Uncertainty, 55(1), 71–94.
Holt, C. A., & Laury, S. K. (2002). Risk aversion and incentive effects. American Economic Review, 92(5), 1644–1655.
Knight, F. H. (1921). Risk, uncertainty, and profit. Chicago: University of Chicago Press.
Liu, H.-H., & Colman, A. M. (2009). Ambiguity aversion in the long run: Repeated decisions under risk and uncertainty. Journal of Economic Psychology, 30(3), 277–284.
Machina, M. J., & Siniscalchi, M. (2014). Ambiguity and ambiguity aversion. In M. J. Machina & W. K. Viscusi (Eds.), Handbook of the economics of risk and uncertainty (Vol. 1, pp. 729–807). Amsterdam: Elsevier B.V.
Moreno, O. M., & Rosokha, Y. (2016). Learning under compound risk vs. learning under ambiguity—An experiment. Journal of Risk and Uncertainty, 53(2–3), 137–162.
Park, H. M. (2009). Comparing group means: T-tests and one-way ANOVA using STATA, SAS, R, and SPSS. Working Paper. The University Information Technology Services (UITS) Center for Statistical and Mathematical Computing, Indiana University.
Trautmann, S. T., & Zeckhauser, R. J. (2013). Shunning uncertainty: The neglect of learning opportunities. Games and Economic Behavior, 79(1), 44–55.
Viscusi, W. K. (1979). Employment hazards: An investigation of market performance. Cambridge: Harvard University Press.
Yakowitz, S. J. (1969). Mathematics of adaptive control processes. New York: American Elsevier.
Acknowledgements
The authors are indebted to Gary Charness for superb suggestions that greatly improved the manuscript.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
Rights and permissions
About this article
Cite this article
Kip Viscusi, W., DeAngelis, S. Decision irrationalities involving deadly risks. J Risk Uncertain 57, 225–252 (2018). https://doi.org/10.1007/s11166-018-9292-4
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11166-018-9292-4