A Representation Theorem for Decisions about Causal Models

Dewey, Daniel

doi:10.1007/978-3-642-35506-6_7

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 7716)

Included in the following conference series:

International Conference on Artificial General Intelligence

Abstract

Given the likely large impact of artificial general intelligence, a formal theory of intelligence is desirable. To further this research program, we present a representation theorem governing the integration of causal models with decision theory. This theorem puts formal bounds on the applicability of the submodel hypothesis, a normative theory of decision counterfactuals that has previously been argued for on a priori and practical grounds, as well as by comparison to theories of counterfactual cognition in humans. We prove four conditions under which the submodel hypothesis holds, forcing any preference between acts to be consistent with some utility function over causal submodels.

Author information

Authors and Affiliations

Future of Humanity Institute, UK
Daniel Dewey

Editor information

Editors and Affiliations

Humboldt Universität Berlin, Raumerstr. 11, 10437, Berlin, Germany
Joscha Bach
Aidyia Ltd., Unit 612, 6/F, Lu Plaza, 2 Wing Yip Street, Kwun Tong, Hong Kong
Ben Goertzel
Adams State University, Suite 3060, 81101, Alamosa, CO, USA
Matthew Iklé

Cite this paper

Dewey, D. (2012). A Representation Theorem for Decisions about Causal Models. In: Bach, J., Goertzel, B., Iklé, M. (eds) Artificial General Intelligence. AGI 2012. Lecture Notes in Computer Science, vol 7716. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35506-6_7

DOI: https://doi.org/10.1007/978-3-642-35506-6_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35505-9
Online ISBN: 978-3-642-35506-6
eBook Packages: Computer Science, Computer Science (R0)
