Skip to main content

Evolutionary Machine Ethics

  • Living reference work entry
  • First Online:
Handbuch Maschinenethik

Part of the book series: Springer Reference Geisteswissenschaften ((SPREFGEIST))

Abstract

Machine ethics is a sprouting interdisciplinary field of enquiry arising from the need of imbuing autonomous agents with some capacity for moral decision-making. Its overall results are not only important for equipping agents with a capacity for moral judgment, but also for helping better understand morality, through the creation and testing of computational models of ethics theories. Computer models have become well defined, eminently observable in their dynamics, and can be transformed incrementally in expeditious ways. We address, in work reported and surveyed here, the emergence and evolution of cooperation in the collective realm. We discuss how our own research with Evolutionary Game Theory (EGT) modelling and experimentation leads to important insights for machine ethics, such as the design of moral machines, multi-agent systems, and contractual algorithms, plus their potential application in human settings too.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

References

  • Abeler, Johannes, Juljana Calaki, Kai Andree, and Christoph Basek. 2010. The power of apology. Economics Letters 107(2): 233–235.

    Google Scholar 

  • Axelrod, Robert. 1984. The evolution of cooperation, vol. 5145. New York: Basic Books (AZ).

    Google Scholar 

  • Back, Istvan, and Andreas Flache. 2008. The adaptive rationality of interpersonal commitment. Rationality and Society 20(1): 65–83.

    Google Scholar 

  • Boden, M. A. 2008. Information, computation, and cognitive science. Handbook of the Philosophy of Science 8: 749–769.

    Google Scholar 

  • Byrne, Ruth M. J. 2007. The rational imagination: How people create alternatives to reality. Cambridge, MA: MIT Press.

    Google Scholar 

  • Charniak, Eugene, and Robert P. Goldman. 1993. A Bayesian model of plan recognition. Artificial Intelligence 64(1): 53–79.

    Google Scholar 

  • Cherry, Todd L., and David M. McEvoy. 2013. Enforcing compliance with environmental agreements in the absence of strong institutions: An experimental analysis. Environmental and Resource Economics 54(1): 1–15.

    Google Scholar 

  • Chopra, Amit K., and Munindar P. Singh. 2009. Multiagent commitment alignment. In Proceedings of The 8th international conference on autonomous agents and multiagent systems-volume 2, 937–944. Budapest: International Foundation for Autonomous Agents and Multiagent Systems.

    Google Scholar 

  • Chopra, Amit K., Samuel H. Christie V, and Munindar P. Singh. 2017. Splee: a declarative information-based language for multiagent interaction protocols. In Proceedings of the 16th conference on autonomous agents and multiagent systems, 1054–1063. International Foundation for Autonomous Agents and Multiagent Systems.

    Google Scholar 

  • Cohen, Philip R., and Hector J. Levesque. 1990. Intention is choice with commitment. Artificial Intelligence 42(2–3: 213–261.

    Google Scholar 

  • Collins, John David, Edward Jonathan Hall, and Laurie Ann Paul, eds. 2004. Causation and counterfactuals. Cambridge, MA: MIT Press.

    Google Scholar 

  • Correia, Vasco. 2016. Weakness of will and self-control’. In Morality and emotion, 35. London: Routledge, 83–98. ISBN 978-1-138-12130-0.

    Google Scholar 

  • Deacon, Terrence W. 2003. The hierarchic logic of emergence: Untangling the interdependence of evolution and self-organization. In Evolution and learning: The Baldwin effect reconsidered (Chapter 14): 273–308, Cambridge, MA.

    Google Scholar 

  • Dennett, Daniel Clement. 2005. Sweet dreams: Philosophical obstacles to a science of consciousness. Cambridge, MA: MIT Press.

    Google Scholar 

  • Fehr, Ernst, and Simon Gächter. 2002. Altruistic punishment in humans. Nature 415(6868): 137–140.

    Google Scholar 

  • Fischbacher, Urs, and Verena Utikal. 2013. On the acceptance of apologies. Games and Economic Behavior 82: 592–608.

    Google Scholar 

  • Fodor, Jerry A. 1974. Special sciences (or: the disunity of science as a working hypothesis). Synthese 28(2): 97–115.

    Article  Google Scholar 

  • Gaspar, Augusta. 2016. Morality and empathy vs. empathy and morality. In Morality and Emotion, 62–82. London: Routledge, 83–98. ISBN 978-1-138-12130-0.

    Google Scholar 

  • Hamilton, William D., and Robert Axelrod. 1981. The evolution of cooperation. Science 211(27): 1390–1396.

    Google Scholar 

  • Han, The Anh. 2013. Intention recognition, commitments and their roles in the evolution of cooperation: from artificial intelligence techniques to evolutionary game theory models. SAPERE series, vol. 9. Berlin.

    Google Scholar 

  • Han, The Anh. 2016. Emergence of social punishment and cooperation through prior commitments. In Proceedings of the 30th AAAI conference on artificial intelligence (AAAI 2016), 2494–2500, Phoenix.

    Google Scholar 

  • Han, The Anh, and Tom Lenaerts. 2016. A synergy of costly punishment and commitment in cooperation dilemmas. Adaptive Behavior 24(4): 237–248.

    Google Scholar 

  • Han, The Anh, and Luis Moniz Pereira. 2013a. Context-dependent incremental decision making scrutinizing the intentions of others via bayesian network model construction. Intelligent Decision Technologies 7(4): 293–317.

    Google Scholar 

  • Han, The Anh, and Luis Moniz Pereira. 2013b. Intention-based decision making via intention recognition and its applications. In Human Behavior Recognition Technologies: Intelligent Applications for Monitoring and Security. Hershey, PA: IGI Global, pp 174–211.

    Google Scholar 

  • Han, The Anh, and Luis Moniz Pereira. 2013c. State-of-the-art of intention recognition and its use in decision making. AI Communications 26(2): 237–246.

    Google Scholar 

  • Han The Anh, Luis Moniz Pereira, and Francisco C. Santos. 2011. Intention recognition promotes the emergence of cooperation. Adaptive Behavior 19(4): 264–279.

    Google Scholar 

  • Han, The Anh, Luis Moniz Pereira, and Francisco C. Santos. 2012a. Corpus-based intention recognition in cooperation dilemmas. Artificial Life 18(4): 365–383.

    Google Scholar 

  • Han, The Anh, Luis Moniz Pereira, and Francisco C. Santos. 2012b. Intention recognition, commitment and the evolution of cooperation. In Evolutionary Computation (CEC), 2012 I.E. Congress on, 1–8. Brisbane: IEEE.

    Google Scholar 

  • Han, The Anh, Luis Moniz Pereira, and Francisco C. Santos. 2012c. The emergence of commitments and cooperation. In Proceedings of the 11th international conference on autonomous agents and multiagent systems-volume 1, 559–566. Valencia: International Foundation for Autonomous Agents and Multiagent Systems.

    Google Scholar 

  • Han, The Anh, Ari Saptawijaya, and Luis Moniz Pereira. 2012d. Moral reasoning under uncertainty. In International conference on logic for programming artificial intelligence and reasoning, 212–227. Springer, Berlin/Heidelberg.

    Google Scholar 

  • Han, The Anh, Luis Moniz Pereira, Francisco C. Santos, and Tom Lenaerts. 2013a. Good agreements make good friends. Scientific Reports 3.

    Google Scholar 

  • Han, The Anh, Luis Moniz Pereira, Francisco C. Santos, and Tom Lenaerts. 2013b. Why is it so hard to say sorry? evolution of apology with commitments in the iterated Prisoner’s Dilemma. In Proceedings of the twenty-third international joint conference on artificial intelligence, 177–183. Beijing: AAAI Press.

    Google Scholar 

  • Han, The Anh, Luis Moniz Pereira, and Tom Lenaerts. 2015a. Avoiding or restricting defectors in public goods games?. Journal of the Royal Society Interface 12(103): 20141203.

    Google Scholar 

  • Han, The Anh, Luis Moniz Pereira, Francisco C. Santos, and Tom Lenaerts. 2015b. Emergence of cooperation via intention recognition, commitment and apology–a research summary. AI Communications 28(4): 709–715.

    Google Scholar 

  • Han, The Anh, Francisco C. Santos, Tom Lenaerts, and Luis Moniz Pereira. 2015c. Synergy between intention recognition and commitments in cooperation dilemmas. Scientific Reports 5.

    Google Scholar 

  • Han, The Anh, Luis Moniz Pereira, and Tom Lenaerts. 2017a. Evolution of commitment and level of participation in public goods games. Autonomous Agents and Multi-Agent Systems 31(3): 561–583.

    Google Scholar 

  • Han, The Anh, Luís Moniz Pereira, Luis A. Martinez-Vaquero, and Tom Lenaerts. 2017b. Centralized vs. personalized commitments and their influence on cooperation in group interactions. In Proceedings of the 31st AAAI conference on artificial intelligence (AAAI 2017), 2999–3005, San Francisco.

    Google Scholar 

  • Hauser, Marc. 2006. Moral minds: How nature designed our universal sense of right and wrong. New York: Ecco/HarperCollins Publishers.

    Google Scholar 

  • Heinze, C. 2003. Modeling intention recognition for intelligent agent systems, PhD thesis, The University of Melbourne, Australia.

    Google Scholar 

  • Hodges, A. 1997. Alan Turing: One of the great philosophers. Phoenix/London: Phoenix.

    Google Scholar 

  • Hofbauer, Josef, and Karl Sigmund. 1998. Evolutionary games and population dynamics. Cambridge, UK: Cambridge University Press.

    Google Scholar 

  • John, Eileen. 2016. Moral feelings from rocky fictional ground. In Morality and emotion, 99. London: Routledge, 83–98. ISBN 978-1-138-12130-0.

    Google Scholar 

  • Levin, J. 2010. Functionalism. In The stanford encyclopaedia of philosophy, ed. E. N. Zalta. http://plato.stanford.edu/archives/sum2010/entries/functionalism/.

  • Marsella, Stacy, and Jonathan Gratch. 2014. Computationally modeling human emotion. Communications of the ACM 57(12): 56–67.

    Google Scholar 

  • Martinez-Vaquero, Luis A., and José A. Cuesta. 2013. Evolutionary stability and resistance to cheating in an indirect reciprocity model based on reputation. Physical Review E 87(5): 052810.

    Google Scholar 

  • Martinez-Vaquero, Luis A., and José A. Cuesta. 2014. Spreading of intolerance under economic stress: Results from a reputation-based model. Physical Review E 90(2): 022805.

    Google Scholar 

  • Martinez-Vaquero, Luis A., The Anh Han, Luis Moniz Pereira, and Tom Lenaerts. 2015. Apology and forgiveness evolve to resolve failures in cooperative agreements. Scientific Reports 5.

    Google Scholar 

  • Martinez-Vaquero, Luis A., The Anh Han, Luis Moniz Pereira, and Tom Lenaerts. 2017. When agreement-accepting free-riders are a necessary evil for the evolution of cooperation. Scientific Reports 7.

    Google Scholar 

  • McCullough, Michael. 2008. Beyond revenge: The evolution of the forgiveness instinct. New Jersey: Wiley.

    Google Scholar 

  • McCullough, Michael E., Robert Kurzban, and Benjamin A. Tabak. 2010. Evolved mechanisms for revenge and forgiveness. In Understanding and reducing aggression, violence, and their consequences, 221–239. Miami.

    Google Scholar 

  • McCullough, Michael E., Eric J. Pedersen, Benjamin A. Tabak, and Evan C. Carter. 2014. Conciliatory gestures promote forgiveness and reduce anger in humans. Proceedings of the National Academy of Sciences 111(30): 11211–11216.

    Google Scholar 

  • McDermott, Drew V. 2001. Mind and mechanism. Cambridge, MA: MIT Press.

    Google Scholar 

  • Mendonça, Dina. 2016. Emotions and akratic feelings. In Morality and emotion, 50. London: Routledge, 83–98. ISBN 978-1-138-12130-0.

    Google Scholar 

  • Mikhail, J. 2007. Universal moral grammar: Theory, evidence and the future. Trends in Cognitive Sciences 11(4): 143–152.

    Article  Google Scholar 

  • Nesse, Randolph M. 2001. Natural selection and the capacity for subjective commitment. In Evolution and the capacity for commitment, 1–44. New York.

    Google Scholar 

  • Nowak, M. A. 2006. Five rules for the evolution of cooperation. Science 314(5805): 1560–1563.

    Article  Google Scholar 

  • Nowak, Martin A., and Karl Sigmund. 1992. Tit for tat in heterogeneous populations. Nature 355(6357): 250–253.

    Google Scholar 

  • Ohtsubo, Yohsuke, and Esuka Watanabe. 2009. Do sincere apologies need to be costly? Test of a costly signaling model of apology. Evolution and Human Behavior 30(2): 114–123.

    Google Scholar 

  • Okamoto, Kyoko, and Shuichi Matsumura. 2000. The evolution of punishment and apology: an iterated prisoner’s dilemma model. Evolutionary Ecology 14(8): 703-720.

    Google Scholar 

  • Pereira, Luís Moniz. 2012a. Turing is among us. Journal of Logic and Computation 22(6): 1257–1277.

    Article  Google Scholar 

  • Pereira, Luís Moniz. 2012b. Evolutionary tolerance. In Philosophy and cognitive science, 263–287. Berlin/Heidelberg: Springer.

    Chapter  Google Scholar 

  • Pereira, L.M., and A. Saptawijaya. 1994. Agent morality via counterfactuals in logic programming, invited paper. In Proceedings of bridging the gap: Is logic and automated reasoning a foundation for human reasoning?, a 39th annual meeting of the Cognitive Science Society (CogSci 2017) workshop, CEUR workshop proceedings, vol. 1994, ed. U. Furbach and C. Schon, 39–53, 26 July, London.

    Google Scholar 

  • Pereira, L.M., and A. Saptawijaya. 2011. Modelling morality with prospective logic. In Machine ethics, ed. M. Anderson and S.L. Anderson, 398–421. Cambridge University Press. ISBN 978-0521112352.

    Google Scholar 

  • Pereira, Luís Moniz, and The Anh Han. 2011a. Intention recognition with evolution prospection and causal bayes networks. In Computational intelligence for engineering systems, 1–33. Springer, Netherlands.

    Google Scholar 

  • Pereira, Luís Moniz, and The Anh Han. 2011b. Elder care via intention recognition and evolution prospection. In International conference on applications of declarative programming and knowledge management, 170–187. Springer, Berlin/Heidelberg.

    Google Scholar 

  • Pereira, Luís Moniz, and Ari Saptawijaya. 2009. Modelling morality with prospective logic. International Journal of Reasoning-based Intelligent Systems 1(3–4): 209–221.

    Google Scholar 

  • Pereira, Luís Moniz, and Ari Saptawijaya. 2015. Abduction and beyond in logic programming with application to morality. In ed. L. Magnani, IfColog Journal of Logics and their Applications, Special issue on Abduction, 3(1):37–71. London: College Publications.

    Google Scholar 

  • Pereira, Luís Moniz, and Ari Saptawijaya. 2016a. Bridging two realms of machine ethics. In Programming machine ethics, 159–165. Berlin: Springer.

    Google Scholar 

  • Pereira, Luís Moniz, and Ari Saptawijaya. 2016b. Programming machine ethics, vol. 26. Berlin: Springer. ISBN 978-3-319-29353-0.

    Google Scholar 

  • Pereira, Luís Moniz, and Ari Saptawijaya. 2016c. Counterfactuals, logic programming and agent morality. In Applications of formal philosophy, 25–53. Cham: Springer, 2017, Berlin: Springer, 25–54. ISBN 978-3319585055.

    Google Scholar 

  • Pereira, Luis Moniz, Tom Lenaerts, Luis A. Martinez-Vaquero, and The Anh Han. 2017. Social manifestation of guilt leads to stable cooperation in multi-agent systems. In Proceedings of the 16th conference on autonomous agents and multiagent systems, 1422–1430. Sao Paulo: International Foundation for Autonomous Agents and Multiagent Systems.

    Google Scholar 

  • Pope, Alexander. 1931. An essay on criticism, part II. Lewis, W. Russel Street, Covent Garden (1711).

    Google Scholar 

  • Powers, Simon T., Daniel J. Taylor, and Joanna J. Bryson. 2012. Punishment can promote defection in group-structured populations. Journal of Theoretical Biology 311: 107–116.

    Google Scholar 

  • Prinz, Jesse. 2016. Emotions, morality, and identity. In Morality and emotion, 13–34. London: Routledge, 83–98. ISBN 978-1-138-12130-0.

    Google Scholar 

  • Raihani, Nichola J., and Redouan Bshary. 2015. The reputation of punishers. Trends in Ecology & Evolution 30(2): 98–103.

    Google Scholar 

  • Roy, P., B. Bouchard, A. Bouzouane, and S. Giroux. 2007. A hybrid plan recognition model for Alzheimer’s patients: Interleaved- erroneous dilemma. In Proceedings of IEEE/WIC/ACM Inter- national conference on intelligent agent technology, 131–137.

    Google Scholar 

  • Sadri, Fariba. 2011. Logic-based approaches to intention recognition. In Handbook of research on ambient intelligence and smart environments: Trends and perspectives, 346–375. Hershey, PA: IGI Global.

    Google Scholar 

  • Saptawijaya, Ari, and Luis Moniz Pereira. 2015a. Logic programming applied to machine ethics. In Portuguese conference on artificial intelligence, 414–422. Cham: Springer. LNCS vol. 9273. Berlin: Springer. ISBN 978-3-319-23485-4.

    Google Scholar 

  • Saptawijaya, Ari, and Luis Moniz Pereira. 2015b. The potential of logic programming as a computational tool to model morality. In A construction manual for robots’ ethical systems, 169–210. Switzerland: Springer.

    Google Scholar 

  • Saptawijaya, Ari, and Luis Moniz Pereira. 2015c. From logic programming to machine ethics. In Handbuch Maschinenethik, ed. O. Bendel, (2018). Berlin: Springer.

    Google Scholar 

  • Saptawijaya, A., and L.M. Pereira. 2016. Logic programming for modeling morality. In Special issue on “Formal representations of model-based reasoning and abduction”, of the logic journal of the IGPL, ed. L. Magnani and C. Casadio. 24(4): 510–525. https://doi.org/10.1093/jigpal/jzw025, online 9 May, Aug 2016.

  • Saptawijaya, Ari, and Luís Moniz Pereira. 2018. From logic programming to machine ethics. In Handbuch Maschinenethik, ed. O. Bendel. Berlin: Springer.

    Google Scholar 

  • Sarker, Md Omar Faruque, Torbjørn S. Dahl, Elsa Arcaute, and Kim Christensen. 2014. Local interactions over global broadcasts for improved task allocation in self-organized multi-robot systems. Robotics and Autonomous Systems 62(10): 1453–1462.

    Google Scholar 

  • Schneider, Frédéric, and Roberto A. Weber. 2013. Long-term commitment and cooperation. Tech. Rep., Working paper series, University of Zurich, Department of Economics.

    Google Scholar 

  • Searle, John R. 1995. The construction of social reality. New York: Simon and Schuster.

    Google Scholar 

  • Searle, John. 2010. Making the social world: The structure of human civilization. Oxford: Oxford University Press.

    Google Scholar 

  • Sigmund, Karl. 2010. The calculus of selfishness. Princeton: Princeton University Press.

    Google Scholar 

  • Smith, Nick. 2008. I was wrong: The meanings of apologies. Cambridge, UK: Cambridge University Press.

    Google Scholar 

  • Sterelny, Kim. 2012. The evolved apprentice. Cambridge, MA: MIT Press.

    Google Scholar 

  • Takaku, Seiji, Bernard Weiner, and Ken-Ichi Ohbuchi. 2001. A cross-cultural examination of the effects of apology and perspective taking on forgiveness. Journal of Language and Social Psychology 20(1–2): 144–166.

    Google Scholar 

  • Trivers, Robert L. 1971. The evolution of reciprocal altruism. The Quarterly Review of Biology 46(1): 35–57.

    Article  Google Scholar 

  • Turing, Alan M. 1950. Computing machinery and intelligence. Mind 59(236): 433–460.

    Article  Google Scholar 

  • Tzeng, Jeng-Yi. 2004. Toward a more civilized design: studying the effects of computers that apologize. International Journal of Human-Computer Studies 61(3): 319–345.

    Article  Google Scholar 

  • Utz, Sonja, Uwe Matzat, and Chris Snijders. 2009. On-line reputation systems: The effects of feedback comments and reactions on building and rebuilding trust in on-line auctions. International Journal of Electronic Commerce 13(3): 95–118.

    Google Scholar 

  • von Neumann, J., and O. von Morgenstern. 1944. Theory of games and economic behavior, vol. 1. Princeton: Princeton University Press.

    Google Scholar 

  • Winikoff, Michael. 2007. Implementing commitment-based interactions. In Proceedings of the 6th international joint conference on autonomous agents and multiagent systems, 128. Hawaii: ACM.

    Google Scholar 

  • Wooldridge, Michael, and Nicholas R. Jennings. 1999. The cooperative problem-solving process. Journal of Logic and Computation 9(4): 563–592.

    Google Scholar 

  • Wright, R. 2001. NonZero – The logic of human destiny. New York: Vintage.

    Google Scholar 

Download references

Acknowledgments

Profound thanks are due to our co-authors of joint published work herein cited, without which the personal summing up and specific philosophical viewpoint above would not have been possible at all. Alphabetically: Ari Saptawijaya, Francisco C. Santos, Luis Martinez-Vaquero, and Tom Lenaerts. Furthermore, L. M. Pereira acknowledges support from grant FCT/MEC NOVA LINCS PEst UID/CEC/04516/2013. TAH from Teesside URF funding (11200174).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Luís Moniz Pereira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Fachmedien Wiesbaden GmbH

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Han, T.A., Pereira, L.M. (2018). Evolutionary Machine Ethics. In: Bendel, O. (eds) Handbuch Maschinenethik. Springer Reference Geisteswissenschaften. Springer VS, Wiesbaden. https://doi.org/10.1007/978-3-658-17484-2_15-1

Download citation

  • DOI: https://doi.org/10.1007/978-3-658-17484-2_15-1

  • Received:

  • Accepted:

  • Published:

  • Publisher Name: Springer VS, Wiesbaden

  • Print ISBN: 978-3-658-17484-2

  • Online ISBN: 978-3-658-17484-2

  • eBook Packages: Springer Referenz Sozialwissenschaften und Recht

Publish with us

Policies and ethics