ABSTRACT

Understanding how to design agents that sustain cooperation in multi-agent systems has been a long lasting goal in distributed Artificial Intelligence. Solutions proposed rely on identifying defective agents and avoid cooperating or interacting with them. These mechanisms of social control are traditionally studied in games with linear and deterministic payoffs, such as the Prisoner's Dilemma or the Public Goods Game. In reality, however, agents often face dilemmas in which payoffs are uncertain and non-linear, as collective success requires a minimum number of cooperators. These games are called Collective Risk Dilemmas (CRD), and it is unclear whether the previous mechanisms of cooperation remain effective in this case. Here we study cooperation in CRD through partner-based selection. First, we discuss an experiment in which groups of humans and robots play a CRD. We find that people only prefer cooperative partners when they lose a previous game i.e., when collective success was not previously achieved). Secondly, we develop a simplified evolutionary game theoretical model that sheds light on these results, pointing the evolutionary advantages of selecting cooperative partners only when a previous game was lost. We show that this strategy constitutes a convenient balance between strictness (only interact with cooperators) and softness (cooperate and interact with everyone), thus suggesting a new way of designing agents that promote cooperation in CRD.
References
- Daan Bloembergen, Karl Tuyls, Daniel Hennes, and Michael Kaisers. 2015. Evolutionary dynamics of multi-agent learning: a survey. Journal of Artificial Intelligence Research , Vol. 53 (2015), 659--697. Google Scholar
Digital Library
- Xiaojie Chen, Tatsuya Sasaki, Åke Br"annström, and Ulf Dieckmann. 2015. First carrot, then stick: how the adaptive hybridization of incentives promotes cooperation. Journal of the Royal Society Interface , Vol. 12, 102 (2015), 20140935.Google Scholar
- Manuel Chica, Raymond Chiong, Michael Kirley, and Hisao Ishibuchi. 2017. A Networked N-player Trust Game and its Evolutionary Dynamics. IEEE Transactions on Evolutionary Computation (2017).Google Scholar
- Filipa Correia, Samuel Mascarenhas, Samuel Gomes, Patricia Arriaga, Iolanda Leite, Rui Prada, Francisco S. Melo, and Ana Paiva. 2019. Exploring Prosociality in Human-Robot Teams. In Proceedings of the 14th ACM/IEEE International Conference on Human-Robot Interaction (HRI'19). IEEE Press.Google Scholar
- Chad Crawford, Rachna Nanda Kumar, and Sandip Sen. 2018. Resisting Exploitation Through Rewiring in Social Networks: Social Welfare Increase using Parity, Sympathy and Reciprocity. In Proceedings of AAMAS'18. IFAAMAS, 1915--1917. Google Scholar
- Soham De, Dana S Nau, and Michele J Gelfand. 2017. Understanding norm change: An evolutionary game-theoretic approach. In Proceedings of AAMAS'17. IFAAMAS, 1433--1441. Google Scholar
- Elias Fernández Domingos, Juan-Carlos Burguillo, and Tom Lenaerts. 2017. Reactive Versus Anticipative Decision Making in a Novel Gift-Giving Game.. In Proceedings of AAAI'17, Vol. 17. AAAI Press, 4399--4405. Google Scholar
- Katrin Fehl, Daniel J van der Post, and Dirk Semmann. 2011. Co-evolution of behaviour and social network structure promotes human cooperation. Ecology letters , Vol. 14, 6 (2011), 546--551.Google Scholar
- Ernst Fehr and Simon G"achter. 2002. Altruistic punishment in humans. Nature , Vol. 415, 6868 (2002), 137.Google Scholar
- Michael R Genesereth, Matthew L Ginsberg, and Jeffrey S Rosenschein. 1986. Cooperation without communication. Proceedings of AAAI'86 . Elsevier, 51--57. Google Scholar
- Whan Ghang and Martin A Nowak. 2015. Indirect reciprocity with optional interactions. Journal of Theoretical Biology , Vol. 365 (2015), 1--11.Google Scholar
- Herbert Gintis, Samuel Bowles, Robert T Boyd, Ernst Fehr, et almbox. 2005. Moral sentiments and material interests: The foundations of cooperation in economic life. Vol. 6. MIT press.Google Scholar
- Nathan Griffiths. 2008. Tags and image scoring for robust cooperation. In Proceedings of AAMAS'08. IFAAMAS, 575--582. Google Scholar
- Nathan Griffiths and Michael Luck. 2010. Changing neighbours: improving tag-based cooperation. In Proceedings of AAMAS'10. IFAAMAS, 249--256. Google Scholar
- Victoria Groom and Clifford Nass. 2007. Can robots be teammates?: Benchmarks in human--robot teams. Interaction Studies , Vol. 8, 3 (2007), 483--500.Google Scholar
- The Anh Han, Lu'is Moniz Pereira, and Tom Lenaerts. 2017. Evolution of commitment and level of participation in public goods games. Autonomous Agents and Multi-Agent Systems , Vol. 31, 3 (2017), 561--583. Google Scholar
- The Anh Han, Lu'is Moniz Pereira, and Francisco C. Santos. 2012. The Emergence of Commitments and Cooperation. In Proceedings of AAMAS'13 (AAMAS '12). IFAAMAS, Richland, SC, 559--566. Google Scholar
- Christoph Hauert, Silvia De Monte, Josef Hofbauer, and Karl Sigmund. 2002. Volunteering as red queen mechanism for cooperation in public goods games. Science , Vol. 296, 5570 (2002), 1129--1132.Google Scholar
- Tad Hogg. 1995. Social dilemmas in computational ecosystems. In Proceedings of IJCAI'95 . 711--718. Google Scholar
- Nicholas R Jennings, Katia Sycara, and Michael Wooldridge. 1998. A roadmap of agent research and development. Autonomous Agents and Multi-agent Systems , Vol. 1, 1 (1998), 7--38. Google Scholar
- Vidyadhar G Kulkarni. 2016. Modeling and analysis of stochastic systems .Chapman and Hall/CRC. Google Scholar
- Maja J Matarić. 1995. Issues and approaches in the design of collective autonomous agents. Robotics and Autonomous Systems , Vol. 16, 2--4 (1995), 321--331.Google Scholar
- Manfred Milinski, Ralf D Sommerfeld, Hans-Jürgen Krambeck, Floyd A Reed, and Jochem Marotzke. 2008. The collective-risk social dilemma and the prevention of simulated dangerous climate change. Proceedings of the National Academy of Sciences , Vol. 105, 7 (2008), 2291--2294.Google Scholar
- Yishay Mor and Jeffrey S Rosenschein. 1995. Time and the Prisoner's Dilemma.. In Proceedings of ICMAS'95 . 276--282.Google Scholar
- Javier Morales, Michael Wooldridge, Juan A Rodr'iguez-Aguilar, and Maite López-Sánchez. 2018. Off-line synthesis of evolutionarily stable normative systems. Autonomous Agents and Multi-Agent Systems (2018), 1--37. Google Scholar
- Joao A Moreira, Jorge M Pacheco, and Francisco C Santos. 2013. Evolution of collective action in adaptive social structures. Scientific Reports , Vol. 3 (2013), 1521.Google Scholar
- Martin A Nowak. 2006. Evolutionary dynamics .Harvard University Press.Google Scholar
- Martin A Nowak, Karen M Page, and Karl Sigmund. 2000. Fairness versus reason in the ultimatum game. Science , Vol. 289, 5485 (2000), 1773--1775.Google Scholar
- Jason Olejarz, Whan Ghang, and Martin A Nowak. 2015. Indirect reciprocity with optional interactions and private information. Games , Vol. 6, 4 (2015), 438--457.Google Scholar
- Jorge M Pacheco, Francisco C Santos, Max O Souza, and Brian Skyrms. 2009. Evolutionary dynamics of collective action in N-person stag hunt dilemmas. Proceedings of the Royal Society of London B , Vol. 276, 1655 (2009), 315--321.Google Scholar
- Ana Paiva, Fernando P Santos, and Francisco C Santos. 2018. Engineering Pro-Sociality with Autonomous Agents. AAAI'18 (2018), 7994--7999.Google Scholar
- Ana Peleteiro, Juan C Burguillo, and Siang Yew Chong. 2014. Exploring indirect reciprocity in complex networks using coalitions and rewiring. In Proceedings of AAMAS'14. IFAAMAS, 669--676. Google Scholar
- Flavio L Pinheiro, Vitor V Vasconcelos, Francisco C Santos, and Jorge M Pacheco. 2014. Evolution of all-or-none strategies in repeated public goods dilemmas. PLoS Computational Biology , Vol. 10, 11 (2014), e1003945.Google Scholar
- David G Rand and Martin A Nowak. 2011. The evolution of antisocial punishment in optional public goods games. Nature Communications , Vol. 2 (2011), 434.Google Scholar
- David G Rand and Martin A Nowak. 2013. Human cooperation. Trends in Cognitive Sciences , Vol. 17, 8 (2013), 413--425.Google Scholar
- David G Rand, Corina E Tarnita, Hisashi Ohtsuki, and Martin A Nowak. 2013. Evolution of fairness in the one-shot anonymous Ultimatum Game. Proceedings of the National Academy of Sciences , Vol. 110, 7 (2013), 2581--2586.Google Scholar
- Eduardo Ben'itez Sandoval, Jürgen Brandstetter, Mohammad Obaid, and Christoph Bartneck. 2016. Reciprocity in human-robot interaction: a quantitative approach through the prisoner's dilemma and the ultimatum game. International Journal of Social Robotics , Vol. 8, 2 (2016), 303--317.Google Scholar
- Francisco C Santos and Jorge M Pacheco. 2011. Risk of collective failure provides an escape from the tragedy of the commons. Proceedings of the National Academy of Sciences , Vol. 108, 26 (2011), 10421--10425.Google Scholar
- Francisco C Santos, Jorge M Pacheco, and Tom Lenaerts. 2006. Cooperation prevails when individuals adjust their social ties. PLoS Computational Biology , Vol. 2, 10 (2006), e140.Google Scholar
- Fernando P Santos. 2017. Social Norms of Cooperation in Multiagent Systems. Proceedings of AAMAS'17. IFAAMAS, 1859--1860. Google Scholar
- Fernando P Santos, Jorge M Pacheco, Ana Paiva, and Francisco C Santos. 2019. Evolution of collective fairness in hybrid populations of humans and agents. In Proceedings of AAAI'19. AAAI Press.Google Scholar
- Fernando P Santos, Jorge M Pacheco, and Francisco C Santos. 2016a. Evolution of cooperation under indirect reciprocity and arbitrary exploration rates. Scientific Reports , Vol. 6 (2016), 37517.Google Scholar
- Fernando P Santos, Jorge M Pacheco, and Francisco C Santos. 2018a. Social Norms of Cooperation with Costly Reputation Building. In Proceedings of AAAI'18 . AAAI Press, 4727--4734.Google Scholar
- Fernando P Santos, Francisco C Santos, Francisco S Melo, Ana Paiva, and Jorge M Pacheco. 2016c. Dynamics of fairness in groups of autonomous learning agents. In International Conference on Autonomous Agents and Multiagent Systems (workshops' best papers book) . Springer, 107--126.Google Scholar
- Fernando P Santos, Francisco C Santos, and Jorge M Pacheco. 2016b. Social norms of cooperation in small-scale societies. PLoS Computational Biology , Vol. 12, 1 (2016), e1004709.Google Scholar
- Fernando P Santos, Francisco C Santos, and Jorge M Pacheco. 2018b. Social norm complexity and past reputations in the evolution of cooperation. Nature , Vol. 555, 7695 (2018), 242.Google Scholar
- Sandip Sen et almbox. 2018. The Effects of Past Experience on Trust in Repeated Human-Agent Teamwork. In Proceedings of AAMAS'18 . IFAAMAS, 514--522. Google Scholar
- Pedro Sequeira, Francisco S Melo, and Ana Paiva. 2011. Emotion-based intrinsic motivation for reinforcement learning agents. In International Conference on Affective Computing and Intelligent Interaction. Springer, 326--336. Google Scholar
- Karl Sigmund. 2010. The calculus of selfishness .Princeton University Press.Google Scholar
- J Maynard Smith and George R Price. 1973. The logic of animal conflict. Nature , Vol. 246, 5427 (1973), 15.Google Scholar
- Alessandro Tavoni, Astrid Dannenberg, Giorgos Kallis, and Andreas Löschel. 2011. Inequality, communication, and the avoidance of disastrous climate change in a public goods game. Proceedings of the National Academy of Sciences , Vol. 108, 29 (2011), 11825--11829.Google Scholar
- Arne Traulsen, Christoph Hauert, Hannelore De Silva, Martin A Nowak, and Karl Sigmund. 2009. Exploration dynamics in evolutionary games. Proceedings of the National Academy of Sciences , Vol. 106, 3 (2009), 709--712.Google Scholar
- Arne Traulsen, Martin A Nowak, and Jorge M Pacheco. 2006. Stochastic dynamics of invasion and fixation. Physical Review E , Vol. 74, 1 (2006), 011909.Google Scholar
- Karl Tuyls and Ann Nowé. 2005. Evolutionary game theory and multi-agent reinforcement learning. The Knowledge Engineering Review , Vol. 20, 1 (2005), 63--90. Google Scholar
- Karl Tuyls and Simon Parsons. 2007. What evolutionary game theory tells us about multiagent learning. Artificial Intelligence , Vol. 171, 7 (2007), 406--416. Google Scholar
- Sven Van Segbroeck, Jorge M Pacheco, Tom Lenaerts, and Francisco C Santos. 2012. Emergence of fairness in repeated group interactions. Physical Review Letters , Vol. 108, 15 (2012), 158104.Google Scholar
- Vitor V Vasconcelos, Francisco C Santos, and Jorge M Pacheco. 2013. A bottom-up institutional approach to cooperative governance of risky commons. Nature Climate Change , Vol. 3, 9 (2013), 797.Google Scholar
- V'itor V Vasconcelos, Fernando P Santos, Francisco C Santos, and Jorge M Pacheco. 2017. Stochastic dynamics through hierarchically embedded Markov chains. Physical Review Letters , Vol. 118, 5 (2017), 058301.Google Scholar
- Jörgen W Weibull. 1997. Evolutionary game theory .MIT press.Google Scholar
- Gerhard Weiss. 1999. Multiagent systems: a modern approach to distributed artificial intelligence .MIT press. Google Scholar
- Ioannis Zisis, Sibilla Di Guida, TA Han, Georg Kirchsteiger, and Tom Lenaerts. 2015. Generosity motivated by acceptance-evolutionary analysis of an anticipation game. Scientific Reports , Vol. 5 (2015), 18076.Google Scholar
Index Terms
Outcome-based Partner Selection in Collective Risk Dilemmas

Filipa Correia


Comments