Benvinguts al Repositori Digital de la UPF

Cooperation is the rule, not the exception: reinforcement learning in the Battle of the Sexes

Mostra el registre parcial de l'element

dc.contributor.author Puig Camps, Bernat
dc.date.accessioned 2018-12-17T09:38:41Z
dc.date.available 2018-12-17T09:38:41Z
dc.date.issued 2018-09
dc.identifier.uri http://hdl.handle.net/10230/36102
dc.description Treball fi de màster de: Master in Intelligent Interactive Systems
dc.description Tutors: Vicenç Gómez Cerdà i Martí Sanchez Fibla
dc.description.abstract Society is highly influenced by conventions, which are a form of cooperation. In many situations, individuals act together for the benefit of the group. This phenomenon is easy to understand when all individuals share the same interest. However, when there exists conflict, it is not clear if altruism is required or pure self-interest can lead to cooperation. The repeated version of the Battle of the Sexes game can summarize this situation. Although conflict is present, players need to cooperate to obtain good rewards. Here we show experimentally that two selfish reinforcement learning agents learn to cooperate in this conflictive scenario. We found that two Q-learning agents playing this game modeled as a Markov Game reach a cooperative fair solution. That is, two agents that learn based solely on their own self-interest end up cooperating. Furthermore, we found that Q-learning is able to converge in this multi-agent situation. Our results demonstrate that cooperation among individuals in this particular conflictive scenario can be explained by means of pure self-interest. Moreover, cooperation in this setting is the rule, not the exception as the convergence to it is robust to parameter asymmetry between agents. We also introduced opponent modeling into the players as a Beta binomial model. It worked well in modeling the adversary but agents fail to properly exploit that knowledge.
dc.format.mimetype application/pdf
dc.language.iso eng
dc.rights Atribución-NoComercial-SinDerivadas 3.0 España
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.title Cooperation is the rule, not the exception: reinforcement learning in the Battle of the Sexes
dc.type info:eu-repo/semantics/masterThesis
dc.subject.keyword Cooperation
dc.subject.keyword Selfish agents
dc.subject.keyword Q-Learning
dc.subject.keyword Game Theory
dc.subject.keyword Opponent modeling
dc.rights.accessRights info:eu-repo/semantics/openAccess

Thumbnail

Aquest element apareix en la col·lecció o col·leccions següent(s)

Mostra el registre parcial de l'element

Cerca


Cerca avançada

Visualitza

El meu compte

Estadístiques

Amb col·laboració de Complim Participem