Learning non-linear payoff transformations in multi-agent systems

dc.contributor.authorFraxanet, Emma
dc.date.accessioned2021-12-15T12:17:43Z
dc.date.available2021-12-15T12:17:43Z
dc.date.issued2021-09
dc.descriptionTreball fi de màster de: Master in Intelligent Interactive Systemsca
dc.descriptionTutor: Vicenç Gómez
dc.description.abstractThe use of Deep Reinforcement Learning methodologies has been successful in recent years in cooperative multi-agent systems. However, this success has been mostly empirical and there is a lack of theoretical understanding and solid description of the learning process of those algorithms. The discussion of whether the limitations of these algorithms can be tackled with tuning and optimization or, contrarily, are constrained by their own definition in these models can also easily be put forward. In this work, we propose a theoretical formulation to reproduce one of the claimed limitations of Value Decomposition Networks (VDN), when compared to its improved related model QMIX, regarding their representational capacity. Both of these algorithms follow the centralized-learning-decentralized-execution fashion. For this purpose, we scale down the dimensions of the system to bypass the need for deep learning structures and work with a toy model two-step game and a series of one-shot games that are randomly generated to produce non-linear payoff growth. Despite their simplicity, these settings capture multi-agent challenges such as the scalability problem and the non-unique learning goals. Based on our analytical description, we are also able to formulate a possible alternative solution to this limitation through the use of simple non-linear transformations of the payoff, which sets a possible direction of future work regarding larger scale systems.ca
dc.format.mimetypeapplication/pdf*
dc.identifier.urihttp://hdl.handle.net/10230/49222
dc.language.isoengca
dc.rights© Tots els drets reservatsca
dc.rights.accessRightsinfo:eu-repo/semantics/openAccessca
dc.subject.keywordReinforcement learning
dc.subject.keywordMulti-agent learning
dc.subject.keywordAction-value representation
dc.subject.keywordOne-shot games
dc.titleLearning non-linear payoff transformations in multi-agent systemsca
dc.typeinfo:eu-repo/semantics/masterThesisca

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TFM_Emma_Fraxanet.pdf
Size:
2.08 MB
Format:
Adobe Portable Document Format
Description: