Freire, Ismael T.Moulin-Frier, ClémentSánchez Fibla, MartíArsiwalla, Xerxes D.Verschure, Paul F. M. J.2020-10-132020-10-132020Freire IT, Moulin-Frier C, Sanchez-Fibla M, Arsiwalla XD, Verschure PFMJ. Modeling the formation of social conventions from embodied real-time interactions. PLoS One. 2020 Jun 22;15(6):e0234434. DOI: 10.1371/journal.pone.02344341932-6203http://hdl.handle.net/10230/45465What is the role of real-time control and learning in the formation of social conventions? To answer this question, we propose a computational model that matches human behavioral data in a social decision-making game that was analyzed both in discrete-time and continuous-time setups. Furthermore, unlike previous approaches, our model takes into account the role of sensorimotor control loops in embodied decision-making scenarios. For this purpose, we introduce the Control-based Reinforcement Learning (CRL) model. CRL is grounded in the Distributed Adaptive Control (DAC) theory of mind and brain, where low-level sensorimotor control is modulated through perceptual and behavioral learning in a layered structure. CRL follows these principles by implementing a feedback control loop handling the agent’s reactive behaviors (pre-wired reflexes), along with an Adaptive Layer that uses reinforcement learning to maximize long-term reward. We test our model in a multi-agent game-theoretic task in which coordination must be achieved to find an optimal solution. We show that CRL is able to reach human-level performance on standard game-theoretic metrics such as efficiency in acquiring rewards and fairness in reward distribution.application/pdfeng© 2020 Freire et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.Modeling the formation of social conventions from embodied real-time interactionsinfo:eu-repo/semantics/articlehttp://dx.doi.org/10.1371/journal.pone.0234434LearningBallisticsGame theoryAnimal socialitySensory perceptionHuman performanceVideo gamesDecision makinginfo:eu-repo/semantics/openAccess